Frederik Ferner
2010-Mar-04 11:21 UTC
[Lustre-discuss] problems restoring from MDT backup (test file system)
Hi List,

on my test file system I'm currently trying to verify once more that I can restore the MDT from my backups, but I'm running into some problems. The MDS is running RHEL5.3 and Lustre 1.6.7.2.

I use the following procedure to back up the MDT:

    create LVM snapshot from MDT device
    mount LVM snapshot as ldiskfs
    extract EAs with getfattr
    tar up the whole MDT tree from ldiskfs using
      '/bin/tar czSf /tmp/${BACKUP_FILE} --acls --numeric-owner .'

In previous tests the restore appeared to work fine, even though I was not quite sure about the ACLs, as I had not recorded them before destroying the MDT at that time.

This time I have a problem with the tar file when I try to verify the list of files before destroying the MDT:

<snip>
tar tizf test_MDT_Backup.tar.gz
....
./ROOT/tmp/frederik/cs04r-sc-com02-04/
./ROOT/tmp/frederik/cs04r-sc-com02-04/iozone.DUMMY.47
tar: Unexpected EOF in archive
tar: Error is not recoverable: exiting now
</snip>

Older backup files and backups from our production file system don't show this error, but on the current test file system I can reproduce it easily with any new backup file that I create.

The list of files that I see when creating a new tar file from the test file system (adding -v to the tar options) does include many files after the last one in the output above, but the new file has exactly the same problem at the same place.

Has anyone seen something like this before? What could we try to recover the data from the old backup? Or is this most likely impossible? Could this indicate a problem on the file system? I've not tried to run fsck on the MDT, as I'd like to extract the files from the tar files if possible, independent of fixing the existing file system, as an exercise.

Any suggestions are welcome.

Frederik

--
Frederik Ferner
Computer Systems Administrator    phone: +44 1235 77 8624
Diamond Light Source Ltd.         mob:   +44 7917 08 5110
(Apologies in advance for the lines below. Some bits are a legal requirement and I have no control over them.)
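For readers who want to follow along, the four backup steps in this message can be sketched as a small shell script. This is only a sketch under stated assumptions: the volume group, logical volume, snapshot name and size, mount point, and the EA dump path are hypothetical placeholders, and the getfattr options are an assumption, since the post does not quote them. Only the tar command line is taken from the post. By default the script prints the commands instead of running them.

```shell
#!/bin/sh
# Sketch of the backup steps described in the post. All names below
# (volume group, LV, snapshot name/size, mount point, EA dump path)
# are hypothetical placeholders, not taken from the original message.
VG=${VG:-vg_mds}
LV=${LV:-mdt}
SNAP=${SNAP:-mdt_snap}
SNAP_SIZE=${SNAP_SIZE:-1G}
MNT=${MNT:-/mnt/mdt_snap}
BACKUP_FILE=${BACKUP_FILE:-test_MDT_Backup.tar.gz}
DRY_RUN=${DRY_RUN:-1}    # 1 = only print the commands, 0 = execute them

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = 1 ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

backup_mdt() {
    # 1. create an LVM snapshot of the MDT device
    run lvcreate -s -L "$SNAP_SIZE" -n "$SNAP" "/dev/$VG/$LV" || return 1
    # 2. mount the snapshot as ldiskfs
    run mkdir -p "$MNT" || return 1
    run mount -t ldiskfs "/dev/$VG/$SNAP" "$MNT" || return 1
    # 3. dump the extended attributes (getfattr options are an assumption)
    run sh -c "cd '$MNT' && getfattr -R -d -m '.*' -e hex -P . > /tmp/ea.bak"
    # 4. tar up the tree with the command line quoted in the post
    run sh -c "cd '$MNT' && /bin/tar czSf '/tmp/$BACKUP_FILE' --acls --numeric-owner ."
    # clean up: unmount and drop the snapshot again
    run umount "$MNT"
    run lvremove -f "/dev/$VG/$SNAP"
}

backup_mdt
```

Running it as is only lists the commands; setting DRY_RUN=0 (as root, with real device names) would execute them.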
Brian J. Murrell
2010-Mar-04 12:22 UTC
[Lustre-discuss] problems restoring from MDT backup (test file system)
On Thu, 2010-03-04 at 11:21 +0000, Frederik Ferner wrote:
> tar tizf test_MDT_Backup.tar.gz
> ....
> ./ROOT/tmp/frederik/cs04r-sc-com02-04/
> ./ROOT/tmp/frederik/cs04r-sc-com02-04/iozone.DUMMY.47
> tar: Unexpected EOF in archive
> tar: Error is not recoverable: exiting now
> </snip>

Looks to me like either your tar executable is broken or your archive is broken. A typical process of elimination should help you discover which is the case.

b.
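One way to start such a process of elimination is to test the gzip layer and the tar layer of the archive separately, and to repeat the tar step with a second tar binary. The helper below is a minimal sketch, not something from the thread: the function name check_archive and the TAR variable are made up for illustration, and it only tells you which layer fails first, not why.

```shell
# check_archive FILE
# Reports which layer of a .tar.gz fails first. Set TAR to try a
# different tar executable (TAR is a hypothetical knob for this sketch).
TAR=${TAR:-tar}

check_archive() {
    f=$1
    if [ ! -r "$f" ]; then
        echo "cannot read $f"
        return 3
    fi
    # gzip -t verifies the compressed stream without involving tar at all.
    if ! gzip -t "$f" 2>/dev/null; then
        echo "gzip layer damaged"
        return 1
    fi
    # The gzip stream is intact, so any failure now comes from the tar
    # stream itself or from the tar executable reading it.
    if ! gzip -dc "$f" | "$TAR" -tf - >/dev/null 2>&1; then
        echo "tar layer damaged"
        return 2
    fi
    echo "ok"
}
```

For example, `check_archive test_MDT_Backup.tar.gz` followed by `TAR=/path/to/another/tar check_archive test_MDT_Backup.tar.gz` (the second path being a placeholder) separates a damaged file from a misbehaving reader.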
Frederik Ferner
2010-Mar-04 12:46 UTC
[Lustre-discuss] problems restoring from MDT backup (test file system)
Brian,

thanks for your reply.

Brian J. Murrell wrote:
> On Thu, 2010-03-04 at 11:21 +0000, Frederik Ferner wrote:
>> tar tizf test_MDT_Backup.tar.gz
>> ....
>> ./ROOT/tmp/frederik/cs04r-sc-com02-04/
>> ./ROOT/tmp/frederik/cs04r-sc-com02-04/iozone.DUMMY.47
>> tar: Unexpected EOF in archive
>> tar: Error is not recoverable: exiting now
>> </snip>
>
> Looks to me like either your tar executable is broken or your archive is
> broken. A typical process of elimination should help you discover which
> is the case.

It certainly looks like it's the tar archive that is broken: I get the same error when I copy it over to a different machine. Unless it is the tar executable that is broken and creates the broken archive, since every new archive I create seems to be broken at the same place.

Other tar files created on the same machine don't have that problem, but I'll try creating a new archive with a new executable.

Thanks,
Frederik
Andreas Dilger
2010-Mar-05 10:08 UTC
[Lustre-discuss] problems restoring from MDT backup (test file system)
On 2010-03-04, at 05:46, Frederik Ferner wrote:
> Brian J. Murrell wrote:
>> On Thu, 2010-03-04 at 11:21 +0000, Frederik Ferner wrote:
>>> tar tizf test_MDT_Backup.tar.gz
>>> ....
>>> ./ROOT/tmp/frederik/cs04r-sc-com02-04/
>>> ./ROOT/tmp/frederik/cs04r-sc-com02-04/iozone.DUMMY.47
>>> tar: Unexpected EOF in archive
>>> tar: Error is not recoverable: exiting now
>>> </snip>
>>
>> Looks to me like either your tar executable is broken or your archive is
>> broken. A typical process of elimination should help you discover which
>> is the case.
>
> It certainly looks like it's the tar archive that is broken. I get the
> same when I copy it over to a different machine. Unless it is the tar
> executable that is broken so that it creates the broken archive, as every
> time I create a new archive it seems to be broken at the same place.
>
> Other tar files created on the same machine don't have that problem, but
> I'll try creating a new archive with a new executable.

Make sure you use "--sparse" so that tar isn't mistakenly creating huge archives full of zeroes.

Cheers, Andreas
--
Andreas Dilger
Sr. Staff Engineer, Lustre Group
Sun Microsystems of Canada, Inc.
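The effect of "--sparse" can be seen with a small experiment. The sketch below assumes GNU tar and GNU coreutils (for truncate); the function name, file names, and the 16 MiB size are made up for illustration. Note that with GNU tar the S in the 'czSf' option string from the first message is the short form of --sparse.

```shell
# Compare archive sizes with and without --sparse (GNU tar assumed).
# Leaves the two sizes in plain_size and sparse_size.
sparse_demo() {
    dir=$(mktemp -d) || return 1
    # A 16 MiB file that consists of a single hole (no data blocks).
    truncate -s 16M "$dir/hole.img"
    # Without --sparse, tar stores every zero byte of the hole.
    tar -cf "$dir/plain.tar" -C "$dir" hole.img
    # With --sparse, tar records the hole instead of writing the zeroes.
    tar -c --sparse -f "$dir/sparse.tar" -C "$dir" hole.img
    plain_size=$(wc -c < "$dir/plain.tar")
    sparse_size=$(wc -c < "$dir/sparse.tar")
    echo "without --sparse: $plain_size bytes"
    echo "with    --sparse: $sparse_size bytes"
    rm -rf "$dir"
}

sparse_demo
```

The two archives here are uncompressed so that the size difference is visible directly; with gzip on top, long runs of zeroes compress well, but tar still has to read and write them, and restoring without --sparse would allocate them on disk.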