Hi , Im trying to figure out what is the best way to recover a failed OST , basicly we have 10 OST''s each has DRBD + HA on top of raid 6 so its kind of redundent and suppose to be solid just want to notice for the other post that asked of that configuration that its working ok and the performance is fairly ok considering that redundency is more important then full speed of the cluster at list in this case . Regarding the backup strategy , we make a client backups to tapes of all the important stuff and also a seperate backup of the OST files only to a USB HD (daily on each OST) that backup is made possible by mounting the OST with -t ldiskfs insted of lustre the by running rsync to the USB HD so the main thing i dont understand is if an OST failed as in hardware problem then to avoid full file system recovery from tapes there is a need to restore the OST only data from the USB drive or tapes to the new OST , then the lustre procedure e.g e2fsck -n -v --mdsdb /tmp/ostdb /dev/{ostdev} on all OST''s then lfsck -n -v ............. /mnt/mainfs the only things i see possibly is to write zero holes on files that were changed for example seens the last backup of the OST file system itslef with rsync to the USB drive or tapes so baicly what will happen to a mysql table file that has inconcitency on its tripes how is possible to restore it the best way possible , i realize that it must suffer some kind of data lose but its better then loading the entire lustre file system backup wich will take days is some cases . Thanks for any help . ---------------------------------------------------------- Outgoing messages are virus free checked by NOD32 system -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.lustre.org/pipermail/lustre-discuss/attachments/20080220/aadda6a0/attachment-0002.html