RAIDz set, lost a disk, replaced it... lost another disk during resilver.
Replaced it, ran another resilver, and now it shows all disks with too many
errors.
Safe to say this is getting rebuilt and restored, or is there hope to recover
some of the data? I assume this is the case because rpool/filemover has errors,
is that fixable?
# zpool status -v
pool: rpool
state: DEGRADED
status: One or more devices has experienced an error resulting in data
corruption. Applications may be affected.
action: Restore the file in question if possible. Otherwise restore the
entire pool from backup.
see: http://www.sun.com/msg/ZFS-8000-8A
scrub: resilver completed after 4h51m with 190449 errors on Sat Aug 25 05:45:12
2012
config:
NAME STATE READ WRITE CKSUM
rpool DEGRADED 455K 0 0
raidz1 DEGRADED 455K 0 0
c3t0d0 DEGRADED 0 0 0 too many errors
c2t1d0 DEGRADED 0 0 0 too many errors
replacing UNAVAIL 0 0 0 insufficient replicas
c2t0d0s0/o FAULTED 0 0 0 too many errors
c2t0d0 FAULTED 0 0 0 too many errors
c3t1d0 DEGRADED 0 0 0 too many errors
c4t0d0 DEGRADED 0 0 0 too many errors
c4t1d0 DEGRADED 0 0 0 too many errors
errors: Permanent errors have been detected in the following files:
rpool/filemover:<0x1>
# zfs list
NAME USED AVAIL REFER MOUNTPOINT
rpool 6.64T 0 29.9K /rpool
rpool/filemover 6.64T 323G 6.32T -
Thanks
Mark
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://mail.opensolaris.org/pipermail/zfs-discuss/attachments/20120827/b0ce3781/attachment.html>
Sašo Kiselkov
2012-Aug-27 19:41 UTC
[zfs-discuss] Zpool recovery after too many failed disks
On 08/27/2012 09:02 PM, Mark Wolek wrote:> RAIDz set, lost a disk, replaced it... lost another disk during resilver. Replaced it, ran another resilver, and now it shows all disks with too many errors. > > Safe to say this is getting rebuilt and restored, or is there hope to recover some of the data? I assume this is the case because rpool/filemover has errors, is that fixable?It seems you fell into the standard two-disk failure mode during resilver. If this is really the case, it seems like your pool is lost, because raidz works by treating each block as a single raidz stripe and spreading it over the component devices - as a result, most of your blocks will probably be missing some data. You can try and retrieve as much data from the pool as possible (via something like rsync or tar), though I''m not exactly certain how well (or whether at all) that will work. -- Saso