Albert Chin
2009-Aug-24 18:34 UTC
[zfs-discuss] Why so many data errors with raidz2 config and one failing drive?
Added a third raidz2 vdev to my pool:
pool: tww
state: DEGRADED
status: One or more devices has experienced an error resulting in data
corruption. Applications may be affected.
action: Restore the file in question if possible. Otherwise restore the
entire pool from backup.
see: http://www.sun.com/msg/ZFS-8000-8A
scrub: resilver in progress for 0h57m, 13.36% done, 6h9m to go
config:
NAME STATE READ
WRITE CKSUM
tww DEGRADED 0
0 16.9K
raidz2 ONLINE 0
0 0
c6t600A0B8000299966000005964668CB39d0 ONLINE 0
0 0
c6t600A0B8000299CCC000006C84744C892d0 ONLINE 0
0 0
c6t600A0B8000299CCC000005B44668CC6Ad0 ONLINE 0
0 0
c6t600A0B8000299966000005A44668CC3Fd0 ONLINE 0
0 0
c6t600A0B8000299CCC000005BA4668CD2Ed0 ONLINE 0
0 0
c6t600A0B8000299966000005AA4668CDB1d0 ONLINE 0
0 0
c6t600A0B80002999660000073547C5CED9d0 ONLINE 0
0 0
raidz2 ONLINE 0
0 0
c6t600A0B8000299966000005B04668F17Dd0 ONLINE 0
0 0
c6t600A0B8000299CCC0000099E4A400B94d0 ONLINE 0
0 0
c6t600A0B8000299966000005B64668F26Fd0 ONLINE 0
0 0
c6t600A0B8000299CCC000005CC4668F30Ed0 ONLINE 0
0 0
c6t600A0B8000299966000005BC4668F305d0 ONLINE 0
0 0
c6t600A0B8000299CCC0000099B4A400A9Cd0 ONLINE 0
0 0
c6t600A0B8000299966000005C24668F39Bd0 ONLINE 0
0 0
raidz2 DEGRADED 0
0 34.0K
c6t600A0B8000299CCC00000A154A89E426d0 ONLINE 0
0 0
c6t600A0B8000299966000009F74A89E1A5d0 ONLINE 0
0 7 4K resilvered
c6t600A0B8000299CCC00000A174A89E520d0 ONLINE 0
0 2 4K resilvered
c6t600A0B8000299966000009F94A89E24Bd0 ONLINE 0
0 48 24.5K resilvered
replacing DEGRADED 0
0 78.7K
c6t600A0B8000299CCC00000A194A89E634d0 UNAVAIL 20
277K 0 experienced I/O failures
c6t600A0B8000299966000009EE4A89DA51d0 ONLINE 0
0 0 38.1M resilvered
c6t600A0B8000299CCC00000A0C4A89DDE8d0 ONLINE 0
0 6 6K resilvered
c6t600A0B8000299966000009F04A89DB1Bd0 ONLINE 0
0 86 92K resilvered
spares
c6t600A0B8000299CCC000005D84668F448d0 AVAIL
c6t600A0B8000299966000005C84668F461d0 AVAIL
errors: 17097 data errors, use ''-v'' for a list
Seems some of the new drives are having problems, resulting in CKSUM
errors. I don''t understand why I have so many data errors though. Why
does the third raidz2 vdev report 34.0K CKSUM errors?
The number of data errors appears to be increasing as well as the
resilver process continues.
--
albert chin (china at thewrittenword.com)
Bob Friesenhahn
2009-Aug-24 19:01 UTC
[zfs-discuss] Why so many data errors with raidz2 config and one failing drive?
On Mon, 24 Aug 2009, Albert Chin wrote:> > Seems some of the new drives are having problems, resulting in CKSUM > errors. I don''t understand why I have so many data errors though. Why > does the third raidz2 vdev report 34.0K CKSUM errors?Is it possible that this third raidz2 is inflicted with a shared problem such as a cable, controller, backplane, or power supply? Only one drive is reported as being unscathed.> The number of data errors appears to be increasing as well as the > resilver process continues.Bad news. Do you periodically scrub your array? Bob -- Bob Friesenhahn bfriesen at simple.dallas.tx.us, http://www.simplesystems.org/users/bfriesen/ GraphicsMagick Maintainer, http://www.GraphicsMagick.org/
Albert Chin
2009-Aug-24 19:12 UTC
[zfs-discuss] Why so many data errors with raidz2 config and one failing drive?
On Mon, Aug 24, 2009 at 02:01:39PM -0500, Bob Friesenhahn wrote:> On Mon, 24 Aug 2009, Albert Chin wrote: >> >> Seems some of the new drives are having problems, resulting in CKSUM >> errors. I don''t understand why I have so many data errors though. Why >> does the third raidz2 vdev report 34.0K CKSUM errors? > > Is it possible that this third raidz2 is inflicted with a shared > problem such as a cable, controller, backplane, or power supply? Only > one drive is reported as being unscathed.Well, we''re just using unused drives on the existing array. No other changes.> Do you periodically scrub your array?No. Guess we will now :) But, I think all of the data loss is a result of the new drives, not ones that were already part of the two previous vdevs. -- albert chin (china at thewrittenword.com)