Maurilio Longo
2009-Jun-12 14:05 UTC
[zfs-discuss] fmd writes tons of errors during a resilver
Hi, I''m trying to expand a raidz pool made up of six drives replacing one at a time with a bigger disk and waiting for resilver on a snv114 system. While resilvering fmd writes 8/10Mb each second inside /var/fm/fmd/errlog I had to disable it since it was filling up my boot disk. Is this expected? # zpool status pool: nas stato: DEGRADED condizione: viene eseguita la risincronizzazione di uno o pi? dispositivi. Il pool continuer? a funzionare normalmente oppure in stato degradato. azione: attendere la fine della risincronizzazione. scrub: resilver in progress for 6h52m, 79,82% done, 1h44m to go config: NAME STATE READ WRITE CKSUM nas DEGRADED 0 0 0 raidz1 DEGRADED 0 0 0 c2t1d0 ONLINE 0 0 0 replacing DEGRADED 0 0 7,58M c2t4d0s0/o FAULTED 0 0 0 dati danneggiati c2t4d0 ONLINE 0 0 0 141G resilvered c2t5d0 ONLINE 0 0 0 c2t3d0 ONLINE 0 0 0 c2t2d0 ONLINE 0 0 0 c2t0d0 ONLINE 0 0 0 errori: nessun errore di dati rilevato Here c1t0d0 is the boot disk (still on UFS) extended device statistics r/s w/s kr/s kw/s wait actv wsvc_t asvc_t %w %b device 0,0 3711,7 0,0 9661,0 0,0 0,6 0,0 0,2 2 60 c1t0d0 452,1 0,0 9707,5 0,0 0,4 0,2 0,9 0,4 13 20 c2t0d0 464,1 0,0 9785,0 0,0 0,3 0,2 0,6 0,4 11 18 c2t1d0 467,1 0,0 9695,0 0,0 0,4 0,2 0,8 0,4 12 19 c2t2d0 445,1 0,0 9743,0 0,0 0,4 0,2 0,9 0,4 14 19 c2t3d0 236,0 309,1 9842,1 9279,9 24,0 1,0 44,0 1,8 99 99 c2t4d0 420,1 0,0 9682,5 0,0 0,4 0,2 1,0 0,5 14 20 c2t5d0 cpu us sy wt id 12 32 0 55 extended device statistics r/s w/s kr/s kw/s wait actv wsvc_t asvc_t %w %b device 0,0 3992,5 0,0 10281,7 0,0 0,6 0,0 0,2 2 63 c1t0d0 318,0 0,0 6042,7 0,0 0,3 0,2 1,1 0,7 13 22 c2t0d0 307,0 0,0 6129,7 0,0 0,4 0,2 1,4 0,8 15 24 c2t1d0 315,0 0,0 6137,7 0,0 0,4 0,2 1,2 0,7 17 22 c2t2d0 351,0 0,0 6020,2 0,0 0,4 0,2 1,0 0,6 15 21 c2t3d0 223,0 273,0 6453,7 6134,7 25,8 1,0 52,0 2,0 96 97 c2t4d0 346,0 0,0 5988,2 0,0 0,3 0,2 0,8 0,5 11 17 c2t5d0 in a few seconds it wrote # svcadm disable fmd # ls -l /var/fm/fmd/ totale 55934 drwx------ 3 root sys 512 10 apr 17:41 ckpt -rw-r--r-- 1 root root 28599664 12 giu 15:56 errlog -rw-r--r-- 1 root root 3410 12 giu 15:55 fltlog drwx------ 2 root sys 512 12 giu 15:55 rsrc drwx------ 2 root sys 512 13 dic 2007 xprt the content of the file is not printable. Maurilio. -- This message posted from opensolaris.org
Eric Schrock
2009-Jun-12 19:40 UTC
[zfs-discuss] fmd writes tons of errors during a resilver
On 06/12/09 07:05, Maurilio Longo wrote:> Hi, > > I''m trying to expand a raidz pool made up of six drives replacing one at a time with a bigger disk and waiting for resilver on a snv114 system.This is: 6807339 spurious checksum errors when replacing a vdev - Eric> > While resilvering fmd writes 8/10Mb each second inside > > /var/fm/fmd/errlog > > I had to disable it since it was filling up my boot disk. > > Is this expected? > > # zpool status > pool: nas > stato: DEGRADED > condizione: viene eseguita la risincronizzazione di uno o pi? dispositivi. Il pool > continuer? a funzionare normalmente oppure in stato degradato. > azione: attendere la fine della risincronizzazione. > scrub: resilver in progress for 6h52m, 79,82% done, 1h44m to go > config: > > NAME STATE READ WRITE CKSUM > nas DEGRADED 0 0 0 > raidz1 DEGRADED 0 0 0 > c2t1d0 ONLINE 0 0 0 > replacing DEGRADED 0 0 7,58M > c2t4d0s0/o FAULTED 0 0 0 dati danneggiati > c2t4d0 ONLINE 0 0 0 141G resilvered > c2t5d0 ONLINE 0 0 0 > c2t3d0 ONLINE 0 0 0 > c2t2d0 ONLINE 0 0 0 > c2t0d0 ONLINE 0 0 0 > > errori: nessun errore di dati rilevato > > Here c1t0d0 is the boot disk (still on UFS) > > extended device statistics > r/s w/s kr/s kw/s wait actv wsvc_t asvc_t %w %b device > 0,0 3711,7 0,0 9661,0 0,0 0,6 0,0 0,2 2 60 c1t0d0 > 452,1 0,0 9707,5 0,0 0,4 0,2 0,9 0,4 13 20 c2t0d0 > 464,1 0,0 9785,0 0,0 0,3 0,2 0,6 0,4 11 18 c2t1d0 > 467,1 0,0 9695,0 0,0 0,4 0,2 0,8 0,4 12 19 c2t2d0 > 445,1 0,0 9743,0 0,0 0,4 0,2 0,9 0,4 14 19 c2t3d0 > 236,0 309,1 9842,1 9279,9 24,0 1,0 44,0 1,8 99 99 c2t4d0 > 420,1 0,0 9682,5 0,0 0,4 0,2 1,0 0,5 14 20 c2t5d0 > cpu > us sy wt id > 12 32 0 55 > extended device statistics > r/s w/s kr/s kw/s wait actv wsvc_t asvc_t %w %b device > 0,0 3992,5 0,0 10281,7 0,0 0,6 0,0 0,2 2 63 c1t0d0 > 318,0 0,0 6042,7 0,0 0,3 0,2 1,1 0,7 13 22 c2t0d0 > 307,0 0,0 6129,7 0,0 0,4 0,2 1,4 0,8 15 24 c2t1d0 > 315,0 0,0 6137,7 0,0 0,4 0,2 1,2 0,7 17 22 c2t2d0 > 351,0 0,0 6020,2 0,0 0,4 0,2 1,0 0,6 15 21 c2t3d0 > 223,0 273,0 6453,7 6134,7 25,8 1,0 52,0 2,0 96 97 c2t4d0 > 346,0 0,0 5988,2 0,0 0,3 0,2 0,8 0,5 11 17 c2t5d0 > > in a few seconds it wrote > > # svcadm disable fmd > # ls -l /var/fm/fmd/ > totale 55934 > drwx------ 3 root sys 512 10 apr 17:41 ckpt > -rw-r--r-- 1 root root 28599664 12 giu 15:56 errlog > -rw-r--r-- 1 root root 3410 12 giu 15:55 fltlog > drwx------ 2 root sys 512 12 giu 15:55 rsrc > drwx------ 2 root sys 512 13 dic 2007 xprt > > the content of the file is not printable. > > Maurilio.-- Eric Schrock, Fishworks http://blogs.sun.com/eschrock