Maurilio Longo
2009-Jun-12 14:05 UTC
[zfs-discuss] fmd writes tons of errors during a resilver
Hi,
I'm trying to expand a raidz pool made up of six drives by replacing one
drive at a time with a bigger disk and waiting for the resilver, on a snv114 system.
While resilvering, fmd writes 8 to 10 MB each second to
/var/fm/fmd/errlog.
I had to disable it since it was filling up my boot disk.
Is this expected?
# zpool status
  pool: nas
 state: DEGRADED
status: One or more devices is currently being resilvered.  The pool will
        continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
 scrub: resilver in progress for 6h52m, 79,82% done, 1h44m to go
config:

        NAME              STATE     READ WRITE CKSUM
        nas               DEGRADED     0     0     0
          raidz1          DEGRADED     0     0     0
            c2t1d0        ONLINE       0     0     0
            replacing     DEGRADED     0     0 7,58M
              c2t4d0s0/o  FAULTED      0     0     0  corrupted data
              c2t4d0      ONLINE       0     0     0  141G resilvered
            c2t5d0        ONLINE       0     0     0
            c2t3d0        ONLINE       0     0     0
            c2t2d0        ONLINE       0     0     0
            c2t0d0        ONLINE       0     0     0

errors: No known data errors
Here c1t0d0 is the boot disk (still on UFS):

                    extended device statistics
    r/s    w/s    kr/s    kw/s wait actv wsvc_t asvc_t  %w  %b device
    0,0 3711,7     0,0  9661,0  0,0  0,6    0,0    0,2   2  60 c1t0d0
  452,1    0,0  9707,5     0,0  0,4  0,2    0,9    0,4  13  20 c2t0d0
  464,1    0,0  9785,0     0,0  0,3  0,2    0,6    0,4  11  18 c2t1d0
  467,1    0,0  9695,0     0,0  0,4  0,2    0,8    0,4  12  19 c2t2d0
  445,1    0,0  9743,0     0,0  0,4  0,2    0,9    0,4  14  19 c2t3d0
  236,0  309,1  9842,1  9279,9 24,0  1,0   44,0    1,8  99  99 c2t4d0
  420,1    0,0  9682,5     0,0  0,4  0,2    1,0    0,5  14  20 c2t5d0
     cpu
 us sy wt id
 12 32  0 55
                    extended device statistics
    r/s    w/s    kr/s    kw/s wait actv wsvc_t asvc_t  %w  %b device
    0,0 3992,5     0,0 10281,7  0,0  0,6    0,0    0,2   2  63 c1t0d0
  318,0    0,0  6042,7     0,0  0,3  0,2    1,1    0,7  13  22 c2t0d0
  307,0    0,0  6129,7     0,0  0,4  0,2    1,4    0,8  15  24 c2t1d0
  315,0    0,0  6137,7     0,0  0,4  0,2    1,2    0,7  17  22 c2t2d0
  351,0    0,0  6020,2     0,0  0,4  0,2    1,0    0,6  15  21 c2t3d0
  223,0  273,0  6453,7  6134,7 25,8  1,0   52,0    2,0  96  97 c2t4d0
  346,0    0,0  5988,2     0,0  0,3  0,2    0,8    0,5  11  17 c2t5d0
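
As a rough cross-check of the write rate on the boot disk, here is a minimal Python sketch that reads the kw/s column from iostat rows printed with comma decimal separators and converts it to MB/s. The SAMPLE text, the function names, and the 1024 kB per MB conversion are assumptions made for illustration; the rows are copied from the two samples above.

```python
# Sketch: convert iostat -x rows (comma decimal separators, as printed under
# an Italian locale) into write throughput in MB/s per device.
# Assumption: kw/s is kilobytes written per second and 1 MB = 1024 kB.

SAMPLE = """\
0,0 3711,7 0,0 9661,0 0,0 0,6 0,0 0,2 2 60 c1t0d0
236,0 309,1 9842,1 9279,9 24,0 1,0 44,0 1,8 99 99 c2t4d0
0,0 3992,5 0,0 10281,7 0,0 0,6 0,0 0,2 2 63 c1t0d0
223,0 273,0 6453,7 6134,7 25,8 1,0 52,0 2,0 96 97 c2t4d0
"""


def parse_row(line):
    """Split one data row into (device, list of numeric columns)."""
    fields = line.split()
    device = fields[-1]  # the device name is the last column
    values = [float(f.replace(",", ".")) for f in fields[:-1]]
    return device, values


def write_rates_mb(text):
    """Return {device: [MB/s written, one entry per sample]}."""
    rates = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        device, values = parse_row(line)
        kw_per_s = values[3]  # column order: r/s, w/s, kr/s, kw/s, ...
        rates.setdefault(device, []).append(kw_per_s / 1024.0)
    return rates


if __name__ == "__main__":
    for device, rates in write_rates_mb(SAMPLE).items():
        print(device, " ".join(f"{r:.1f}" for r in rates), "MB/s written")
```

On these rows it gives roughly 9.4 and 10.0 MB/s for c1t0d0, so the boot disk is absorbing writes at about the same rate as the disk being resilvered.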
In a few seconds it wrote this much:

# svcadm disable fmd
# ls -l /var/fm/fmd/
total 55934
drwx------   3 root  sys         512 10 Apr 17:41 ckpt
-rw-r--r--   1 root  root   28599664 12 Jun 15:56 errlog
-rw-r--r--   1 root  root       3410 12 Jun 15:55 fltlog
drwx------   2 root  sys         512 12 Jun 15:55 rsrc
drwx------   2 root  sys         512 13 Dec  2007 xprt

The content of the file is not printable.
Maurilio.
--
This message posted from opensolaris.org
Eric Schrock
2009-Jun-12 19:40 UTC
[zfs-discuss] fmd writes tons of errors during a resilver
On 06/12/09 07:05, Maurilio Longo wrote:
> Hi,
>
> I'm trying to expand a raidz pool made up of six drives replacing one at a time with a bigger disk and waiting for resilver on a snv114 system.

This is:

6807339 spurious checksum errors when replacing a vdev

- Eric

--
Eric Schrock, Fishworks http://blogs.sun.com/eschrock