Dennis Clarke
2010-Feb-17 21:33 UTC
[zfs-discuss] false DEGRADED status based on "cannot open" device at boot.
I find that some servers display a DEGRADED zpool status at boot. More troubling is that this seems to be silent and no notice is given on the console or via a snmp message or other notification process. Let me demonstrate : {0} ok boot -srv Sun Blade 2500 (Silver), No Keyboard Copyright 2005 Sun Microsystems, Inc. All rights reserved. OpenBoot 4.17.3, 4096 MB memory installed, Serial #64510477. Ethernet address 0:3:ba:d8:5a:d, Host ID: 83d85a0d. Rebooting with command: boot -srv Boot device: /pci at 1d,700000/scsi at 4,1/disk at 0,0:a File and args: -srv module /platform/sun4u/kernel/sparcv9/unix: text at [0x1000000, 0x10a3695] data at 0x1800000 module /platform/sun4u/kernel/sparcv9/genunix: text at [0x10a3698, 0x126bbf7] data at 0x1866840 module /platform/SUNW,Sun-Blade-2500/kernel/misc/sparcv9/platmod: text at [0x126bbf8, 0x126c1e7] data at 0x18bc0c8 . . . many lines of verbose messages . . dump on /dev/zvol/dsk/mercury_rpool/swap size 0 MB Loading smf(5) service descriptions: 2/2 Requesting System Maintenance Mode SINGLE USER MODE Root password for system maintenance (control-d to bypass): single-user privilege assigned to /dev/console. Entering System Maintenance Mode # zpool list NAME SIZE USED AVAIL CAP HEALTH ALTROOT mercury_rpool 68G 27.4G 40.6G 40% DEGRADED - # zpool status mercury_rpool pool: mercury_rpool state: DEGRADED status: One or more devices could not be opened. Sufficient replicas exist for the pool to continue functioning in a degraded state. action: Attach the missing device and online it using ''zpool online''. see: http://www.sun.com/msg/ZFS-8000-2Q scrub: none requested config: NAME STATE READ WRITE CKSUM mercury_rpool DEGRADED 0 0 0 mirror DEGRADED 0 0 0 c3t0d0s0 ONLINE 0 0 0 c1t2d0s0 UNAVAIL 0 0 0 cannot open errors: No known data errors This is trivial to remedy : # zpool online mercury_rpool c1t2d0s0 # zpool list NAME SIZE USED AVAIL CAP HEALTH ALTROOT mercury_rpool 68G 27.4G 40.6G 40% ONLINE - # zpool status mercury_rpool pool: mercury_rpool state: ONLINE status: The pool is formatted using an older on-disk format. The pool can still be used, but some features are unavailable. action: Upgrade the pool using ''zpool upgrade''. Once this is done, the pool will no longer be accessible on older software versions. scrub: resilver completed after 0h0m with 0 errors on Wed Feb 17 21:26:11 2010 config: NAME STATE READ WRITE CKSUM mercury_rpool ONLINE 0 0 0 mirror ONLINE 0 0 0 c3t0d0s0 ONLINE 0 0 0 c1t2d0s0 ONLINE 0 0 0 14.5M resilvered errors: No known data errors # I have many systems where I keep mirrors on multiple controllers, either fibre or SCSI. It seems that the SCSI devices don''t get detected at boot on the Sparc systems. The x86/AMD64 systems do not seem to have this problem but I may be wrong. Is this a known bug or am I seeing something due to a missing line in /etc/system ? Oh, also, I should point out that it does not matter if I boot with init S or 3 or 6. -- Dennis Clarke dclarke at opensolaris.ca <- Email related to the open source Solaris dclarke at blastwave.org <- Email related to open source for Solaris
Bob Friesenhahn
2010-Feb-17 22:22 UTC
[zfs-discuss] false DEGRADED status based on "cannot open" device at boot.
On Wed, 17 Feb 2010, Dennis Clarke wrote:> > NAME STATE READ WRITE CKSUM > mercury_rpool ONLINE 0 0 0 > mirror ONLINE 0 0 0 > c3t0d0s0 ONLINE 0 0 0 > c1t2d0s0 ONLINE 0 0 0 14.5M resilvered > > errors: No known data errors > # > > I have many systems where I keep mirrors on multiple controllers, either > fibre or SCSI. It seems that the SCSI devices don''t get detected at boot > on the Sparc systems. The x86/AMD64 systems do not seem to have this > problem but I may be wrong. > > Is this a known bug or am I seeing something due to a missing line in > /etc/system ?My Sun Blade 2500 (Red) does see both boot disks. However, I do recall an issue at one time with the Solaris power management daemon in that it shut down the second disk during boot so that it was not seen. It was mentioned in the Solaris release notes (maybe U5 or U6?) and it happened to me. A fix to /etc/power.conf was required. Perhaps that is what is happening to you. Bob -- Bob Friesenhahn bfriesen at simple.dallas.tx.us, http://www.simplesystems.org/users/bfriesen/ GraphicsMagick Maintainer, http://www.GraphicsMagick.org/
Dennis Clarke
2010-Feb-17 22:28 UTC
[zfs-discuss] false DEGRADED status based on "cannot open" device at boot.
> On Wed, 17 Feb 2010, Dennis Clarke wrote: >> >> NAME STATE READ WRITE CKSUM >> mercury_rpool ONLINE 0 0 0 >> mirror ONLINE 0 0 0 >> c3t0d0s0 ONLINE 0 0 0 >> c1t2d0s0 ONLINE 0 0 0 14.5M resilvered >> >> errors: No known data errors >> # >> >> I have many systems where I keep mirrors on multiple controllers, either >> fibre or SCSI. It seems that the SCSI devices don''t get detected at boot >> on the Sparc systems. The x86/AMD64 systems do not seem to have this >> problem but I may be wrong. >> >> Is this a known bug or am I seeing something due to a missing line in >> /etc/system ? > > My Sun Blade 2500 (Red) does see both boot disks. However, I do > recall an issue at one time with the Solaris power management daemon > in that it shut down the second disk during boot so that it was not > seen. It was mentioned in the Solaris release notes (maybe U5 or U6?) > and it happened to me. A fix to /etc/power.conf was required. > Perhaps that is what is happening to you.Good theory, however, this disk is fully external with its own power. Strange. I''ll go have a look at a V490 I have here ( snv_130 ) and install a few SCSI cards just to see what happens. Maybe this is specific to the SB2500 workstations. -- Dennis Clarke dclarke at opensolaris.ca <- Email related to the open source Solaris dclarke at blastwave.org <- Email related to open source for Solaris
Cindy Swearingen
2010-Feb-17 22:43 UTC
[zfs-discuss] false DEGRADED status based on "cannot open" device at boot.
Hi Dennis, You might be running into this issue: http://bugs.opensolaris.org/bugdatabase/view_bug.do?bug_id=6856341 The workaround is to force load the drivers. Thanks, Cindy On 02/17/10 14:33, Dennis Clarke wrote:> I find that some servers display a DEGRADED zpool status at boot. More > troubling is that this seems to be silent and no notice is given on the > console or via a snmp message or other notification process. > > Let me demonstrate : > > {0} ok boot -srv > > Sun Blade 2500 (Silver), No Keyboard > Copyright 2005 Sun Microsystems, Inc. All rights reserved. > OpenBoot 4.17.3, 4096 MB memory installed, Serial #64510477. > Ethernet address 0:3:ba:d8:5a:d, Host ID: 83d85a0d. > > > > Rebooting with command: boot -srv > Boot device: /pci at 1d,700000/scsi at 4,1/disk at 0,0:a File and args: -srv > module /platform/sun4u/kernel/sparcv9/unix: text at [0x1000000, 0x10a3695] > data at 0x1800000 > module /platform/sun4u/kernel/sparcv9/genunix: text at [0x10a3698, > 0x126bbf7] data at 0x1866840 > module /platform/SUNW,Sun-Blade-2500/kernel/misc/sparcv9/platmod: text at > [0x126bbf8, 0x126c1e7] data at 0x18bc0c8 > . > . > . many lines of verbose messages > . > . > > dump on /dev/zvol/dsk/mercury_rpool/swap size 0 MB > Loading smf(5) service descriptions: 2/2 > Requesting System Maintenance Mode > SINGLE USER MODE > > Root password for system maintenance (control-d to bypass): > single-user privilege assigned to /dev/console. > Entering System Maintenance Mode > > # zpool list > NAME SIZE USED AVAIL CAP HEALTH ALTROOT > mercury_rpool 68G 27.4G 40.6G 40% DEGRADED - > # zpool status mercury_rpool > pool: mercury_rpool > state: DEGRADED > status: One or more devices could not be opened. Sufficient replicas > exist for > the pool to continue functioning in a degraded state. > action: Attach the missing device and online it using ''zpool online''. > see: http://www.sun.com/msg/ZFS-8000-2Q > scrub: none requested > config: > > NAME STATE READ WRITE CKSUM > mercury_rpool DEGRADED 0 0 0 > mirror DEGRADED 0 0 0 > c3t0d0s0 ONLINE 0 0 0 > c1t2d0s0 UNAVAIL 0 0 0 cannot open > > errors: No known data errors > > This is trivial to remedy : > > # zpool online mercury_rpool c1t2d0s0 > # zpool list > NAME SIZE USED AVAIL CAP HEALTH ALTROOT > mercury_rpool 68G 27.4G 40.6G 40% ONLINE - > # zpool status mercury_rpool > pool: mercury_rpool > state: ONLINE > status: The pool is formatted using an older on-disk format. The pool can > still be used, but some features are unavailable. > action: Upgrade the pool using ''zpool upgrade''. Once this is done, the > pool will no longer be accessible on older software versions. > scrub: resilver completed after 0h0m with 0 errors on Wed Feb 17 21:26:11 > 2010 > config: > > NAME STATE READ WRITE CKSUM > mercury_rpool ONLINE 0 0 0 > mirror ONLINE 0 0 0 > c3t0d0s0 ONLINE 0 0 0 > c1t2d0s0 ONLINE 0 0 0 14.5M resilvered > > errors: No known data errors > # > > I have many systems where I keep mirrors on multiple controllers, either > fibre or SCSI. It seems that the SCSI devices don''t get detected at boot > on the Sparc systems. The x86/AMD64 systems do not seem to have this > problem but I may be wrong. > > Is this a known bug or am I seeing something due to a missing line in > /etc/system ? > > Oh, also, I should point out that it does not matter if I boot with init S > or 3 or 6. >
Daniel Carosone
2010-Feb-17 22:45 UTC
[zfs-discuss] false DEGRADED status based on "cannot open" device at boot.
On Wed, Feb 17, 2010 at 05:28:03PM -0500, Dennis Clarke wrote:> Good theory, however, this disk is fully external with its own power.It can still be commanded to offline state. -- Dan. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 194 bytes Desc: not available URL: <http://mail.opensolaris.org/pipermail/zfs-discuss/attachments/20100218/24f20958/attachment.bin>