Haudy Kazemi
2008-Apr-08 06:25 UTC
[zfs-discuss] incorrect/conflicting suggestion in error message on a faulted pool
Hello,

I'm writing to report what I think is an incorrect or conflicting suggestion in the error message displayed on a faulted pool that does not have redundancy (equiv to RAID0?). I ran across this while testing and learning about ZFS on a clean installation of NexentaCore 1.0.

Here is how to recreate the scenario:

kaz@testpc1:~$ mkfile 200m testdisk1 testdisk2
kaz@testpc1:~$ sudo zpool create mybigpool $PWD/testdisk1 $PWD/testdisk2
Password:
kaz@testpc1:~$ zpool status mybigpool
  pool: mybigpool
 state: ONLINE
 scrub: none requested
config:

        NAME                          STATE     READ WRITE CKSUM
        mybigpool                     ONLINE       0     0     0
          /export/home/kaz/testdisk1  ONLINE       0     0     0
          /export/home/kaz/testdisk2  ONLINE       0     0     0

errors: No known data errors
kaz@testpc1:~$ sudo zpool scrub mybigpool
kaz@testpc1:~$ zpool status mybigpool
  pool: mybigpool
 state: ONLINE
 scrub: scrub completed after 0h0m with 0 errors on Mon Apr 7 22:09:29 2008
config:

        NAME                          STATE     READ WRITE CKSUM
        mybigpool                     ONLINE       0     0     0
          /export/home/kaz/testdisk1  ONLINE       0     0     0
          /export/home/kaz/testdisk2  ONLINE       0     0     0

errors: No known data errors

Up to here everything looks fine. Now let's destroy one of the virtual drives:

kaz@testpc1:~$ rm testdisk2
kaz@testpc1:~$ zpool status mybigpool
  pool: mybigpool
 state: ONLINE
 scrub: scrub completed after 0h0m with 0 errors on Mon Apr 7 22:09:29 2008
config:

        NAME                          STATE     READ WRITE CKSUM
        mybigpool                     ONLINE       0     0     0
          /export/home/kaz/testdisk1  ONLINE       0     0     0
          /export/home/kaz/testdisk2  ONLINE       0     0     0

errors: No known data errors

Okay, still looks fine, but I haven't tried to read/write to it yet. Try a scrub:

kaz@testpc1:~$ sudo zpool scrub mybigpool
kaz@testpc1:~$ zpool status mybigpool
  pool: mybigpool
 state: FAULTED
status: One or more devices could not be opened.  Sufficient replicas exist for
        the pool to continue functioning in a degraded state.
action: Attach the missing device and online it using 'zpool online'.
   see: http://www.sun.com/msg/ZFS-8000-2Q
 scrub: scrub completed after 0h0m with 0 errors on Mon Apr 7 22:10:36 2008
config:

        NAME                          STATE     READ WRITE CKSUM
        mybigpool                     FAULTED      0     0     0  insufficient replicas
          /export/home/kaz/testdisk1  ONLINE       0     0     0
          /export/home/kaz/testdisk2  UNAVAIL      0     0     0  cannot open

errors: No known data errors
kaz@testpc1:~$

There we go. The pool has faulted, as I expected, because I created it as a non-redundant pool. I think it was the equivalent of a RAID0 pool with checksumming; at least, it behaves like one. The key to my reporting this is that the "status" message says "One or more devices could not be opened. Sufficient replicas exist for the pool to continue functioning in a degraded state.", while the message further down, to the right of the pool name, says "insufficient replicas".

The verbose status message is wrong in this case. From other forum/list posts it looks like that status message is also used for degraded pools, which isn't a problem, but here we have a faulted pool. Here's an example of the same status message used appropriately:
http://mail.opensolaris.org/pipermail/zfs-discuss/2006-April/031298.html

Is anyone else able to reproduce this? And if so, is there a ZFS bug tracker to report this to? (I didn't see a public bug tracker when I looked.)

Thanks,

Haudy Kazemi
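P.S. For comparison, if the pool is created with real redundancy, that status text should become accurate: losing one side of a mirror is expected to leave the pool DEGRADED rather than FAULTED. A minimal sketch of that variant (untested on my part; the mymirrorpool and mirrordisk names are just placeholders):

# same idea as above, but the two files back a mirror vdev instead of a stripe
kaz@testpc1:~$ mkfile 200m mirrordisk1 mirrordisk2
kaz@testpc1:~$ sudo zpool create mymirrorpool mirror $PWD/mirrordisk1 $PWD/mirrordisk2
# remove one backing file, then force ZFS to reopen the devices
kaz@testpc1:~$ rm mirrordisk2
kaz@testpc1:~$ sudo zpool scrub mymirrorpool
kaz@testpc1:~$ zpool status mymirrorpool

In that variant I would expect "state: DEGRADED" with the surviving device still ONLINE, so "Sufficient replicas exist for the pool to continue functioning in a degraded state" would describe what the pool is actually doing; in the non-redundant case above it does not.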
Haudy Kazemi
2008-Apr-09 16:08 UTC
[zfs-discuss] incorrect/conflicting suggestion in error message on a faulted pool
I have reported this bug here:
http://bugs.opensolaris.org/view_bug.do?bug_id=6685676

I think this bug may be related, but I do not see where to add a note to an existing bug report:
http://bugs.opensolaris.org/view_bug.do?bug_id=6633592
(Both bugs refer to ZFS-8000-2Q; however, my report shows a FAULTED pool instead of a DEGRADED pool.)

Thanks,

-hk

Haudy Kazemi wrote:
> Hello,
>
> I'm writing to report what I think is an incorrect or conflicting
> suggestion in the error message displayed on a faulted pool that does
> not have redundancy (equiv to RAID0?).
> [...]
Neil Perrin
2008-Apr-09 23:35 UTC
[zfs-discuss] incorrect/conflicting suggestion in error message on a faulted pool
Haudy,

Thanks for reporting this bug and helping to improve ZFS. I'm not sure either how you could have added a note to an existing report. Anyway, I've gone ahead and done that for you in the "Related Bugs" field, though opensolaris doesn't reflect it yet.

Neil

Haudy Kazemi wrote:
> I have reported this bug here:
> http://bugs.opensolaris.org/view_bug.do?bug_id=6685676
>
> I think this bug may be related, but I do not see where to add a note to
> an existing bug report:
> http://bugs.opensolaris.org/view_bug.do?bug_id=6633592
> (both bugs refer to ZFS-8000-2Q however my report shows a FAULTED pool
> instead of a DEGRADED pool.)
> [...]