Chris Murray
2008-Jan-10 08:22 UTC
[zfs-discuss] Assistance needed expanding RAIDZ with larger drives
Hi all,

Please can you help with my ZFS troubles? I currently have 3 x 400 GB Seagate NL35s and a 500 GB Samsung Spinpoint in a RAIDZ array that I wish to expand by systematically replacing each drive with a 750 GB Western Digital Caviar. After failing miserably, I'd like to start from scratch again if possible.

When I last tried, the replace command hung for an age, network connectivity was lost (I have to do this over SSH, so that dropped too), and I got strange output from zpool status where a particular device (but not the one being replaced) was listed twice. I also managed to mess up 3 of the 4 WD drives in such a way that they were detected by the BIOS, but nothing after that would work:

- In Solaris, format wouldn't show the drive (c2d0 should have been there, but it wasn't).
- Using an old Ghost boot CD, the 3 old drives would list, but the new one wouldn't.
- GParted couldn't see the new drive (the 3 old ones were there!).

Again, in all of these situations the drive WAS detected in the BIOS and everything looked perfectly fine there. I have now managed to salvage one using Windows and a USB caddy in another computer. It is formatted as one big NTFS volume, so the drive does work.

Here is the output from zpool status:

| # zpool status -v
|   pool: zp
|  state: ONLINE
|  scrub: scrub in progress, 30.76% done, 4h33m to go
| config:
|
|         NAME        STATE     READ WRITE CKSUM
|         zp          ONLINE       0     0     0
|           raidz1    ONLINE       0     0     0
|             c2d1    ONLINE       0     0     0
|             c3d1    ONLINE       0     0     0
|             c2d0    ONLINE       0     0     0
|             c3d0    ONLINE       0     0     0
|
| errors: No known data errors

And here is the output from zdb:

| # zdb
| zp
|     version=9
|     name='zp'
|     state=0
|     txg=110026
|     pool_guid=5629347939003043989
|     hostid=823611165
|     hostname='mammoth'
|     vdev_tree
|         type='root'
|         id=0
|         guid=5629347939003043989
|         children[0]
|             type='raidz'
|             id=0
|             guid=1325151684809734884
|             nparity=1
|             metaslab_array=14
|             metaslab_shift=33
|             ashift=9
|             asize=1600289505280
|             is_log=0
|             children[0]
|                 type='disk'
|                 id=0
|                 guid=5385778296365299126
|                 path='/dev/dsk/c2d1s0'
|                 devid='id1,cmdk@AST3400632NS=____________5NF1EDQL/a'
|                 phys_path='/pci@0,0/pci-ide@12/ide@0/cmdk@1,0:a'
|                 whole_disk=1
|                 DTL=33
|             children[1]
|                 type='disk'
|                 id=1
|                 guid=15098521488705848306
|                 path='/dev/dsk/c3d1s0'
|                 devid='id1,cmdk@AST3400632NS=____________5NF1EDS2/a'
|                 phys_path='/pci@0,0/pci-ide@12/ide@1/cmdk@1,0:a'
|                 whole_disk=1
|                 DTL=32
|             children[2]
|                 type='disk'
|                 id=2
|                 guid=4518340092563481291
|                 path='/dev/dsk/c2d0s0'
|                 devid='id1,cmdk@ASAMSUNG_HD501LJ=S0MUJ1NPA00154/a'
|                 phys_path='/pci@0,0/pci-ide@12/ide@0/cmdk@0,0:a'
|                 whole_disk=1
|                 DTL=31
|             children[3]
|                 type='disk'
|                 id=3
|                 guid=7852006658048665355
|                 path='/dev/dsk/c3d0s0'
|                 devid='id1,cmdk@AST3400632NS=____________5NF1EEZ1/a'
|                 phys_path='/pci@0,0/pci-ide@12/ide@1/cmdk@0,0:a'
|                 whole_disk=1
|                 DTL=30

I have found many, many posts regarding issues when replacing drives, and have used bits of them to get myself into a working state, but I'm now confused by all the imports, exports, detaches and so on. Please can someone walk me through this step by step, starting from: the array is functioning perfectly, but I want to replace c2d0 with a new WD drive. Once the scrub completes, what do I do?

Many thanks,
Chris Murray
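For reference, a minimal sketch of the usual single-slot replacement sequence, assuming the new 750 GB drive is cabled into the same position c2d0 occupies today (a sketch only, not a tested recipe for this particular box):

    (let the scrub finish, then confirm the pool is clean)
    # zpool status zp

    (power down, physically swap the 500 GB drive at c2d0 for the 750 GB WD, boot again)

    (start resilvering onto the new disk in the same slot)
    # zpool replace zp c2d0

    (watch progress; the raidz1 stays available, but with no redundancy to spare, until the resilver completes)
    # zpool status zp

The extra capacity only becomes usable once every disk in the raidz1 has been replaced and resilvered, and on builds of this era an export and re-import of the pool was typically needed before the larger size showed up.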
Robert
2008-Jan-11 12:35 UTC
[zfs-discuss] Assistance needed expanding RAIDZ with larger drives
> and I got strange output from zpool status where a particular device (but not the one that was being replaced) was listed twice

About that issue, please check my post in:
http://www.opensolaris.org/jive/thread.jspa?threadID=48483&tstart=0
Chris Murray
2008-Jan-13 11:10 UTC
[zfs-discuss] Assistance needed expanding RAIDZ with larger drives
> About that issue, please check my post in:
> http://www.opensolaris.org/jive/thread.jspa?threadID=48483&tstart=0

Thanks - when I originally tried to replace the first drive, my intention was to:

1. Move the Solaris box and drives
2. Power up to test it still works
3. Power down
4. Replace the drive

I suspect I may have missed out 2 and 3, and ran into the same situation that you did.

Anyhow, I now seem to be in an even bigger mess than before. When I tried to simply swap out one of the old drives for a new one and perform a replace, I ran into problems:

1. The hard drive light on the PC lit up, and I heard lots of disk noise, as you would expect.
2. The light went off. My continuous ping did the following:

   Reply from 192.168.0.10: bytes=32 time<1ms TTL=255
   Reply from 192.168.0.10: bytes=32 time<1ms TTL=255
   Request timed out.
   Request timed out.
   Request timed out.
   Request timed out.
   Request timed out.
   Request timed out.
   Reply from 192.168.0.10: bytes=32 time=2092ms TTL=255
   Reply from 192.168.0.10: bytes=32 time<1ms TTL=255
   Reply from 192.168.0.10: bytes=32 time<1ms TTL=255

3. The light came back on again, with more disk noise. Good - perhaps the pause was just a momentary blip.
4. The light went off again (this is about 20 minutes since the start).
5. zpool status reported that a resilver had completed, that there were errors in zp/storage and zp/VMware, and suggested that I restore from backup.
6. I nearly cried, as these are the only two volumes I use.
7. I have heard of ZFS thinking there are unrecoverable errors before, so I ran zpool scrub and then zpool clear a number of times. It seems to make no difference.

This whole project started when I wanted to move 900 GB of data from a Server 2003 box containing the 4 old disks to a Solaris box. I borrowed 2 x 500 GB drives from a friend, copied all the data onto them, put the 4 old drives into the Solaris box, created the zpool, created my storage and VMware volumes, shared them out using iSCSI, created NTFS volumes on the Server 2003 box and copied the data back onto them. Aside from a couple of networking issues, this worked absolutely perfectly. Then I decided I'd like some more space, and that's where it all went wrong.

Despite the reports of corruption, the storage and VMware "drives" do still work in Windows. The iSCSI initiator still picks them up, and if I run dir /a /s, I can see all of the files that were on these NTFS volumes before I tried this morning's replace. However, should I trust this? I suspect that even if I ran a chkdsk /f, a successful result may not be all that it seems. I still have the 2 x 500 GB drives with my data from weeks ago. I'd be sad to lose a few weeks' worth of work, but that would be better than assuming ZFS is wrong about the volumes being corrupt and then discovering in months' time that I cannot get at the NTFS files because of this root cause.

Since the report of corruption in these two volumes, I had a genius troubleshooting idea: "what if the problem is not with ZFS, but with Solaris not liking the drives in general?" I exported my current zpool, disconnected all drives, plugged in the 4 new ones, and waited for the system to boot again... nothing. The system had stopped in the BIOS, requesting that I press F1 because SMART reported that one of the drives is bad! Already?!? I only bought the drives a few days ago!!! Now the problem is that I know which of these drives is bad, but I don't know whether it was the one that was plugged in when zpool status reported all the read/write/checksum errors.
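(Incidentally, one way to tie a physical drive back to a Solaris device name is to have the OS print the serial numbers and error counters itself; a rough sketch, assuming the IDE driver exposes the serial at all:

   (per-device model, serial number and cumulative error counts)
   # iostat -En

   (and the exact datasets/files holding permanent errors)
   # zpool status -v zp

Matching the serial printed on the label of the drive that failed SMART against the iostat -En output would show which cXdY it was sitting at during the failed replace.)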
So maybe I have a duff batch of drives. I left the remaining 3 plugged in and created a brand new zpool called test. No problems at all. I created a 1300 GB volume on it. Also no problem. I'm currently overwriting it with random data:

dd if=/dev/urandom of=/dev/zvol/rdsk/test/test bs=1048576 count=1331200

I throw in the odd zpool scrub to see how things are doing, and so far there hasn't been a single error of any sort. So 3 of the WD drives (0430739, 0388708, 0417089) appear to be fine and one is dead already (0373211). This leads me to the conclusion that (ignoring the bad one) these drives work fine with Solaris, and they work fine with ZFS too. It's just the act of trying to replace a drive from my old zpool with a new one that causes issues.

My next step will be to run the WD diagnostics on all drives, send the broken one back, and then have 4 fully functioning 750 GB drives. I'll also import the old zpool into the Solaris box - it'll undoubtedly complain that one of the drives is missing (the one that I tried to add earlier and got all the errors from), so I think I'll try one more replace to get all 4 old drives back in the pool. So, what do I do after that?

1. Create a brand new pool out of the WD drives, share it using iSCSI and copy onto it my data from my friend's drives (roughly as sketched below)? I'd lose a good few weeks of work, but I'd be confident that it isn't corrupt.
2. Ignore the fact that ZFS thought storage and VMware were corrupt and overwrite the contents of my friend's drives with everything from the current NTFS volumes? I could then create a new pool out of the new drives and copy the data back onto new NTFS volumes. However, if it all went wrong, I wouldn't have anything to restore from.
3. Ignore the ZFS errors again and keep trying to replace each of the old drives in turn? This obviously hasn't worked for me in the past and I've spent a LOT of time on this, and of course the NTFS volumes could still be corrupt, but surely this process should work!! After all, if a drive did break and I replaced it with a new one, would I run into this mess again?!

This all seems to hinge on whether I can trust the results of a chkdsk /f. I appreciate that this isn't an NTFS discussion group, it's a ZFS one, but if anyone has had any experience of a similar scenario I'd appreciate the help!

Chris
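A minimal sketch of what option 1 might look like on the command line, assuming the four 750 GB drives end up at the same controller positions the old disks used and that the shareiscsi property is available on this build, as it was on Solaris Express releases of that era (the pool name, device names and volume sizes below are placeholders, not figures from the thread):

   (build a fresh raidz1 pool from the four new drives)
   # zpool create bigzp raidz c2d0 c2d1 c3d0 c3d1

   (carve out a volume per NTFS target; sizes are illustrative only)
   # zfs create -V 900g bigzp/storage
   # zfs create -V 300g bigzp/VMware

   (export the volumes over iSCSI, as the original pool did)
   # zfs set shareiscsi=on bigzp/storage
   # zfs set shareiscsi=on bigzp/VMware

Once the Windows initiator reconnects and the data has been copied back from the borrowed 500 GB drives, a zpool scrub of the new pool gives checksum-level confirmation that what landed on the new disks is intact.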