Hello
I have a CentOS 6.6 Server with 13 disks in a RAID 6. Some weeks ago, i upgraded
it to 17 disks, two of them configured as spare. The reshape worked like normal
in the beginning. But at 69% it stopped.
md2 : active raid6 sdj1[0] sdg1[18](S) sdh1[2] sdi1[5] sdm1[15] sds1[12]
sdr1[14] sdk1[9] sdo1[6] sdn1[13] sdl1[8] sdd1[20] sdf1[19] sdq1[16] sdb1[10]
sde1[17](S) sdc1[21]
19533803520 blocks super 1.2 level 6, 1024k chunk, algorithm 2 [15/15]
[UUUUUUUUUUUUUUU]
[=============>.......] reshape = 69.0% (1347861324/1953380352)
finish=46103134.8min speed=0K/sec
I already tried to stop the raid and start it again, the reshape will start but
stop again after some minutes. If I reboot the server, the reshape won't
start:
md2 : active raid6 sdj1[0] sdg1[18](S) sdh1[2] sdi1[5] sdm1[15] sds1[12]
sdr1[14] sdk1[9] sdo1[6] sdn1[13] sdl1[8] sdd1[20] sdf1[19] sdq1[16] sdb1[10]
sde1[17](S) sdc1[21]
19533803520 blocks super 1.2 level 6, 1024k chunk, algorithm 2 [15/15]
[UUUUUUUUUUUUUUU]
resync=PENDING
Just if I restart the raid again, it will start the reshape process and stop it
like above.
In dmesg and messages logs I just found:
dmesg
md/raid:md2: reshape: not enough stripes. Needed 1024
messages
23:14:56 data kernel: md/raid:md2: not clean -- starting background
reconstruction
23:14:56 data kernel: md/raid:md2: reshape will continue
23:14:56 data kernel: md/raid:md2: device sdj1 operational as raid disk 0
23:14:56 data kernel: md/raid:md2: device sdh1 operational as raid disk 2
23:14:56 data kernel: md/raid:md2: device sdi1 operational as raid disk 5
23:14:56 data kernel: md/raid:md2: device sdn1 operational as raid disk 11
23:14:56 data kernel: md/raid:md2: device sds1 operational as raid disk 3
23:14:56 data kernel: md/raid:md2: device sdm1 operational as raid disk 1
23:14:56 data kernel: md/raid:md2: device sdf1 operational as raid disk 14
23:14:56 data kernel: md/raid:md2: device sdd1 operational as raid disk 13
23:14:56 data kernel: md/raid:md2: device sdb1 operational as raid disk 10
23:14:56 data kernel: md/raid:md2: device sdq1 operational as raid disk 7
23:14:56 data kernel: md/raid:md2: device sdr1 operational as raid disk 4
23:14:56 data kernel: md/raid:md2: device sdl1 operational as raid disk 8
23:14:56 data kernel: md/raid:md2: device sdk1 operational as raid disk 9
23:14:56 data kernel: md/raid:md2: device sdc1 operational as raid disk 12
23:14:56 data kernel: md/raid:md2: device sdo1 operational as raid disk 6
23:14:56 data kernel: md/raid:md2: allocated 0kB
23:14:56 data kernel: md/raid:md2: raid level 6 active with 15 out of 15
devices, algorithm 2
23:14:56 data kernel: md2: Warning: Device sdi1 is misaligned
23:14:56 data kernel: md2: detected capacity change from 0 to 20002614804480
23:14:56 data kernel: md2: unknown partition table
23:14:56 data kernel: XFS (md2): Mounting Filesystem
23:14:56 data kernel: md/raid:md2: reshape: not enough stripes. Needed 1024
23:14:56 data kernel: XFS (md2): Ending clean mount
So i fixed the stripes:
cat /sys/block/md2/md/stripe_cache_size
16384
But the reshape is still not working and the same error still appears in the
logs.
Have anyone some idea?
Regards
Daniel