this has never happened to me before, and I'm somewhat at a loss. got a email from the cron thing... /etc/cron.weekly/99-raid-check: WARNING: mismatch_cnt is not 0 on /dev/md10 WARNING: mismatch_cnt is not 0 on /dev/md11 ok, md10 and md11 are each raid1's made from 2 x 72GB scsi drives, on a dell 2850 or something dual single-core 3ghz server. these two md's are in turn a striped LVM volume group dmesg shows.... md: syncing RAID array md10 md: minimum _guaranteed_ reconstruction speed: 1000 KB/sec/disc. md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec) for reconstruction. md: using 128k window, over a total of 143374656 blocks. md: syncing RAID array md11 md: minimum _guaranteed_ reconstruction speed: 1000 KB/sec/disc. md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec) for reconstruction. md: using 128k window, over a total of 143374656 blocks. md: md10: sync done. RAID1 conf printout: --- wd:2 rd:2 disk 0, wo:0, o:1, dev:sdc1 disk 1, wo:0, o:1, dev:sdd1 md: md11: sync done. RAID1 conf printout: --- wd:2 rd:2 disk 0, wo:0, o:1, dev:sde1 disk 1, wo:0, o:1, dev:sdf1 I'm not sure what thats telling me. the last thing prior to this in dmesg was when I added a swap to this vg last week. and mdadm --detail shows... # mdadm --detail /dev/md10 /dev/md10: Version : 0.90 Creation Time : Wed Oct 8 12:54:48 2008 Raid Level : raid1 Array Size : 143374656 (136.73 GiB 146.82 GB) Used Dev Size : 143374656 (136.73 GiB 146.82 GB) Raid Devices : 2 Total Devices : 2 Preferred Minor : 10 Persistence : Superblock is persistent Update Time : Sun Feb 28 04:53:29 2010 State : clean Active Devices : 2 Working Devices : 2 Failed Devices : 0 Spare Devices : 0 UUID : b6da4dc5:c7372d6e:63f32b9c:49fa95f9 Events : 0.84 Number Major Minor RaidDevice State 0 8 33 0 active sync /dev/sdc1 1 8 49 1 active sync /dev/sdd1 # mdadm --detail /dev/md11 /dev/md11: Version : 0.90 Creation Time : Wed Oct 8 12:54:57 2008 Raid Level : raid1 Array Size : 143374656 (136.73 GiB 146.82 GB) Used Dev Size : 143374656 (136.73 GiB 146.82 GB) Raid Devices : 2 Total Devices : 2 Preferred Minor : 11 Persistence : Superblock is persistent Update Time : Sun Feb 28 11:49:45 2010 State : clean Active Devices : 2 Working Devices : 2 Failed Devices : 0 Spare Devices : 0 UUID : be475cd9:b98ee3ff:d18e668c:a5a6e06b Events : 0.62 Number Major Minor RaidDevice State 0 8 65 0 active sync /dev/sde1 1 8 81 1 active sync /dev/sdf1 I don't see anything wrong here ? lvm shows no problems I detect either... # vgdisplay vg1 Volume group "vgdisplay" not found LV VG Attr LSize Origin Snap% Move Log Copy% Convert glassfish vg1 -wi-ao 10.00G lv1 vg1 -wi-ao 97.66G oradata vg1 -wi-ao 30.00G pgdata vg1 -wi-ao 25.00G pgdata_lss_idx vg1 -wi-ao 20.00G pgdata_lss_tab vg1 -wi-ao 20.00G swapper vg1 -wi-ao 3.00G vmware vg1 -wi-ao 50.00G # pvdisplay /dev/md10 /dev/md11 --- Physical volume --- PV Name /dev/md10 VG Name vg1 PV Size 136.73 GB / not usable 2.31 MB Allocatable yes PE Size (KByte) 4096 Total PE 35003 Free PE 1998 Allocated PE 33005 PV UUID oAgJY7-Tmf7-ac35-KoUH-15uz-Q5Ae-bmFCys --- Physical volume --- PV Name /dev/md11 VG Name vg1 PV Size 136.73 GB / not usable 2.31 MB Allocatable yes PE Size (KByte) 4096 Total PE 35003 Free PE 2560 Allocated PE 32443 PV UUID A4Qb3P-j5Lr-8ZEv-FjbC-Iczm-QkC8-bqP0zv
2010/2/28 John R Pierce <pierce at hogranch.com>:> this has never happened to me before, and I'm somewhat at a loss. ?got a > email from the cron thing... > > ? ?/etc/cron.weekly/99-raid-check: > > ? ?WARNING: mismatch_cnt is not 0 on /dev/md10 > ? ?WARNING: mismatch_cnt is not 0 on /dev/md11 > > > ok, md10 and md11 are each raid1's made from 2 x 72GB scsi drives, on a > dell 2850 or something dual single-core 3ghz server. > > these two md's are in turn a striped LVM volume group > > dmesg shows.... > > ? ?md: syncing RAID array md10 > ? ?md: minimum _guaranteed_ reconstruction speed: 1000 KB/sec/disc. > ? ?md: using maximum available idle IO bandwidth (but not more than > 200000 KB/sec) for reconstruction. > ? ?md: using 128k window, over a total of 143374656 blocks. > ? ?md: syncing RAID array md11 > ? ?md: minimum _guaranteed_ reconstruction speed: 1000 KB/sec/disc. > ? ?md: using maximum available idle IO bandwidth (but not more than > 200000 KB/sec) for reconstruction. > ? ?md: using 128k window, over a total of 143374656 blocks. > ? ?md: md10: sync done. > ? ?RAID1 conf printout: > ? ? --- wd:2 rd:2 > ? ? disk 0, wo:0, o:1, dev:sdc1 > ? ? disk 1, wo:0, o:1, dev:sdd1 > ? ?md: md11: sync done. > ? ?RAID1 conf printout: > ? ? --- wd:2 rd:2 > ? ? disk 0, wo:0, o:1, dev:sde1 > ? ? disk 1, wo:0, o:1, dev:sdf1 > > I'm not sure what thats telling me. ?the last thing prior to this in > dmesg was when I added a swap to this vg last week. > > > and mdadm --detail shows... > > # mdadm --detail /dev/md10 > /dev/md10: > ? ? ? ?Version : 0.90 > ?Creation Time : Wed Oct ?8 12:54:48 2008 > ? ? Raid Level : raid1 > ? ? Array Size : 143374656 (136.73 GiB 146.82 GB) > ?Used Dev Size : 143374656 (136.73 GiB 146.82 GB) > ? Raid Devices : 2 > ?Total Devices : 2 > Preferred Minor : 10 > ? ?Persistence : Superblock is persistent > > ? ?Update Time : Sun Feb 28 04:53:29 2010 > ? ? ? ? ?State : clean > ?Active Devices : 2 > Working Devices : 2 > ?Failed Devices : 0 > ?Spare Devices : 0 > > ? ? ? ? ? UUID : b6da4dc5:c7372d6e:63f32b9c:49fa95f9 > ? ? ? ? Events : 0.84 > > ? ?Number ? Major ? Minor ? RaidDevice State > ? ? ? 0 ? ? ? 8 ? ? ? 33 ? ? ? ?0 ? ? ?active sync ? /dev/sdc1 > ? ? ? 1 ? ? ? 8 ? ? ? 49 ? ? ? ?1 ? ? ?active sync ? /dev/sdd1 > # mdadm --detail /dev/md11 > /dev/md11: > ? ? ? ?Version : 0.90 > ?Creation Time : Wed Oct ?8 12:54:57 2008 > ? ? Raid Level : raid1 > ? ? Array Size : 143374656 (136.73 GiB 146.82 GB) > ?Used Dev Size : 143374656 (136.73 GiB 146.82 GB) > ? Raid Devices : 2 > ?Total Devices : 2 > Preferred Minor : 11 > ? ?Persistence : Superblock is persistent > > ? ?Update Time : Sun Feb 28 11:49:45 2010 > ? ? ? ? ?State : clean > ?Active Devices : 2 > Working Devices : 2 > ?Failed Devices : 0 > ?Spare Devices : 0 > > ? ? ? ? ? UUID : be475cd9:b98ee3ff:d18e668c:a5a6e06b > ? ? ? ? Events : 0.62 > > ? ?Number ? Major ? Minor ? RaidDevice State > ? ? ? 0 ? ? ? 8 ? ? ? 65 ? ? ? ?0 ? ? ?active sync ? /dev/sde1 > ? ? ? 1 ? ? ? 8 ? ? ? 81 ? ? ? ?1 ? ? ?active sync ? /dev/sdf1 > > > > I don't see anything wrong here ? > > lvm shows no problems I detect either... > > # vgdisplay vg1 > ?Volume group "vgdisplay" not found > ?LV ? ? ? ? ? ? VG ? Attr ? LSize ?Origin Snap% ?Move Log Copy% ?Convert > ?glassfish ? ? ?vg1 ?-wi-ao 10.00G > ?lv1 ? ? ? ? ? ?vg1 ?-wi-ao 97.66G > ?oradata ? ? ? ?vg1 ?-wi-ao 30.00G > ?pgdata ? ? ? ? vg1 ?-wi-ao 25.00G > ?pgdata_lss_idx vg1 ?-wi-ao 20.00G > ?pgdata_lss_tab vg1 ?-wi-ao 20.00G > ?swapper ? ? ? ?vg1 ?-wi-ao ?3.00G > ?vmware ? ? ? ? vg1 ?-wi-ao 50.00G > > > # pvdisplay /dev/md10 /dev/md11 > ?--- Physical volume --- > ?PV Name ? ? ? ? ? ? ? /dev/md10 > ?VG Name ? ? ? ? ? ? ? vg1 > ?PV Size ? ? ? ? ? ? ? 136.73 GB / not usable 2.31 MB > ?Allocatable ? ? ? ? ? yes > ?PE Size (KByte) ? ? ? 4096 > ?Total PE ? ? ? ? ? ? ?35003 > ?Free PE ? ? ? ? ? ? ? 1998 > ?Allocated PE ? ? ? ? ?33005 > ?PV UUID ? ? ? ? ? ? ? oAgJY7-Tmf7-ac35-KoUH-15uz-Q5Ae-bmFCys > > ?--- Physical volume --- > ?PV Name ? ? ? ? ? ? ? /dev/md11 > ?VG Name ? ? ? ? ? ? ? vg1 > ?PV Size ? ? ? ? ? ? ? 136.73 GB / not usable 2.31 MB > ?Allocatable ? ? ? ? ? yes > ?PE Size (KByte) ? ? ? 4096 > ?Total PE ? ? ? ? ? ? ?35003 > ?Free PE ? ? ? ? ? ? ? 2560 > ?Allocated PE ? ? ? ? ?32443 > ?PV UUID ? ? ? ? ? ? ? A4Qb3P-j5Lr-8ZEv-FjbC-Iczm-QkC8-bqP0zvmaybe?this?helps:?http://www.arrfab.net/blog/?p=199 -- Eero
Am 28.02.2010 22:03, schrieb John R Pierce:> WARNING: mismatch_cnt is not 0 onHave a look at http://www.arrfab.net/blog/?p=199 It says:> A `echo repair >/sys/block/md0/md/sync_action` followed by a `echo > check >/sys/block/md0/md/sync_action` seems to have corrected it. Now > `cat /sys/block/md0/md/mismatch_cnt` returns 0 ?Regards, Peter -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 260 bytes Desc: OpenPGP digital signature URL: <http://lists.centos.org/pipermail/centos/attachments/20100228/2c04a780/attachment.sig>
Peter Hinse wrote:> Am 28.02.2010 22:03, schrieb John R Pierce: > >> WARNING: mismatch_cnt is not 0 on >> > > Have a look at http://www.arrfab.net/blog/?p=199 > It says: > > >> A `echo repair >/sys/block/md0/md/sync_action` followed by a `echo >> check >/sys/block/md0/md/sync_action` seems to have corrected it. Now >> `cat /sys/block/md0/md/mismatch_cnt` returns 0 ? >>Thanks. I was trying to figure out how from the mdadm commands (UGH!) to do a scan. # cat /sys/block/md10/md/mismatch_cnt 8448 # cat /sys/block/md11/md/mismatch_cnt 7296 fugly. Since the mirrors aren't checksummed, can i assume this means there's likely some data messups here? Anyways, the repair is running on both md10 and md11, i'll check back with my final results...