this has never happened to me before, and I'm somewhat at a loss.  got a 
email from the cron thing...
    /etc/cron.weekly/99-raid-check:
    WARNING: mismatch_cnt is not 0 on /dev/md10
    WARNING: mismatch_cnt is not 0 on /dev/md11
ok, md10 and md11 are each raid1's made from 2 x 72GB scsi drives, on a 
dell 2850 or something dual single-core 3ghz server.
these two md's are in turn a striped LVM volume group
dmesg shows....
    md: syncing RAID array md10
    md: minimum _guaranteed_ reconstruction speed: 1000 KB/sec/disc.
    md: using maximum available idle IO bandwidth (but not more than 
200000 KB/sec) for reconstruction.
    md: using 128k window, over a total of 143374656 blocks.
    md: syncing RAID array md11
    md: minimum _guaranteed_ reconstruction speed: 1000 KB/sec/disc.
    md: using maximum available idle IO bandwidth (but not more than 
200000 KB/sec) for reconstruction.
    md: using 128k window, over a total of 143374656 blocks.
    md: md10: sync done.
    RAID1 conf printout:
     --- wd:2 rd:2
     disk 0, wo:0, o:1, dev:sdc1
     disk 1, wo:0, o:1, dev:sdd1
    md: md11: sync done.
    RAID1 conf printout:
     --- wd:2 rd:2
     disk 0, wo:0, o:1, dev:sde1
     disk 1, wo:0, o:1, dev:sdf1
I'm not sure what thats telling me.  the last thing prior to this in 
dmesg was when I added a swap to this vg last week.
and mdadm --detail shows...
# mdadm --detail /dev/md10
/dev/md10:
        Version : 0.90
  Creation Time : Wed Oct  8 12:54:48 2008
     Raid Level : raid1
     Array Size : 143374656 (136.73 GiB 146.82 GB)
  Used Dev Size : 143374656 (136.73 GiB 146.82 GB)
   Raid Devices : 2
  Total Devices : 2
Preferred Minor : 10
    Persistence : Superblock is persistent
    Update Time : Sun Feb 28 04:53:29 2010
          State : clean
 Active Devices : 2
Working Devices : 2
 Failed Devices : 0
  Spare Devices : 0
           UUID : b6da4dc5:c7372d6e:63f32b9c:49fa95f9
         Events : 0.84
    Number   Major   Minor   RaidDevice State
       0       8       33        0      active sync   /dev/sdc1
       1       8       49        1      active sync   /dev/sdd1
# mdadm --detail /dev/md11
/dev/md11:
        Version : 0.90
  Creation Time : Wed Oct  8 12:54:57 2008
     Raid Level : raid1
     Array Size : 143374656 (136.73 GiB 146.82 GB)
  Used Dev Size : 143374656 (136.73 GiB 146.82 GB)
   Raid Devices : 2
  Total Devices : 2
Preferred Minor : 11
    Persistence : Superblock is persistent
    Update Time : Sun Feb 28 11:49:45 2010
          State : clean
 Active Devices : 2
Working Devices : 2
 Failed Devices : 0
  Spare Devices : 0
           UUID : be475cd9:b98ee3ff:d18e668c:a5a6e06b
         Events : 0.62
    Number   Major   Minor   RaidDevice State
       0       8       65        0      active sync   /dev/sde1
       1       8       81        1      active sync   /dev/sdf1
I don't see anything wrong here ?
lvm shows no problems I detect either...
# vgdisplay vg1
  Volume group "vgdisplay" not found
  LV             VG   Attr   LSize  Origin Snap%  Move Log Copy%  Convert
  glassfish      vg1  -wi-ao 10.00G                                     
  lv1            vg1  -wi-ao 97.66G                                     
  oradata        vg1  -wi-ao 30.00G                                     
  pgdata         vg1  -wi-ao 25.00G                                     
  pgdata_lss_idx vg1  -wi-ao 20.00G                                     
  pgdata_lss_tab vg1  -wi-ao 20.00G                                     
  swapper        vg1  -wi-ao  3.00G                                     
  vmware         vg1  -wi-ao 50.00G             
# pvdisplay /dev/md10 /dev/md11
  --- Physical volume ---
  PV Name               /dev/md10
  VG Name               vg1
  PV Size               136.73 GB / not usable 2.31 MB
  Allocatable           yes
  PE Size (KByte)       4096
  Total PE              35003
  Free PE               1998
  Allocated PE          33005
  PV UUID               oAgJY7-Tmf7-ac35-KoUH-15uz-Q5Ae-bmFCys
  
  --- Physical volume ---
  PV Name               /dev/md11
  VG Name               vg1
  PV Size               136.73 GB / not usable 2.31 MB
  Allocatable           yes
  PE Size (KByte)       4096
  Total PE              35003
  Free PE               2560
  Allocated PE          32443
  PV UUID               A4Qb3P-j5Lr-8ZEv-FjbC-Iczm-QkC8-bqP0zv
2010/2/28 John R Pierce <pierce at hogranch.com>:> this has never happened to me before, and I'm somewhat at a loss. ?got a > email from the cron thing... > > ? ?/etc/cron.weekly/99-raid-check: > > ? ?WARNING: mismatch_cnt is not 0 on /dev/md10 > ? ?WARNING: mismatch_cnt is not 0 on /dev/md11 > > > ok, md10 and md11 are each raid1's made from 2 x 72GB scsi drives, on a > dell 2850 or something dual single-core 3ghz server. > > these two md's are in turn a striped LVM volume group > > dmesg shows.... > > ? ?md: syncing RAID array md10 > ? ?md: minimum _guaranteed_ reconstruction speed: 1000 KB/sec/disc. > ? ?md: using maximum available idle IO bandwidth (but not more than > 200000 KB/sec) for reconstruction. > ? ?md: using 128k window, over a total of 143374656 blocks. > ? ?md: syncing RAID array md11 > ? ?md: minimum _guaranteed_ reconstruction speed: 1000 KB/sec/disc. > ? ?md: using maximum available idle IO bandwidth (but not more than > 200000 KB/sec) for reconstruction. > ? ?md: using 128k window, over a total of 143374656 blocks. > ? ?md: md10: sync done. > ? ?RAID1 conf printout: > ? ? --- wd:2 rd:2 > ? ? disk 0, wo:0, o:1, dev:sdc1 > ? ? disk 1, wo:0, o:1, dev:sdd1 > ? ?md: md11: sync done. > ? ?RAID1 conf printout: > ? ? --- wd:2 rd:2 > ? ? disk 0, wo:0, o:1, dev:sde1 > ? ? disk 1, wo:0, o:1, dev:sdf1 > > I'm not sure what thats telling me. ?the last thing prior to this in > dmesg was when I added a swap to this vg last week. > > > and mdadm --detail shows... > > # mdadm --detail /dev/md10 > /dev/md10: > ? ? ? ?Version : 0.90 > ?Creation Time : Wed Oct ?8 12:54:48 2008 > ? ? Raid Level : raid1 > ? ? Array Size : 143374656 (136.73 GiB 146.82 GB) > ?Used Dev Size : 143374656 (136.73 GiB 146.82 GB) > ? Raid Devices : 2 > ?Total Devices : 2 > Preferred Minor : 10 > ? ?Persistence : Superblock is persistent > > ? ?Update Time : Sun Feb 28 04:53:29 2010 > ? ? ? ? ?State : clean > ?Active Devices : 2 > Working Devices : 2 > ?Failed Devices : 0 > ?Spare Devices : 0 > > ? ? ? ? ? UUID : b6da4dc5:c7372d6e:63f32b9c:49fa95f9 > ? ? ? ? Events : 0.84 > > ? ?Number ? Major ? Minor ? RaidDevice State > ? ? ? 0 ? ? ? 8 ? ? ? 33 ? ? ? ?0 ? ? ?active sync ? /dev/sdc1 > ? ? ? 1 ? ? ? 8 ? ? ? 49 ? ? ? ?1 ? ? ?active sync ? /dev/sdd1 > # mdadm --detail /dev/md11 > /dev/md11: > ? ? ? ?Version : 0.90 > ?Creation Time : Wed Oct ?8 12:54:57 2008 > ? ? Raid Level : raid1 > ? ? Array Size : 143374656 (136.73 GiB 146.82 GB) > ?Used Dev Size : 143374656 (136.73 GiB 146.82 GB) > ? Raid Devices : 2 > ?Total Devices : 2 > Preferred Minor : 11 > ? ?Persistence : Superblock is persistent > > ? ?Update Time : Sun Feb 28 11:49:45 2010 > ? ? ? ? ?State : clean > ?Active Devices : 2 > Working Devices : 2 > ?Failed Devices : 0 > ?Spare Devices : 0 > > ? ? ? ? ? UUID : be475cd9:b98ee3ff:d18e668c:a5a6e06b > ? ? ? ? Events : 0.62 > > ? ?Number ? Major ? Minor ? RaidDevice State > ? ? ? 0 ? ? ? 8 ? ? ? 65 ? ? ? ?0 ? ? ?active sync ? /dev/sde1 > ? ? ? 1 ? ? ? 8 ? ? ? 81 ? ? ? ?1 ? ? ?active sync ? /dev/sdf1 > > > > I don't see anything wrong here ? > > lvm shows no problems I detect either... > > # vgdisplay vg1 > ?Volume group "vgdisplay" not found > ?LV ? ? ? ? ? ? VG ? Attr ? LSize ?Origin Snap% ?Move Log Copy% ?Convert > ?glassfish ? ? ?vg1 ?-wi-ao 10.00G > ?lv1 ? ? ? ? ? ?vg1 ?-wi-ao 97.66G > ?oradata ? ? ? ?vg1 ?-wi-ao 30.00G > ?pgdata ? ? ? ? vg1 ?-wi-ao 25.00G > ?pgdata_lss_idx vg1 ?-wi-ao 20.00G > ?pgdata_lss_tab vg1 ?-wi-ao 20.00G > ?swapper ? ? ? ?vg1 ?-wi-ao ?3.00G > ?vmware ? ? ? ? vg1 ?-wi-ao 50.00G > > > # pvdisplay /dev/md10 /dev/md11 > ?--- Physical volume --- > ?PV Name ? ? ? ? ? ? ? /dev/md10 > ?VG Name ? ? ? ? ? ? ? vg1 > ?PV Size ? ? ? ? ? ? ? 136.73 GB / not usable 2.31 MB > ?Allocatable ? ? ? ? ? yes > ?PE Size (KByte) ? ? ? 4096 > ?Total PE ? ? ? ? ? ? ?35003 > ?Free PE ? ? ? ? ? ? ? 1998 > ?Allocated PE ? ? ? ? ?33005 > ?PV UUID ? ? ? ? ? ? ? oAgJY7-Tmf7-ac35-KoUH-15uz-Q5Ae-bmFCys > > ?--- Physical volume --- > ?PV Name ? ? ? ? ? ? ? /dev/md11 > ?VG Name ? ? ? ? ? ? ? vg1 > ?PV Size ? ? ? ? ? ? ? 136.73 GB / not usable 2.31 MB > ?Allocatable ? ? ? ? ? yes > ?PE Size (KByte) ? ? ? 4096 > ?Total PE ? ? ? ? ? ? ?35003 > ?Free PE ? ? ? ? ? ? ? 2560 > ?Allocated PE ? ? ? ? ?32443 > ?PV UUID ? ? ? ? ? ? ? A4Qb3P-j5Lr-8ZEv-FjbC-Iczm-QkC8-bqP0zvmaybe?this?helps:?http://www.arrfab.net/blog/?p=199 -- Eero
Am 28.02.2010 22:03, schrieb John R Pierce:> WARNING: mismatch_cnt is not 0 onHave a look at http://www.arrfab.net/blog/?p=199 It says:> A `echo repair >/sys/block/md0/md/sync_action` followed by a `echo > check >/sys/block/md0/md/sync_action` seems to have corrected it. Now > `cat /sys/block/md0/md/mismatch_cnt` returns 0 ?Regards, Peter -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 260 bytes Desc: OpenPGP digital signature URL: <http://lists.centos.org/pipermail/centos/attachments/20100228/2c04a780/attachment.sig>
Peter Hinse wrote:> Am 28.02.2010 22:03, schrieb John R Pierce: > >> WARNING: mismatch_cnt is not 0 on >> > > Have a look at http://www.arrfab.net/blog/?p=199 > It says: > > >> A `echo repair >/sys/block/md0/md/sync_action` followed by a `echo >> check >/sys/block/md0/md/sync_action` seems to have corrected it. Now >> `cat /sys/block/md0/md/mismatch_cnt` returns 0 ? >>Thanks. I was trying to figure out how from the mdadm commands (UGH!) to do a scan. # cat /sys/block/md10/md/mismatch_cnt 8448 # cat /sys/block/md11/md/mismatch_cnt 7296 fugly. Since the mirrors aren't checksummed, can i assume this means there's likely some data messups here? Anyways, the repair is running on both md10 and md11, i'll check back with my final results...