Dear list, I have inherited a CentOS-5 box with a LSI Sata Raid controller. It is configured as a RAID5 w/hot-standby. My concern is that I have no present means of determining the RAID status w/o downing the system and going into the bios to get status. If a disk has failed and brought the standby disk online, I would like to know about it, the day it happens by seeing it in syslog or getting an e-mail message. The only solution I have found is on the LSI web-site. It is tool called "MegaMon for Linux". It most recent release was in 2005 and it requires that I use their driver. Just the package title gives me chills. It runs as a daemon and is almost constantly polling the h/w . I would much prefer a simple command line tool that I can wrap a script around and run it under cron. My questions are: 1. Does anyone know of a command line tool ? 2. Failing a positive response to #1, has anyone any experience with "MegaMon for Linux", good or otherwise ? Thanks to all, Ray The h/w is a dual Opteron dual-core system with lots of memory. The relevant dmesg lines and lspci output are shown below: dmesg: megaraid cmm: 2.20.2.7 (Release Date: Sun Jul 16 00:01:03 EST 2006) megaraid: 2.20.5.1 (Release Date: Thu Nov 16 15:32:35 EST 2006) megaraid: probe new device 0x1000:0x0409:0x1000:0x3008: bus 3:slot 14:func 0 GSI 17 sharing vector 0xA8 and IRQ 17 ACPI: PCI Interrupt 0000:03:0e.0[C] -> GSI 18 (level, low) -> IRQ 17 megaraid: fw version:[814B] bios version:[H431] scsi5 : LSI Logic MegaRAID driver scsi[5]: scanning scsi channel 0 [Phy 0] for non-raid devices scsi[5]: scanning scsi channel 1 [virtual] for logical drives Vendor: MegaRAID Model: LD 0 RAID5 1144G Rev: 814B Type: Direct-Access ANSI SCSI revision: 02 *SCSI device sdc: 2343745536 512-byte hdwr sectors (1199998 MB) *sdc: Write Protect is off *sdc: Mode Sense: 00 00 00 00 *sdc: asking for cache data failed *sdc: assuming drive cache: write through *SCSI device sdc: 2343745536 512-byte hdwr sectors (1199998 MB) *sdc: Write Protect is off *sdc: Mode Sense: 00 00 00 00 *sdc: asking for cache data failed *sdc: assuming drive cache: write through * sdc: sdc1 sdc2 sdc3 *sd 5:1:0:0: Attached scsi disk sdc lspci -v 03:0e.0 RAID bus controller: LSI Logic / Symbios Logic MegaRAID (rev 0a) Subsystem: LSI Logic / Symbios Logic MegaRAID SATA 300-8X RAID Controller Flags: bus master, stepping, 66MHz, medium devsel, latency 64, IRQ 17 Memory at cfcf0000 (32-bit, prefetchable) [size=64K] Memory at fe500000 (32-bit, non-prefetchable) [size=1M] Expansion ROM at fe4e0000 [disabled] [size=128K] Capabilities: [c0] Power Management version 2 Capabilities: [d0] Message Signalled Interrupts: 64bit+ Queue=0/1 Enable- Capabilities: [e0] PCI-X non-bridge device
http://www.monitoringexchange.org/cgi-bin/page.cgi?g=2416.html;d=1 google has more. On Wednesday, July 29, 2009, Raymond Lillard <rlillard at sonic.net> wrote:> > Dear list, > > I have inherited a CentOS-5 box with a LSI Sata Raid controller. > It is configured as a RAID5 w/hot-standby. ?My concern is that > I have no present means of determining the RAID status w/o > downing the system and going into the bios to get status. ?If > a disk has failed and brought the standby disk online, I would > like to know about it, the day it happens by seeing it in syslog > or getting an e-mail message. > > The only solution I have found is on the LSI web-site. ?It is > tool called "MegaMon for Linux". ?It most recent release was > in 2005 and it requires that I use their driver. ?Just the > package title gives me chills. ?It runs as a daemon and is > almost constantly polling the h/w . > > I would much prefer a simple command line tool that I can > wrap a script around and run it under cron. > > My questions are: > > 1. ?Does anyone know of a command line tool ? > > 2. ?Failing a positive response to #1, has anyone any experience > ? ?with "MegaMon for Linux", good or otherwise ? > > Thanks to all, > Ray > > > The h/w is a dual Opteron dual-core system with lots of memory. > The relevant dmesg lines and lspci output are shown below: > > dmesg: > > ?megaraid cmm: 2.20.2.7 (Release Date: Sun Jul 16 00:01:03 EST 2006) > ?megaraid: 2.20.5.1 (Release Date: Thu Nov 16 15:32:35 EST 2006) > ?megaraid: probe new device 0x1000:0x0409:0x1000:0x3008: bus 3:slot 14:func 0 > ?GSI 17 sharing vector 0xA8 and IRQ 17 > ?ACPI: PCI Interrupt 0000:03:0e.0[C] -> GSI 18 (level, low) -> IRQ 17 > ?megaraid: fw version:[814B] bios version:[H431] > ?scsi5 : LSI Logic MegaRAID driver > ?scsi[5]: scanning scsi channel 0 [Phy 0] for non-raid devices > ?scsi[5]: scanning scsi channel 1 [virtual] for logical drives > ? Vendor: MegaRAID ?Model: LD 0 RAID5 1144G ?Rev: 814B > ? Type: ? Direct-Access ? ? ? ? ? ? ? ? ? ? ?ANSI SCSI revision: 02 > *SCSI device sdc: 2343745536 512-byte hdwr sectors (1199998 MB) > *sdc: Write Protect is off > *sdc: Mode Sense: 00 00 00 00 > *sdc: asking for cache data failed > *sdc: assuming drive cache: write through > *SCSI device sdc: 2343745536 512-byte hdwr sectors (1199998 MB) > *sdc: Write Protect is off > *sdc: Mode Sense: 00 00 00 00 > *sdc: asking for cache data failed > *sdc: assuming drive cache: write through > * sdc: sdc1 sdc2 sdc3 > *sd 5:1:0:0: Attached scsi disk sdc > > > lspci -v > > ?03:0e.0 RAID bus controller: LSI Logic / Symbios Logic MegaRAID (rev 0a) > ? ? Subsystem: LSI Logic / Symbios Logic MegaRAID SATA 300-8X RAID Controller > ? ? Flags: bus master, stepping, 66MHz, medium devsel, latency 64, IRQ 17 > ? ? Memory at cfcf0000 (32-bit, prefetchable) [size=64K] > ? ? Memory at fe500000 (32-bit, non-prefetchable) [size=1M] > ? ? Expansion ROM at fe4e0000 [disabled] [size=128K] > ? ? Capabilities: [c0] Power Management version 2 > ? ? Capabilities: [d0] Message Signalled Interrupts: 64bit+ Queue=0/1 Enable- > ? ? Capabilities: [e0] PCI-X non-bridge device > _______________________________________________ > CentOS mailing list > CentOS at centos.org > http://lists.centos.org/mailman/listinfo/centos >-- Regards, Eugene Vilensky evilensky at gmail.com
>I would much prefer a simple command line tool that I can >wrap a script around and run it under cron.I did this when I didn't have a Nagios environment or an snmp setup (LSI provides an SNMP monitoring package for it). Simply use the MegaCLI to query for status of all LD's and grep for failure modes.
Thanks to both of you who replied. Mr. Vilensky's approach seemed easiest and his google powers exceed mine. The link he provided was the key. http://www.monitoringexchange.org/cgi-bin/page.cgi?g=2416.html;d=1 The only issue for me was that the megarc URL has been changed. I found the package at: http://www.lsi.com/DistributionSystem/AssetDocument/files/support/rsa/utilities/megaconf/ut_linux_megarc_1.11.zip Problem solved. Best to all, Ray Raymond Lillard wrote:> Dear list, > > I have inherited a CentOS-5 box with a LSI Sata Raid controller. > It is configured as a RAID5 w/hot-standby. My concern is that > I have no present means of determining the RAID status w/o > downing the system and going into the bios to get status. If > a disk has failed and brought the standby disk online, I would > like to know about it, the day it happens by seeing it in syslog > or getting an e-mail message. > > The only solution I have found is on the LSI web-site. It is > tool called "MegaMon for Linux". It most recent release was > in 2005 and it requires that I use their driver. Just the > package title gives me chills. It runs as a daemon and is > almost constantly polling the h/w . > > I would much prefer a simple command line tool that I can > wrap a script around and run it under cron. > > My questions are: > > 1. Does anyone know of a command line tool ? > > 2. Failing a positive response to #1, has anyone any experience > with "MegaMon for Linux", good or otherwise ? > > Thanks to all, > Ray > > > The h/w is a dual Opteron dual-core system with lots of memory. > The relevant dmesg lines and lspci output are shown below: > > dmesg: > > megaraid cmm: 2.20.2.7 (Release Date: Sun Jul 16 00:01:03 EST 2006) > megaraid: 2.20.5.1 (Release Date: Thu Nov 16 15:32:35 EST 2006) > megaraid: probe new device 0x1000:0x0409:0x1000:0x3008: bus 3:slot 14:func 0 > GSI 17 sharing vector 0xA8 and IRQ 17 > ACPI: PCI Interrupt 0000:03:0e.0[C] -> GSI 18 (level, low) -> IRQ 17 > megaraid: fw version:[814B] bios version:[H431] > scsi5 : LSI Logic MegaRAID driver > scsi[5]: scanning scsi channel 0 [Phy 0] for non-raid devices > scsi[5]: scanning scsi channel 1 [virtual] for logical drives > Vendor: MegaRAID Model: LD 0 RAID5 1144G Rev: 814B > Type: Direct-Access ANSI SCSI revision: 02 > *SCSI device sdc: 2343745536 512-byte hdwr sectors (1199998 MB) > *sdc: Write Protect is off > *sdc: Mode Sense: 00 00 00 00 > *sdc: asking for cache data failed > *sdc: assuming drive cache: write through > *SCSI device sdc: 2343745536 512-byte hdwr sectors (1199998 MB) > *sdc: Write Protect is off > *sdc: Mode Sense: 00 00 00 00 > *sdc: asking for cache data failed > *sdc: assuming drive cache: write through > * sdc: sdc1 sdc2 sdc3 > *sd 5:1:0:0: Attached scsi disk sdc > > > lspci -v > > 03:0e.0 RAID bus controller: LSI Logic / Symbios Logic MegaRAID (rev 0a) > Subsystem: LSI Logic / Symbios Logic MegaRAID SATA 300-8X RAID Controller > Flags: bus master, stepping, 66MHz, medium devsel, latency 64, IRQ 17 > Memory at cfcf0000 (32-bit, prefetchable) [size=64K] > Memory at fe500000 (32-bit, non-prefetchable) [size=1M] > Expansion ROM at fe4e0000 [disabled] [size=128K] > Capabilities: [c0] Power Management version 2 > Capabilities: [d0] Message Signalled Interrupts: 64bit+ Queue=0/1 Enable- > Capabilities: [e0] PCI-X non-bridge device > _______________________________________________ > CentOS mailing list > CentOS at centos.org > http://lists.centos.org/mailman/listinfo/centos
On Wed, Jul 29, 2009 at 11:17 PM, Raymond Lillard<rlillard at sonic.net> wrote:> > Dear list, > > I have inherited a CentOS-5 box with a LSI Sata Raid controller. > It is configured as a RAID5 w/hot-standby. ?My concern is that > I have no present means of determining the RAID status w/o > downing the system and going into the bios to get status. ?If > a disk has failed and brought the standby disk online, I would > like to know about it, the day it happens by seeing it in syslog > or getting an e-mail message. > > The only solution I have found is on the LSI web-site. ?It is > tool called "MegaMon for Linux". ?It most recent release was > in 2005 and it requires that I use their driver. ?Just the > package title gives me chills. ?It runs as a daemon and is > almost constantly polling the h/w . > > I would much prefer a simple command line tool that I can > wrap a script around and run it under cron. > > My questions are: > > 1. ?Does anyone know of a command line tool ?I've had pretty good luck with mpt-status: http://prefetch.net/blog/index.php/2009/04/18/compiling-mpt-status-on-centos-hosts/ http://prefetch.net/blog/index.php/2007/02/03/monitoring-lsi-logic-raid-controllers/> 2. ?Failing a positive response to #1, has anyone any experience > ? ?with "MegaMon for Linux", good or otherwise ?I haven't had a chance to look at MegaMon, but would love to hear your thoughts if you get a chance to play with it. - Ryan -- http://prefetch.net