I'm looking to do a bit more monitoring of my 3ware 9550 with smartd, and wanted to see what others were doing with smart for monitoring 3ware hardware. Do you have the smartd.conf configured to test, or simply monitor health status? Are you monitoring the drive as centos sees it (/dev/sdX) or are you using the 3ware /dev/twaX for monitoring? Opinions and discussions are welcome :-P -- During times of universal deceit, telling the truth becomes a revolutionary act. George Orwell
On Tue, 2009-02-10 at 21:42 -0500, Jim Perrin wrote:> I'm looking to do a bit more monitoring of my 3ware 9550 with smartd, > and wanted to see what others were doing with smart for monitoring > 3ware hardware. > > Do you have the smartd.conf configured to test, or simply monitor health status? > Are you monitoring the drive as centos sees it (/dev/sdX) or are you > using the 3ware /dev/twaX for monitoring? > > Opinions and discussions are welcome :-PThis is my smartd.conf for monitoring drives on a 9550SX: /dev/twa0 -d 3ware,0 -H -m root /dev/twa0 -d 3ware,1 -H -m root /dev/twa0 -d 3ware,2 -H -m root /dev/twa0 -d 3ware,3 -H -m root Using smartctl is similar: # smartctl -Hd 3ware,0 /dev/twa0 It's straightforward to do testing with smartctl, but the above -H/--health output gives you some warning that things aren't right before the drive fails, especially the later lines of output (e.g. Current_Pending_Sector, Offline_Uncorrectable). I run it as a weekly cron job. Steve
On Wednesday 11 February 2009, Jim Perrin wrote:> I'm looking to do a bit more monitoring of my 3ware 9550 with smartd, > and wanted to see what others were doing with smart for monitoring > 3ware hardware. > > Do you have the smartd.conf configured to test, or simply monitor health > status? Are you monitoring the drive as centos sees it (/dev/sdX) or are > you using the 3ware /dev/twaX for monitoring?I don't use smartd against them but I do run smartctl from time to time. I have had issues with using /dev/sdX and now only use /dev/twaX (but I can't really remember what bit me...). /Peter> Opinions and discussions are welcome :-P-------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part. URL: <http://lists.centos.org/pipermail/centos/attachments/20090211/24b80c1e/attachment.sig>
On Tue, 10 Feb 2009 at 9:42pm, Jim Perrin wrote> I'm looking to do a bit more monitoring of my 3ware 9550 with smartd, > and wanted to see what others were doing with smart for monitoring > 3ware hardware. > > Do you have the smartd.conf configured to test, or simply monitor health status? > Are you monitoring the drive as centos sees it (/dev/sdX) or are you > using the 3ware /dev/twaX for monitoring? > > Opinions and discussions are welcome :-PHave you thought about tying tw_cli into nagios? That's one of my round-tuit projects. I'm sure there are already plugins for it, and it seems like you may get better info. -- Joshua Baker-LePain QB3 Shared Cluster Sysadmin UCSF
Jim Perrin wrote:> I'm looking to do a bit more monitoring of my 3ware 9550 with smartd, > and wanted to see what others were doing with smart for monitoring > 3ware hardware. > > Do you have the smartd.conf configured to test, or simply monitor health status? > Are you monitoring the drive as centos sees it (/dev/sdX) or are you > using the 3ware /dev/twaX for monitoring? > > Opinions and discussions are welcome :-P > >I run smart tests weekly in a staggered fashion during off hours on my 3ware arrays. Like this: /dev/twa0 -d 3ware,0 -H -l selftest -l error -o on -S on -s (O/../../1/18|S/../../2/22|L/../../3/01) -m root /dev/twa0 -d 3ware,1 -H -l selftest -l error -o on -S on -s (O/../../2/18|S/../../3/22|L/../../4/01) -m root I've found that the smart monitors are pretty good about giving me at least some warning about imminent drive failures when I do this. -- Benjamin Franz