similar to: Tuning disk failure detection?

Displaying 20 results from an estimated 1000 matches similar to: "Tuning disk failure detection?"

2012 Jan 11
3
Unable to allocate dma memory for extra SGL
Hi all; We have a Solaris 10 U9 x86 instance running on Silicon Mechanics / SuperMicro hardware. Occasionally under high load (ZFS scrub for example), the box becomes non-responsive (it continues to respond to ping but nothing else works -- not even the local console). Our only solution is to hard reset after which everything comes up normally. Logs are showing the following: Jan 8
2012 May 30
11
Disk failure chokes all the disks attached to the failing disk HBA
Dear All, It may be this not the correct mailing list, but I''m having a ZFS issue when a disk is failing. The system is a supermicro motherboard X8DTH-6F in a 4U chassis (SC847E1-R1400LPB) and an external SAS2 JBOD (SC847E16-RJBOD1). It makes a system with a total of 4 backplanes (2x SAS + 2x SAS2) each of them connected to a 4 different HBA (2x LSI 3081E-R (1068 chip) + 2x LSI
2009 Jan 17
2
Comparison between the S-TEC Zeus and the Intel X25-E ??
I''m looking at the newly-orderable (via Sun) STEC Zeus SSDs, and they''re outrageously priced. http://www.stec-inc.com/product/zeusssd.php I just looked at the Intel X25-E series, and they look comparable in performance. At about 20% of the cost. http://www.intel.com/design/flash/nand/extreme/index.htm Can anyone enlighten me as to any possible difference between an STEC
2010 Jan 11
25
Is LSI SAS3081E-R suitable for a ZFS NAS ?
According to various posts the LSI SAS3081E-R seems to work well with OpenSolaris. But I''ve got pretty chilled-out from my recent problems with Areca-1680''s. Could anyone please confirm that the LSI SAS3081E-R works well ? Is hotplug supported ? Anything else I should know before buying one of these cards ? Thanks, Arnaud
2010 Jun 18
6
WD caviar/mpt issues
I know that this has been well-discussed already, but it''s been a few months - WD caviars with mpt/mpt_sas generating lots of retryable read errors, spitting out lots of beloved " Log info 31080000 received for target" messages, and just generally not working right. (SM 836EL1 and 836TQ chassis - though I have several variations on theme depending on date of purchase: 836EL2s,
2010 Mar 09
0
snv_133 mpt_sas driver
Hi all, Today a new message has been seen in my system and another freeze has happen to it. The message is : Mar 9 06:20:01 zfs01 failed to configure smp w50016360001e06bf Mar 9 06:20:01 zfs01 mpt: [ID 201859 kern.warning] WARNING: smp_start do passthru error 16 Mar 9 06:20:01 zfs01 scsi: [ID 243001 kern.warning] WARNING: /pci at 0,0/pci8086,3410 at 9/pci1000,3150 at 0 (mpt2): Mar 9
2008 Oct 08
5
Resilver hanging?
How can I diagnose why a resilver appears to be hanging at a certain percentage, seemingly doing nothing for quite a while, even though the HDD LED is lit up permanently (no apparent head seeking)? The drives in the pool are WD Raid Editions, thus have TLER and should time out on errors in just seconds. ZFS nor the syslog however were reporting any IO errors, so it weren''t the disks.
2010 Nov 06
10
Apparent SAS HBA failure-- now what?
My setup: A SuperMicro 24-drive chassis with Intel dual-processor motherboard, three LSI SAS3081E controllers, and 24 SATA 2TB hard drives, divided into three pools with each pool a single eight-disk RAID-Z2. (Boot is an SSD connected to motherboard SATA.) This morning I got a cheerful email from my monitoring script: "Zchecker has discovered a problem on bigdawg." The full output is
2011 Aug 16
2
solaris 10u8 hangs with message Disconnected command timeout for Target 0
Hi, My solaris storage hangs. I login to the console and there is messages[1] display on the console. I can''t login into the console and seems the IO is totally blocked. The system is solaris 10u8 on Dell R710 with disk array Dell MD3000. 2 HBA cable connect the server and MD3000. The symptom is random. It is very appreciated if any one can help me out. Regards, Ding [1] Aug 16
2011 Aug 11
19
Intel 320 as ZIL?
Are any of you using the Intel 320 as ZIL? It''s MLC based, but I understand its wear and performance characteristics can be bumped up significantly by increasing the overprovisioning to 20% (dropping usable capacity to 80%). Anyone have experience with this? Ray
2009 Mar 24
3
LSI Logic raid status
Hi, I have a LSI Logic sata/sas raid running, is there a way to see the state of the volume, like optimal, degraded or resyncing? I've tried several commands with camcontrol but I cant figure it out. -- Peter Ankerst?l peter@pean.org http://www.pean.org/
2008 May 20
4
awstats, webalizer or...
So what does everyone out there use to generate web statistics these days? Are the tried and true awstats or webalizer still the best out there? Ray
2010 May 05
3
Another MPT issue - kernel crash
Hi all, I have faced yet another kernel panic that seems to be related to mpt driver. This time i was trying to add a new disk to a running system (snv_134) and this new disk was not being detected...following a tip i ran the lsitool to reset the bus and this lead to a system panic. MPT driver : BAD TRAP: type=e (#pf Page fault) rp=ffffff001fc98020 addr=4 occurred in module "mpt" due
2006 Mar 14
3
Adaptec AIC9410
Can anyone tell me if/when the Adaptec AIC9410 SAS/SATA controller will be supported? Michael Grant
2006 Nov 07
4
Problems with LTO-3 and U320 on Centos 4.4
I am currently testing an Overland Tape storage unit with a LTO-3 drive on CentOS 4.4. After a random amount of time but usually when we have backed up about 0.5-1TB we get scsi errors which cause the backup to fail. I have rebuilt the box with Solaris 10 x86 and Windoze to prove the hardware and have successfully backed up over 3TB on each OS without error. Can anyone offer advice on how to find
2006 Mar 16
3
LSI Logic controller status
Hello, Recently we bought HP bl35 blades with LSI Logic SAS controllers. mpt* drivers works fine but I would like to somehow view status from command line. To monitor e.g. if one of the HDDs failed. I found mptutil on LSI Logic site but it just shows me configuration in quite cryptic form. Thanks, Mindaugas
2009 Jan 29
1
7.1, mpt and slow writes
Hello, I think this needs a few more eyes: http://lists.freebsd.org/pipermail/freebsd-scsi/2009-January/003782.html In short, writes are slow, likely do to the write-cache being enabled on the controller. The sysctl used in 6.x to turn the cache off don't seem to be in 7.x. Thanks, Charles ___ Charles Sprickman NetEng/SysAdmin Bway.net - New York's Best Internet - www.bway.net
2007 Jul 28
2
Any SELinux gurus around?
Hey all, I'm having some trouble getting PHP5 w/ CentOS 5 to connect to a remote MySQL server using the standard mysql_connect() call. Yes, MySQL libraries are installed along with php-mysql... The only way I can get it to work is to run setenforce Permissive, otherwise the connection fails. The annoying thing is that nothing at all shows up in my /var/log/messages file describing why
2010 Aug 24
7
SCSI write retry errors on ZIL SSD drives...
I posted a thread on this once long ago[1] -- but we''re still fighting with this problem and I wanted to throw it out here again. All of our hardware is from Silicon Mechanics (SuperMicro chassis and motherboards). Up until now, all of the hardware has had a single 24-disk expander / backplane -- but we recently got one of the new SC847-based models with 24 disks up front and 12 in the
2010 Feb 11
2
RAID, temperature and FAN status manage and monitoring tool for RHEL4 Update 4 ia64 AS Linux
Hi, Currently I've been using an RHEL4 Update 4 ia64 AS Linux version and running in HP rx6600 server. I tried to google-out the RAID, temperature and FAN status manage and monitoring tool for RHEL4 Update 4 ia64 AS Linux But i can't able to findout the same for RHEL4 Update 4 ia64 AS Linux server Can anyone suggest me the write status monitoring and manage tool for