Paul R. Ganci
2006-Jan-04 01:34 UTC
[CentOS] Centos locking up system with mptscsih driver error
I have a Tyan Tiger S2466 MPX motherboard with dual Athlon MP 2800+ CPUs and 1GB of PC2100 DDR SDRAM. For disk storage I have an LSI53C1030 SCSI controller and four Seagate ST336607LW drives in a software RAID 5 configuration. I installed CentOS 4.1 and everything was fine running kernel-smp-2.6.9-11.EL. However, since I updated to CentOS 4.2 I have run into problems: after some amount of disk traffic the system locks up completely. On the console I get message after message from the MPT Fusion SCSI driver mptscsih indicating that there was a failure and that an "ABORT was successful". Unfortunately I don't have the exact message, but I have tracked the problem to kernel-smp-2.6.9-22.0.1.EL. Since I updated from 4.1 I still had the smp-2.6.9-11 kernel around, and as long as I boot into kernel-smp-2.6.9-11.EL with all other 4.2 updates installed, everything is stable. I can reliably get the mptscsih driver to fail after a few minutes of uptime (or sooner if doing disk writes) when booted into kernel-smp-2.6.9-22.0.1.EL.

I checked the CentOS archives and did not find anything related to this SCSI card or motherboard. Before I get lambasted for not having the exact SCSI error message: yes, I am willing to boot into kernel-smp-2.6.9-22.0.1.EL to get it, even though my RAID partition has to be rebuilt afterwards. I was wondering if anyone else has had problems with this hardware/driver and/or kernel-smp-2.6.9-22.0.1.EL. If this is a new mptscsih problem I will post more details of my system, but I thought I would start with just a general question in case I missed something.

--
Paul (ganci at nurdog.com)
Johnny Hughes
2006-Jan-04 01:41 UTC
[CentOS] Centos locking up system with mptscsih driver error
On Tue, 2006-01-03 at 18:34 -0700, Paul R. Ganci wrote:
> I was wondering if anyone else
> has had problems with this hardware/driver and/or
> kernel-smp-2.6.9-22.0.1.EL. If this is a new mptscsih problem I will
> post more details of my system but I thought I would start with just a
> general question in case I missed something.

I have not seen this particular problem. Do you want to try the new 2.6.9-27.EL kernel that was released as part of EL4-u3beta?

Also, verify that you have the latest BIOS updates for your motherboard.
Craig White
2006-Jan-04 01:43 UTC
[CentOS] Centos locking up system with mptscsih driver error
On Tue, 2006-01-03 at 18:34 -0700, Paul R. Ganci wrote:
> I have a Tyan Tiger S2466 MPX motherboard with Dual Atlon MP 2800+ CPUs
> and 1GB PC2100 DDR SDRAM. For disk drive I have an LSI53C1030 and 4
> Seagate ST336607LWs in a software raid 5 configuration.

firmware up to date?

Craig
Rodrigo Barbosa
2006-Jan-04 02:05 UTC
2.6.9-27.EL (Was: [CentOS] Centos locking up system with mptscsih driver error)
On Tue, Jan 03, 2006 at 07:41:36PM -0600, Johnny Hughes wrote:
> I have not seen this particular problem ... do you want to try the new
> 2.6.9-27.EL kernel that was released as part of EL4-u3beta

Do you have any idea if UDF write support was backported into it?

--
Rodrigo Barbosa <rodrigob at suespammers.org>
"Quid quid Latine dictum sit, altum viditur"
"Be excellent to each other ..." - Bill & Ted (Wyld Stallyns)
Paul R. Ganci
2006-Jan-04 02:23 UTC
[CentOS] Centos locking up system with mptscsih driver error
Johnny Hughes wrote:
> On Tue, 2006-01-03 at 18:34 -0700, Paul R. Ganci wrote:
>> I was wondering if anyone else
>> has had problems with this hardware/driver and/or
>> kernel-smp-2.6.9-22.0.1.EL. If this is a new mptscsih problem I will
>> post more details of my system but I thought I would start with just a
>> general question in case I missed something.
>
> I have not seen this particular problem ... do you want to try the new
> 2.6.9-27.EL kernel that was released as part of EL4-u3beta
>
> Also, verify you have the latest BIOS updates from you motherboard.

Alas, I have the original BIOS installed. I checked, and it does appear that BIOS updates are available. I always hate the idea of flashing a ROM given the implications of something going wrong. However, I was just looking at /var/log/dmesg and am seeing things like:

    mtrr: v2.0 (20020519)
    mtrr: your CPUs had inconsistent fixed MTRR settings
    mtrr: probably your BIOS does not setup all CPUs.
    mtrr: corrected configuration.
    BIOS failed to enable PCI standards compliance, fixing this error.

so perhaps I have a root cause to address. I may just try the beta kernel anyhow ... if it doesn't work I will be able to capture the actual mptscsih error message. :(

--
Paul (ganci at nurdog.com)
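For anyone wanting to repeat the /var/log/dmesg check described above, here is a minimal sketch in Python. It is only an illustration: the function names `find_suspect_lines` and `scan_file` and the pattern list are assumptions made up for this example, with the patterns drawn from the messages quoted in this thread (the mptscsih driver name, the mtrr warnings, and the "BIOS failed" line). It does not diagnose anything; it just pulls matching lines out of a saved boot log.

```python
import re

# Illustrative patterns only (an assumption for this sketch), based on the
# kernel messages quoted in this thread. Adjust them for your own system.
SUSPECT_PATTERNS = [
    re.compile(r"mptscsih", re.IGNORECASE),
    re.compile(r"mtrr: .*(inconsistent|does not setup)", re.IGNORECASE),
    re.compile(r"BIOS failed", re.IGNORECASE),
]


def find_suspect_lines(log_text):
    """Return (line_number, line) pairs for lines matching any pattern."""
    hits = []
    for number, line in enumerate(log_text.splitlines(), start=1):
        stripped = line.strip()
        if any(pattern.search(stripped) for pattern in SUSPECT_PATTERNS):
            hits.append((number, stripped))
    return hits


def scan_file(path="/var/log/dmesg"):
    """Read a saved kernel log (hypothetical default path) and scan it."""
    with open(path, errors="replace") as handle:
        return find_suspect_lines(handle.read())
```

Calling `scan_file()` on a machine where /var/log/dmesg exists returns the matching lines with their line numbers; `find_suspect_lines()` can also be fed text copied from the console after a failed boot.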