Willard Korfhage
2010-May-19 02:18 UTC
[zfs-discuss] scsi messages and mpt warning in log - harmless, or indicating a problem?
This afternoon, messages like the following started appearing in /var/adm/messages: May 18 13:46:37 fs8 scsi: [ID 365881 kern.info] /pci at 0,0/pci8086,29f1 at 1/pci15d9,a680 at 0 (mpt0): May 18 13:46:37 fs8 Log info 0x31080000 received for target 5. May 18 13:46:37 fs8 scsi_status=0x0, ioc_status=0x804b, scsi_state=0x1 May 18 13:46:38 fs8 scsi: [ID 365881 kern.info] /pci at 0,0/pci8086,29f1 at 1/pci15d9,a680 at 0 (mpt0): May 18 13:46:38 fs8 Log info 0x31080000 received for target 5. May 18 13:46:38 fs8 scsi_status=0x0, ioc_status=0x804b, scsi_state=0x0 May 18 13:46:40 fs8 scsi: [ID 365881 kern.info] /pci at 0,0/pci8086,29f1 at 1/pci15d9,a680 at 0 (mpt0): May 18 13:46:40 fs8 Log info 0x31080000 received for target 5. May 18 13:46:40 fs8 scsi_status=0x0, ioc_status=0x804b, scsi_state=0x0 The pool has no errors, so I don''t know if these represent a potential problem or not. During this time I was copying files from one fileset to another in the same pool, so it was fairly I/O intensive. Typically you get one every 1-5 seconds for 10 to 20 seconds, sometimes longer, and then it is quiet for many minutes before they occur again. Is this indicating a problem, or just a harmless message? I just kicked off a scrub on the pool as I was writing this, and I am seeing a lot of these messages. I see that zpool status shows c4t5d0 has 12.5K repaired already. The scrub has been in progress for just 6 minutes, and it says I have 170629h54m to go, and it gets longer every time I check the status. I ran a scrub on this a few weeks ago, and had no such problem. I also see two warnings earlier today: May 18 19:14:09 fs8 scsi: [ID 243001 kern.warning] WARNING: /pci at 0,0/pci8086,29f1 at 1/pci15d9,a680 at 0 (mpt0): May 18 19:14:09 fs8 mpt_handle_event_sync: IOCStatus=0x8000, IOCLogInfo=0x31110900 May 18 19:14:09 fs8 scsi: [ID 243001 kern.warning] WARNING: /pci at 0,0/pci8086,29f1 at 1/pci15d9,a680 at 0 (mpt0): May 18 19:14:09 fs8 mpt_handle_event: IOCStatus=0x8000, IOCLogInfo=0x31110900 and two more of these 1 minute and 10 seconds later. So, is my system in trouble or not? Particulars of my system: % uname -a SunOS fs8 5.11 snv_134 i86pc i386 i86pc The hardware is an Asus server motherboard carrying 4GB of ECC memory and a current Xeon CPU, and a SuperMicro AOC-USASLP-L8I card (it uses the 1068E) with 8 Samsung Spinpoint F3EG HD203WI 2TB disks attached. -- This message posted from opensolaris.org
Carson Gaspar
2010-May-19 07:40 UTC
[zfs-discuss] scsi messages and mpt warning in log - harmless, or indicating a problem?
Willard Korfhage wrote:> This afternoon, messages like the following started appearing in /var/adm/messages: > > May 18 13:46:37 fs8 scsi: [ID 365881 kern.info] /pci at 0,0/pci8086,29f1 at 1/pci15d9,a680 at 0 (mpt0): > May 18 13:46:37 fs8 Log info 0x31080000 received for target 5. > May 18 13:46:37 fs8 scsi_status=0x0, ioc_status=0x804b, scsi_state=0x1 > May 18 13:46:38 fs8 scsi: [ID 365881 kern.info] /pci at 0,0/pci8086,29f1 at 1/pci15d9,a680 at 0 (mpt0): > May 18 13:46:38 fs8 Log info 0x31080000 received for target 5. > May 18 13:46:38 fs8 scsi_status=0x0, ioc_status=0x804b, scsi_state=0x0 > May 18 13:46:40 fs8 scsi: [ID 365881 kern.info] /pci at 0,0/pci8086,29f1 at 1/pci15d9,a680 at 0 (mpt0): > May 18 13:46:40 fs8 Log info 0x31080000 received for target 5. > May 18 13:46:40 fs8 scsi_status=0x0, ioc_status=0x804b, scsi_state=0x0...> So, is my system in trouble or not? > > Particulars of my system: > > % uname -a > SunOS fs8 5.11 snv_134 i86pc i386 i86pcWelcome to the mpt driver / firmware / something bug! I forget if your symptoms were indicative of the card not liking the drives (Hitachis in particular, which I fixed by upgrading to larger Seagates) or an issue with MSI support (which I fixed by adding "set xpv_psm:xen_support_msi = -1" to /etc/system, but I was running a Xen enabled kernel). I suggest searching the list archives. -- Carson