I have some systems which I plan to upgrade from 8.4 to 10.3 once 10.3 is released. In the meantime, I'm testing 10.3-BETA2 and have found what appears to be a regression in the MPT driver. The system is a Dell PowerEdge R300 with a Dell SAS6 controller: mpt0 at pci0:5:0:0: class=0x010000 card=0x1f0e1028 chip=0x00581000 rev=0x08 hdr=0x00 vendor = 'LSI Logic / Symbios Logic' device = 'SAS1068E PCI-Express Fusion-MPT SAS' class = mass storage subclass = SCSI Both the system BIOS and the SAS6 firmware are at the latest revisions from Dell (which haven't changed in years). On the 8.4 system, "grep mpt /var/run/dmesg.boot" reports: mpt0: <LSILogic SAS/SATA Adapter> port 0xec00-0xecff mem 0xdfcec000-0xdfceffff,0xdfcf0000-0xdfcfffff irq 16 at device 0.0 on pci5 mpt0: [ITHREAD] mpt0: MPI Version=1.5.18.0 mpt0: Capabilities: ( RAID-0 RAID-1E RAID-1 ) mpt0: 1 Active Volume (2 Max) mpt0: 2 Hidden Drive Members (14 Max) mpt0:vol0(mpt0:0:0): Settings ( Hot-Plug-Spares High-Priority-ReSync ) mpt0:vol0(mpt0:0:0): Using Spare Pool: 0 mpt0:vol0(mpt0:0:0): 2 Members: (mpt0:1:9:0): Primary Online (mpt0:1:1:0): Secondary Online mpt0:vol0(mpt0:0:0): RAID-1 - Optimal mpt0:vol0(mpt0:0:0): Status ( Enabled ) (mpt0:vol0:1): Physical (mpt0:0:1:0), Pass-thru (mpt0:1:0:0) (mpt0:vol0:1): Online (mpt0:vol0:0): Physical (mpt0:0:9:0), Pass-thru (mpt0:1:1:0) (mpt0:vol0:0): Online (probe0:mpt0:0:0:0): REPORT LUNS. CDB: a0 00 00 00 00 00 00 00 00 10 00 00 (probe0:mpt0:0:0:0): CAM status: SCSI Status Error (probe0:mpt0:0:0:0): SCSI status: Check Condition (probe0:mpt0:0:0:0): SCSI sense: ILLEGAL REQUEST info?:39000000 asc:0,0 (No additional sense information) ses0 at mpt0 bus 0 scbus0 target 8 lun 0 pass2 at mpt0 bus 1 scbus1 target 0 lun 0 da0 at mpt0 bus 0 scbus0 target 0 lun 0 [I'm not sure what that ILLEGAL REQUEST is about.] On the same system, running 10.3-BETA2 r295785, I see: mpt0: <LSILogic SAS/SATA Adapter> port 0xec00-0xecff mem 0xdfcec000-0xdfceffff,0xdfcf0000-0xdfcfffff irq 16 at device 0.0 on pci5 mpt0: MPI Version=1.5.18.0 mpt0: Capabilities: ( RAID-0 RAID-1E RAID-1 ) mpt0: 1 Active Volume (2 Max) mpt0: 2 Hidden Drive Members (14 Max) mpt0:vol0(mpt0:0:0): Settings ( Hot-Plug-Spares High-Priority-ReSync ) mpt0:vol0(mpt0:0:0): Using Spare Pool: mpt0:vol0(mpt0:0:0): 2 Members: (mpt0:1:9:0): Primary Online (mpt0:1:1:0): Secondary Online mpt0:vol0(mpt0:0:0): RAID-1 - Optimal mpt0:vol0(mpt0:0:0): Status ( Enabled ) (mpt0:vol0:1): Physical (mpt0:0:1:0), Pass-thru (mpt0:1:0:0) (mpt0:vol0:1): Online (mpt0:vol0:0): Physical (mpt0:0:9:0), Pass-thru (mpt0:1:1:0) (mpt0:vol0:0): Online (probe64:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 (probe64:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error (probe64:mpt0:1:1:0): Retrying command (probe64:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 (probe64:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error (probe64:mpt0:1:1:0): Retrying command (probe0:mpt0:0:0:0): REPORT LUNS. CDB: a0 00 00 00 00 00 00 00 00 10 00 00 (probe0:mpt0:0:0:0): CAM status: SCSI Status Error (probe0:mpt0:0:0:0): SCSI status: Check Condition (probe0:mpt0:0:0:0): SCSI sense: ILLEGAL REQUEST asc:ffffffff,ffffffff (Reserved ASC/ASCQ pair) (probe0:mpt0:0:0:0): Error 22, Unretryable error (probe64:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 (probe64:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error (probe64:mpt0:1:1:0): Retrying command (probe64:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 (probe64:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error (probe64:mpt0:1:1:0): Retrying command (probe64:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 (probe64:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error (probe64:mpt0:1:1:0): Error 5, Retries exhausted (probe1:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 (probe1:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error (probe1:mpt0:1:1:0): Retrying command (probe1:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 (probe1:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error (probe1:mpt0:1:1:0): Retrying command (probe1:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 (probe1:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error (probe1:mpt0:1:1:0): Retrying command (probe1:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 (probe1:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error (probe1:mpt0:1:1:0): Retrying command (probe1:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 (probe1:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error (probe1:mpt0:1:1:0): Error 5, Retries exhausted da0 at mpt0 bus 0 scbus0 target 0 lun 0 ses0 at mpt0 bus 0 scbus0 target 8 lun 0 pass2 at mpt0 bus 1 scbus1 target 0 lun 0 I can try to narrow down when this regression was introduced, but I fig- ured I'd report it in case somebody has an "ah-hah" moment from seeing it. Also, there has always been an issue with passthru on these controllers - as you can see above, there are 2 physical disks attached to the controller, used as a mirror volume. But only one of the members appears as a passN de- vice, which means that the other one can't be monitored with smartmontools. If I'm remembering correctly, a volume with more than 2 drives creates a passN device for all but one of the drives. Terry Kennedy http://www.glaver.org New York, NY USA
Hi tmk,> On Feb 19, 2016, at 12:08 AM, Terry Kennedy <TERRY at glaver.org> wrote: > > I have some systems which I plan to upgrade from 8.4 to 10.3 once 10.3 > is released. In the meantime, I'm testing 10.3-BETA2 and have found what > appears to be a regression in the MPT driver. > > The system is a Dell PowerEdge R300 with a Dell SAS6 controller: > > mpt0 at pci0:5:0:0: class=0x010000 card=0x1f0e1028 chip=0x00581000 rev=0x08 hdr=0x00 > vendor = 'LSI Logic / Symbios Logic' > device = 'SAS1068E PCI-Express Fusion-MPT SAS' > class = mass storage > subclass = SCSI > > Both the system BIOS and the SAS6 firmware are at the latest revisions > from Dell (which haven't changed in years). >Can you get the status of the controller and disks via mptutil ? Also what does camcontrol devlist -v show ?> On the 8.4 system, "grep mpt /var/run/dmesg.boot" reports: > > mpt0: <LSILogic SAS/SATA Adapter> port 0xec00-0xecff mem 0xdfcec000-0xdfceffff,0xdfcf0000-0xdfcfffff irq 16 at device 0.0 on pci5 > mpt0: [ITHREAD] > mpt0: MPI Version=1.5.18.0 > mpt0: Capabilities: ( RAID-0 RAID-1E RAID-1 ) > mpt0: 1 Active Volume (2 Max) > mpt0: 2 Hidden Drive Members (14 Max) > mpt0:vol0(mpt0:0:0): Settings ( Hot-Plug-Spares High-Priority-ReSync ) > mpt0:vol0(mpt0:0:0): Using Spare Pool: 0 > mpt0:vol0(mpt0:0:0): 2 Members: > (mpt0:1:9:0): Primary Online > (mpt0:1:1:0): Secondary Online > mpt0:vol0(mpt0:0:0): RAID-1 - Optimal > mpt0:vol0(mpt0:0:0): Status ( Enabled ) > (mpt0:vol0:1): Physical (mpt0:0:1:0), Pass-thru (mpt0:1:0:0) > (mpt0:vol0:1): Online > (mpt0:vol0:0): Physical (mpt0:0:9:0), Pass-thru (mpt0:1:1:0) > (mpt0:vol0:0): Online > (probe0:mpt0:0:0:0): REPORT LUNS. CDB: a0 00 00 00 00 00 00 00 00 10 00 00 > (probe0:mpt0:0:0:0): CAM status: SCSI Status Error > (probe0:mpt0:0:0:0): SCSI status: Check Condition > (probe0:mpt0:0:0:0): SCSI sense: ILLEGAL REQUEST info?:39000000 asc:0,0 (No additional sense information) > ses0 at mpt0 bus 0 scbus0 target 8 lun 0 > pass2 at mpt0 bus 1 scbus1 target 0 lun 0 > da0 at mpt0 bus 0 scbus0 target 0 lun 0 > > [I'm not sure what that ILLEGAL REQUEST is about.] > > On the same system, running 10.3-BETA2 r295785, I see: > > mpt0: <LSILogic SAS/SATA Adapter> port 0xec00-0xecff mem 0xdfcec000-0xdfceffff,0xdfcf0000-0xdfcfffff irq 16 at device 0.0 on pci5 > mpt0: MPI Version=1.5.18.0 > mpt0: Capabilities: ( RAID-0 RAID-1E RAID-1 ) > mpt0: 1 Active Volume (2 Max) > mpt0: 2 Hidden Drive Members (14 Max) > mpt0:vol0(mpt0:0:0): Settings ( Hot-Plug-Spares High-Priority-ReSync ) > mpt0:vol0(mpt0:0:0): Using Spare Pool: > mpt0:vol0(mpt0:0:0): 2 Members: > (mpt0:1:9:0): Primary Online > (mpt0:1:1:0): Secondary Online > mpt0:vol0(mpt0:0:0): RAID-1 - Optimal > mpt0:vol0(mpt0:0:0): Status ( Enabled ) > (mpt0:vol0:1): Physical (mpt0:0:1:0), Pass-thru (mpt0:1:0:0) > (mpt0:vol0:1): Online > (mpt0:vol0:0): Physical (mpt0:0:9:0), Pass-thru (mpt0:1:1:0) > (mpt0:vol0:0): Online > (probe64:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 > (probe64:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error > (probe64:mpt0:1:1:0): Retrying command > (probe64:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 > (probe64:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error > (probe64:mpt0:1:1:0): Retrying command > (probe0:mpt0:0:0:0): REPORT LUNS. CDB: a0 00 00 00 00 00 00 00 00 10 00 00 > (probe0:mpt0:0:0:0): CAM status: SCSI Status Error > (probe0:mpt0:0:0:0): SCSI status: Check Condition > (probe0:mpt0:0:0:0): SCSI sense: ILLEGAL REQUEST asc:ffffffff,ffffffff (Reserved ASC/ASCQ pair) > (probe0:mpt0:0:0:0): Error 22, Unretryable error > (probe64:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 > (probe64:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error > (probe64:mpt0:1:1:0): Retrying command > (probe64:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 > (probe64:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error > (probe64:mpt0:1:1:0): Retrying command > (probe64:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 > (probe64:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error > (probe64:mpt0:1:1:0): Error 5, Retries exhausted > (probe1:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 > (probe1:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error > (probe1:mpt0:1:1:0): Retrying command > (probe1:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 > (probe1:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error > (probe1:mpt0:1:1:0): Retrying command > (probe1:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 > (probe1:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error > (probe1:mpt0:1:1:0): Retrying command > (probe1:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 > (probe1:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error > (probe1:mpt0:1:1:0): Retrying command > (probe1:mpt0:1:1:0): INQUIRY. CDB: 12 00 00 00 24 00 > (probe1:mpt0:1:1:0): CAM status: Unrecoverable Host Bus Adapter Error > (probe1:mpt0:1:1:0): Error 5, Retries exhausted > da0 at mpt0 bus 0 scbus0 target 0 lun 0 > ses0 at mpt0 bus 0 scbus0 target 8 lun 0 > pass2 at mpt0 bus 1 scbus1 target 0 lun 0 > > I can try to narrow down when this regression was introduced, but I fig- > ured I'd report it in case somebody has an "ah-hah" moment from seeing it. > > Also, there has always been an issue with passthru on these controllers - > as you can see above, there are 2 physical disks attached to the controller, > used as a mirror volume. But only one of the members appears as a passN de- > vice, which means that the other one can't be monitored with smartmontools. > If I'm remembering correctly, a volume with more than 2 drives creates a > passN device for all but one of the drives. > > Terry Kennedy http://www.glaver.org New York, NY USA > _______________________________________________ > freebsd-stable at freebsd.org mailing list > https://lists.freebsd.org/mailman/listinfo/freebsd-stable > To unsubscribe, send any mail to "freebsd-stable-unsubscribe at freebsd.org"--- Mark Saad | nonesuch at longcount.org
> Can you get the status of the controller and disks via mptutil ? Also > what does camcontrol devlist -v show ?8.4 (8-STABLE): # mptutil show adapter mpt0 Adapter: Board Name: SAS6IR Board Assembly: Chip Name: C1068E Chip Revision: UNUSED RAID Levels: RAID0, RAID1, RAID1E RAID0 Stripes: 64k RAID1E Stripes: 64k RAID0 Drives/Vol: 2-10 RAID1 Drives/Vol: 2 RAID1E Drives/Vol: 3-10 # mptutil show drives mpt0 Physical Drives: 0 ( 137G) ONLINE <SEAGATE ST3300657SS-H EH04> SAS bus 0 id 1 1 ( 137G) ONLINE <SEAGATE ST3300657SS-H EH04> SAS bus 0 id 9 # mptutil show volumes mpt0 Volumes: Id Size Level Stripe State Write-Cache Name 0 ( 136G) RAID-1 OPTIMAL Enabled # camcontrol devlist -v scbus0 on mpt0 bus 0: <Dell VIRTUAL DISK 1028> at scbus0 target 0 lun 0 (da0,pass0) <DP BACKPLANE 1.05> at scbus0 target 8 lun 0 (ses0,pass1) <> at scbus0 target -1 lun -1 () scbus1 on mpt0 bus 1: <SEAGATE ST3300657SS-H EH04> at scbus1 target 0 lun 0 (pass2) <> at scbus1 target -1 lun -1 () scbus-1 on xpt0 bus 0: <> at scbus-1 target -1 lun -1 (xpt0) 10.3-BETA2: # mptutil show adapter mpt0 Adapter: Board Name: SAS6IR Board Assembly: Chip Name: C1068E Chip Revision: UNUSED RAID Levels: RAID0, RAID1, RAID1E RAID0 Stripes: 64K RAID1E Stripes: 64K RAID0 Drives/Vol: 2-10 RAID1 Drives/Vol: 2 RAID1E Drives/Vol: 3-10 # mptutil show drives mpt0 Physical Drives: 0 ( 137G) ONLINE <SEAGATE ST3300657SS-H EH04> SAS bus 0 id 1 1 ( 137G) ONLINE <SEAGATE ST3300657SS-H EH04> SAS bus 0 id 9 # mptutil show volumes mpt0 Volumes: Id Size Level Stripe State Write-Cache Name 0 ( 136G) RAID-1 OPTIMAL Enabled # camcontrol devlist -v scbus0 on mpt0 bus 0: <Dell VIRTUAL DISK 1028> at scbus0 target 0 lun 0 (pass0,da0) <DP BACKPLANE 1.05> at scbus0 target 8 lun 0 (ses0,pass1) <> at scbus0 target -1 lun ffffffff () scbus1 on mpt0 bus 1: <SEAGATE ST3300657SS-H EH04> at scbus1 target 0 lun 0 (pass2) <> at scbus1 target -1 lun ffffffff () scbus2 on ata2 bus 0: <TSSTcorp DVD-ROM TS-L333A D100> at scbus2 target 1 lun 0 (pass3,cd0) <> at scbus2 target -1 lun ffffffff () scbus3 on ata3 bus 0: <> at scbus3 target -1 lun ffffffff () scbus4 on ata4 bus 0: <> at scbus4 target -1 lun ffffffff () scbus5 on ata5 bus 0: <> at scbus5 target -1 lun ffffffff () scbus-1 on xpt0 bus 0: <> at scbus-1 target -1 lun ffffffff (xpt0) To clarify, things seem to work fine on 10.3-BETA2 after the system has booted, but there is a _long_ pause while the kernel is probing the mpt0 controller, followed by the spew of CAM error messages from the probes. Let me know if you need any additional info. Terry Kennedy http://www.glaver.org New York, NY USA