Jay
2008-Dec-30 14:22 UTC
[zfs-discuss] read/write errors on storage pool (poss. ahci/hw related?)
hi *, i''m currently playing around with the setup of an opensolaris server as home nas and am experiencing occasional read/write problems with the zfs pool. the short version (details below/attached): * 6-disk raidz pool attached to the sata controller on an nvidia MCP78S chipset * first scrub of the pool with some data on it marks device sd5 as faulted due to "WARNING: ahci0: watchdog port 5 satapkt 0xffffff01c7d8d660 timed out" and a plethora of "Error for Command: read(10)" (see attached messages) * these messages appeared also for sd1, sd2 and sd3, but only sd5 failed in the end * replaced the disk, resilvering started * the same timeouts appear for sd0 and sd1 while resilvering, to prevent the pool from failing completely, i (rather brute force) rebooted the machine * resilvering ends eventually, data seems intact * everything seems normal for a few days, reading/writing is ok, no errors show up, the data is accessible today, i saw the same errors reported for sd4 in the logfile and when trying a ''zpool status'' it became unresponsive, with timeouts showing up for sd0. after another reboot, everything still looks ok, zpool status is ok, read and write access are ok. the disks themselves should be ok, i had them running a burn-in before installing opensolaris and the WD diagnostics passed them - even the faulted one i replaced passed another test as being perfectly ok. can anybody shed some light on this? i''m guessing it''s related to the sata controller, but i''d appreciate any help or insight. (at the moment, i''m not really worried about data loss as you might guess from the brute force rebooting, all the data on the pool is also stored on an old linux machine. i''m reacquainting myself with solaris, so it''s more or less a playground for now. but i''d like to replace the old linux server sometime - mainly because of zfs) thanks, jay ----- the hardware setup is * MSI K9N2GM-FIH mainboard, geforce 8200 chipset (nvidia MCP78S) * amd athlon x24450e * 4G ram * 6x WD 750GB disks (WD75000AACS) * old samsung 200G as system disk on the IDE controller installed is opensolaris 2008.11 (snv_101b) with a raidz pool spanning the 6 WD disks. ----- jay at space:/var/adm# zpool status storage-b pool: storage-b state: ONLINE scrub: none requested config: NAME STATE READ WRITE CKSUM storage-b ONLINE 0 0 0 raidz1 ONLINE 0 0 0 c3t0d0 ONLINE 0 0 0 c3t1d0 ONLINE 0 0 0 c3t2d0 ONLINE 0 0 0 c3t3d0 ONLINE 0 0 0 c3t4d0 ONLINE 0 0 0 c3t5d0 ONLINE 0 0 0 errors: No known data errors ----- jay at space:/var/adm# cfgadm -la Ap_Id Type Receptacle Occupant Condition sata4/0::dsk/c3t0d0 disk connected configured ok sata4/1::dsk/c3t1d0 disk connected configured ok sata4/2::dsk/c3t2d0 disk connected configured ok sata4/3::dsk/c3t3d0 disk connected configured ok sata4/4::dsk/c3t4d0 disk connected configured ok sata4/5::dsk/c3t5d0 disk connected configured ok attached: the output of prtconf -vp and the log of the failing scrub and the resilvering -- This message posted from opensolaris.org -------------- next part -------------- System Configuration: Sun Microsystems i86pc Memory size: 3968 Megabytes System Peripherals (PROM Nodes): Node 0x000001 bios-boot-device: ''80'' stdout: 00000000 name: ''i86pc'' Node 0x000002 existing: 00d94000.00000000.028e7801.00000000 name: ''ramdisk'' Node 0x000003 bus-type: ''isa'' device_type: ''isa'' name: ''isa'' Node 0x000004 compatible: ''pciex_root_complex'' device_type: ''pciex'' reg: 00000000.00000000.00000000 #size-cells: 00000002 #address-cells: 00000003 name: ''pci'' Node 0x000005 reg: 00000000.00000000.00000000.00000000.00000000 compatible: ''pci10de,754.1462.7508.a2'' + ''pci10de,754.1462.7508'' + ''pci1462,7508'' + ''pci10de,754.a2'' + ''pci10de,754'' + ''pciclass,050000'' + ''pciclass,0500'' model: ''Ram'' power-consumption: 00000001.00000001 66mhz-capable: fast-back-to-back: devsel-speed: 00000000 max-latency: 00000000 min-grant: 00000000 subsystem-vendor-id: 00001462 subsystem-id: 00007508 unit-address: ''0'' class-code: 00050000 revision-id: 000000a2 vendor-id: 000010de device-id: 00000754 name: ''pci1462,7508'' Node 0x000006 assigned-addresses: 81000810.00000000.00002f00.00000000.00000100 reg: 00000800.00000000.00000000.00000000.00000000.01000810.00000000.00000000.00000000.00000100 compatible: ''pci10de,75c.1462.7508.a2'' + ''pci10de,75c.1462.7508'' + ''pci1462,7508'' + ''pci10de,75c.a2'' + ''pci10de,75c'' + ''pciclass,060100'' + ''pciclass,0601'' model: ''ISA bridge'' power-consumption: 00000001.00000001 66mhz-capable: fast-back-to-back: devsel-speed: 00000000 max-latency: 00000000 min-grant: 00000000 subsystem-vendor-id: 00001462 subsystem-id: 00007508 unit-address: ''1'' class-code: 00060100 revision-id: 000000a2 vendor-id: 000010de device-id: 0000075c name: ''pci1462,7508'' Node 0x000007 assigned-addresses: 81000910.00000000.00002900.00000000.00000040.81000920.00000000.00002d00.00000000.00000040.81000924.00000000.00002e00.00000000.00000040 reg: 00000900.00000000.00000000.00000000.00000000.01000910.00000000.00000000.00000000.00000040.01000920.00000000.00000000.00000000.00000040.01000924.00000000.00000000.00000000.00000040 compatible: ''pci10de,752.1462.7508.a1'' + ''pci10de,752.1462.7508'' + ''pci1462,7508'' + ''pci10de,752.a1'' + ''pci10de,752'' + ''pciclass,0c0500'' + ''pciclass,0c05'' model: ''SMBus (System Management Bus)'' power-consumption: 00000001.00000001 66mhz-capable: fast-back-to-back: devsel-speed: 00000000 interrupts: 00000001 max-latency: 00000000 min-grant: 00000000 subsystem-vendor-id: 00001462 subsystem-id: 00007508 unit-address: ''1,1'' class-code: 000c0500 revision-id: 000000a1 vendor-id: 000010de device-id: 00000752 name: ''pci1462,7508'' Node 0x000008 reg: 00000a00.00000000.00000000.00000000.00000000 compatible: ''pci10de,751.1462.7508.a1'' + ''pci10de,751.1462.7508'' + ''pci1462,7508'' + ''pci10de,751.a1'' + ''pci10de,751'' + ''pciclass,050000'' + ''pciclass,0500'' model: ''Ram'' power-consumption: 00000001.00000001 66mhz-capable: fast-back-to-back: devsel-speed: 00000000 max-latency: 00000000 min-grant: 00000000 subsystem-vendor-id: 00001462 subsystem-id: 00007508 unit-address: ''1,2'' class-code: 00050000 revision-id: 000000a1 vendor-id: 000010de device-id: 00000751 name: ''pci1462,7508'' Node 0x000009 assigned-addresses: 82000b10.00000000.fce80000.00000000.00080000 reg: 00000b00.00000000.00000000.00000000.00000000.02000b10.00000000.00000000.00000000.00080000 compatible: ''pci10de,753.1462.7508.a2'' + ''pci10de,753.1462.7508'' + ''pci1462,7508'' + ''pci10de,753.a2'' + ''pci10de,753'' + ''pciclass,0b4000'' + ''pciclass,0b40'' model: ''Co-processor'' power-consumption: 00000001.00000001 66mhz-capable: fast-back-to-back: devsel-speed: 00000000 interrupts: 00000002 max-latency: 00000001 min-grant: 00000003 subsystem-vendor-id: 00001462 subsystem-id: 00007508 unit-address: ''1,3'' class-code: 000b4000 revision-id: 000000a2 vendor-id: 000010de device-id: 00000753 name: ''pci1462,7508'' Node 0x00000a reg: 00000c00.00000000.00000000.00000000.00000000 compatible: ''pci10de,568.1462.7508.a1'' + ''pci10de,568.1462.7508'' + ''pci1462,7508'' + ''pci10de,568.a1'' + ''pci10de,568'' + ''pciclass,050000'' + ''pciclass,0500'' model: ''Ram'' power-consumption: 00000001.00000001 66mhz-capable: fast-back-to-back: devsel-speed: 00000000 max-latency: 00000000 min-grant: 00000000 subsystem-vendor-id: 00001462 subsystem-id: 00007508 unit-address: ''1,4'' class-code: 00050000 revision-id: 000000a1 vendor-id: 000010de device-id: 00000568 name: ''pci1462,7508'' Node 0x00000b #size-cells: 00000000 #address-cells: 00000001 device_type: ''pci-ide'' assigned-addresses: 81003010.00000000.000001f0.00000000.00000008.81003014.00000000.000003f6.00000000.00000001.81003018.00000000.00000170.00000000.00000008.8100301c.00000000.00000376.00000000.00000001.81003020.00000000.0000ffa0.00000000.00000010 reg: 00003000.00000000.00000000.00000000.00000000.81003010.00000000.000001f0.00000000.00000008.81003014.00000000.000003f6.00000000.00000001.81003018.00000000.00000170.00000000.00000008.8100301c.00000000.00000376.00000000.00000001.01003020.00000000.00000000.00000000.00000010 compatible: ''pci10de,759.1462.7508.a1'' + ''pci10de,759.1462.7508'' + ''pci1462,7508'' + ''pci10de,759.a1'' + ''pci10de,759'' + ''pciclass,01018a'' + ''pciclass,0101'' model: ''IDE controller'' power-consumption: 00000001.00000001 66mhz-capable: fast-back-to-back: devsel-speed: 00000000 max-latency: 00000001 min-grant: 00000003 subsystem-vendor-id: 00001462 subsystem-id: 00007508 unit-address: ''6'' class-code: 0001018a revision-id: 000000a1 vendor-id: 000010de device-id: 00000759 name: ''pci-ide'' Node 0x00000c reg: 00000000 name: ''ide'' Node 0x00000d reg: 00000001 name: ''ide'' Node 0x00000e assigned-addresses: 82003810.00000000.fce78000.00000000.00004000 reg: 00003800.00000000.00000000.00000000.00000000.02003810.00000000.00000000.00000000.00004000 compatible: ''pci10de,774.1462.7508.a1'' + ''pci10de,774.1462.7508'' + ''pci1462,7508'' + ''pci10de,774.a1'' + ''pci10de,774'' + ''pciclass,040300'' + ''pciclass,0403'' model: ''Mixed Mode device'' power-consumption: 00000001.00000001 66mhz-capable: fast-back-to-back: devsel-speed: 00000000 interrupts: 00000001 max-latency: 00000005 min-grant: 00000002 subsystem-vendor-id: 00001462 subsystem-id: 00007508 unit-address: ''7'' class-code: 00040300 revision-id: 000000a1 vendor-id: 000010de device-id: 00000774 name: ''pci1462,7508'' Node 0x00000f slot-names: 00000300.746f6c53.6c530032.0031746f reg: 00004000.00000000.00000000.00000000.00000000 compatible: ''pci10de,75a.a1'' + ''pci10de,75a'' + ''pciclass,060401'' + ''pciclass,0604'' model: ''Subtractive Decode PCI-PCI bridge'' ranges: 81000000.00000000.0000d000.81000000.00000000.0000d000.00000000.00001000.82000000.00000000.fcf00000.82000000.00000000.fcf00000.00000000.00100000 bus-range: 00000001.00000001 #size-cells: 00000002 #address-cells: 00000003 device_type: ''pci'' power-consumption: 00000001.00000001 66mhz-capable: fast-back-to-back: devsel-speed: 00000000 unit-address: ''8'' class-code: 00060401 revision-id: 000000a1 vendor-id: 000010de device-id: 0000075a name: ''pci10de,75a'' Node 0x00001a assigned-addresses: 81014010.00000000.0000d800.00000000.00000100.82014014.00000000.fcfffc00.00000000.00000100 reg: 00014000.00000000.00000000.00000000.00000000.01014010.00000000.00000000.00000000.00000100.02014014.00000000.00000000.00000000.00000100 compatible: ''pci10ec,8169.1385.311a.10'' + ''pci10ec,8169.1385.311a'' + ''pci1385,311a'' + ''pci10ec,8169.10'' + ''pci10ec,8169'' + ''pciclass,020000'' + ''pciclass,0200'' model: ''Ethernet controller'' power-consumption: 00000001.00000001 66mhz-capable: fast-back-to-back: devsel-speed: 00000001 interrupts: 00000001 max-latency: 00000040 min-grant: 00000020 subsystem-vendor-id: 00001385 subsystem-id: 0000311a unit-address: ''8'' class-code: 00020000 revision-id: 00000010 vendor-id: 000010ec device-id: 00008169 name: ''pci1385,311a'' Node 0x000010 assigned-addresses: 81004810.00000000.0000c480.00000000.00000008.81004814.00000000.0000c400.00000000.00000004.81004818.00000000.0000c080.00000000.00000008.8100481c.00000000.0000c000.00000000.00000004.81004820.00000000.0000bc00.00000000.00000010.82004824.00000000.fce7c000.00000000.00002000 reg: 00004800.00000000.00000000.00000000.00000000.01004810.00000000.00000000.00000000.00000008.01004814.00000000.00000000.00000000.00000004.01004818.00000000.00000000.00000000.00000008.0100481c.00000000.00000000.00000000.00000004.01004820.00000000.00000000.00000000.00000010.02004824.00000000.00000000.00000000.00002000 compatible: ''pci10de,ad4.1462.7508.a2'' + ''pci10de,ad4.1462.7508'' + ''pci1462,7508'' + ''pci10de,ad4.a2'' + ''pci10de,ad4'' + ''pciclass,010601'' + ''pciclass,0106'' model: ''SATA AHCI 1.0 Interface'' power-consumption: 00000001.00000001 66mhz-capable: fast-back-to-back: devsel-speed: 00000000 interrupts: 00000001 max-latency: 00000001 min-grant: 00000003 subsystem-vendor-id: 00001462 subsystem-id: 00007508 unit-address: ''9'' class-code: 00010601 revision-id: 000000a2 vendor-id: 000010de device-id: 00000ad4 name: ''pci1462,7508'' Node 0x000011 reg: 00005800.00000000.00000000.00000000.00000000 compatible: ''pci10de,569.a1'' + ''pci10de,569'' + ''pciclass,060400'' + ''pciclass,0604'' model: ''PCI-PCI bridge'' ranges: 81000000.00000000.0000e000.81000000.00000000.0000e000.00000000.00001000.82000000.00000000.fd000000.82000000.00000000.fd000000.00000000.01b00000.c2000000.00000000.ce000000.c2000000.00000000.ce000000.00000000.12000000 bus-range: 00000002.00000002 #size-cells: 00000002 #address-cells: 00000003 device_type: ''pci'' power-consumption: 00000001.00000001 devsel-speed: 00000000 unit-address: ''b'' class-code: 00060400 revision-id: 000000a1 vendor-id: 000010de device-id: 00000569 name: ''pci10de,569'' Node 0x00001b assigned-addresses: 82020010.00000000.fd000000.00000000.01000000.c3020014.00000000.d0000000.00000000.10000000.c302001c.00000000.ce000000.00000000.02000000.81020024.00000000.0000ec00.00000000.00000080.a1020000.00000000.000003b0.00000000.0000000c.a1020000.00000000.000003c0.00000000.00000020.82020000.00000000.000a0000.00000000.00020000 reg: 00020000.00000000.00000000.00000000.00000000.02020010.00000000.00000000.00000000.01000000.43020014.00000000.00000000.00000000.10000000.4302001c.00000000.00000000.00000000.02000000.01020024.00000000.00000000.00000000.00000080.a1020000.00000000.000003b0.00000000.0000000c.a1020000.00000000.000003c0.00000000.00000020.82020000.00000000.000a0000.00000000.00020000 compatible: ''pci10de,849.1462.7508.a2'' + ''pci10de,849.1462.7508'' + ''pci1462,7508'' + ''pci10de,849.a2'' + ''pci10de,849'' + ''pciclass,030000'' + ''pciclass,0300'' model: ''VGA compatible controller'' power-consumption: 00000001.00000001 devsel-speed: 00000000 interrupts: 00000001 max-latency: 00000000 min-grant: 00000000 subsystem-vendor-id: 00001462 subsystem-id: 00007508 device_type: ''display'' unit-address: ''0'' class-code: 00030000 revision-id: 000000a2 vendor-id: 000010de device-id: 00000849 name: ''display'' Node 0x000012 reg: 00008000.00000000.00000000.00000000.00000000 compatible: ''pciex10de,778.a1'' + ''pciex10de,778'' + ''pciexclass,060400'' + ''pciexclass,0604'' + ''pci10de,778.a1'' + ''pci10de,778'' + ''pciclass,060400'' + ''pciclass,0604'' model: ''PCI-PCI bridge'' bus-range: 00000003.00000003 #size-cells: 00000002 #address-cells: 00000003 device_type: ''pciex'' power-consumption: 00000001.00000001 slot-names: 00000001.65696370.00000031 physical-slot#: 00000001 devsel-speed: 00000000 interrupts: 00000001 unit-address: ''10'' class-code: 00060400 revision-id: 000000a1 vendor-id: 000010de device-id: 00000778 pcie-capid-pointer: 00000080 pcie-capid-reg: 00000142 pcie-slotcap-reg: 00082580 name: ''pci10de,778'' Node 0x000013 reg: 00009000.00000000.00000000.00000000.00000000 compatible: ''pciex10de,75b.a1'' + ''pciex10de,75b'' + ''pciexclass,060400'' + ''pciexclass,0604'' + ''pci10de,75b.a1'' + ''pci10de,75b'' + ''pciclass,060400'' + ''pciclass,0604'' model: ''PCI-PCI bridge'' bus-range: 00000004.00000004 #size-cells: 00000002 #address-cells: 00000003 device_type: ''pciex'' power-consumption: 00000001.00000001 slot-names: 00000001.65696370.00000033 physical-slot#: 00000003 devsel-speed: 00000000 interrupts: 00000001 unit-address: ''12'' class-code: 00060400 revision-id: 000000a1 vendor-id: 000010de device-id: 0000075b pcie-capid-pointer: 00000080 pcie-capid-reg: 00000141 pcie-slotcap-reg: 00180500 name: ''pci10de,75b'' Node 0x000014 reg: 00009800.00000000.00000000.00000000.00000000 compatible: ''pciex10de,77a.a1'' + ''pciex10de,77a'' + ''pciexclass,060400'' + ''pciexclass,0604'' + ''pci10de,77a.a1'' + ''pci10de,77a'' + ''pciclass,060400'' + ''pciclass,0604'' model: ''PCI-PCI bridge'' ranges: 82000000.00000000.feb00000.82000000.00000000.feb00000.00000000.00100000 bus-range: 00000005.00000005 #size-cells: 00000002 #address-cells: 00000003 device_type: ''pciex'' power-consumption: 00000001.00000001 slot-names: 00000001.65696370.00000034 physical-slot#: 00000004 devsel-speed: 00000000 interrupts: 00000001 unit-address: ''13'' class-code: 00060400 revision-id: 000000a1 vendor-id: 000010de device-id: 0000077a pcie-capid-pointer: 00000080 pcie-capid-reg: 00000141 pcie-slotcap-reg: 00200500 name: ''pci10de,77a'' Node 0x00001c assigned-addresses: 82050010.00000000.febff800.00000000.00000800.82050014.00000000.febff400.00000000.00000080.82050020.00000000.febff000.00000000.00000080.82050024.00000000.febfec00.00000000.00000080 reg: 00050000.00000000.00000000.00000000.00000000.02050010.00000000.00000000.00000000.00000800.02050014.00000000.00000000.00000000.00000080.02050020.00000000.00000000.00000000.00000080.02050024.00000000.00000000.00000000.00000080 compatible: ''pciex197b,2380.1462.508d.0'' + ''pciex197b,2380.1462.508d'' + ''pciex197b,2380.0'' + ''pciex197b,2380'' + ''pciexclass,0c0010'' + ''pciexclass,0c00'' + ''pci197b,2380.1462.508d.0'' + ''pci197b,2380.1462.508d'' + ''pci1462,508d'' + ''pci197b,2380.0'' + ''pci197b,2380'' + ''pciclass,0c0010'' + ''pciclass,0c00'' model: ''FireWire (IEEE 1394) OpenHCI compliant'' power-consumption: 00000001.00000001 devsel-speed: 00000000 interrupts: 00000001 subsystem-vendor-id: 00001462 subsystem-id: 0000508d unit-address: ''0'' class-code: 000c0010 revision-id: 00000000 vendor-id: 0000197b device-id: 00002380 pcie-capid-pointer: 00000080 pcie-capid-reg: 00000001 name: ''pci1462,508d'' Node 0x000015 reg: 0000a000.00000000.00000000.00000000.00000000 compatible: ''pciex10de,77a.a1'' + ''pciex10de,77a'' + ''pciexclass,060400'' + ''pciexclass,0604'' + ''pci10de,77a.a1'' + ''pci10de,77a'' + ''pciclass,060400'' + ''pciclass,0604'' model: ''PCI-PCI bridge'' bus-range: 00000006.00000006 #size-cells: 00000002 #address-cells: 00000003 device_type: ''pciex'' power-consumption: 00000001.00000001 slot-names: 00000001.65696370.00000035 physical-slot#: 00000005 devsel-speed: 00000000 interrupts: 00000001 unit-address: ''14'' class-code: 00060400 revision-id: 000000a1 vendor-id: 000010de device-id: 0000077a pcie-capid-pointer: 00000080 pcie-capid-reg: 00000141 pcie-slotcap-reg: 00280500 name: ''pci10de,77a'' Node 0x000016 reg: 0000c000.00000000.00000000.00000000.00000000 compatible: ''pci1022,1100.0'' + ''pci1022,1100'' + ''pciclass,060000'' + ''pciclass,0600'' model: ''Host bridge'' power-consumption: 00000001.00000001 devsel-speed: 00000000 max-latency: 00000000 min-grant: 00000000 unit-address: ''18'' class-code: 00060000 revision-id: 00000000 vendor-id: 00001022 device-id: 00001100 name: ''pci1022,1100'' Node 0x000017 reg: 0000c100.00000000.00000000.00000000.00000000 compatible: ''pci1022,1101.0'' + ''pci1022,1101'' + ''pciclass,060000'' + ''pciclass,0600'' model: ''Host bridge'' power-consumption: 00000001.00000001 devsel-speed: 00000000 max-latency: 00000000 min-grant: 00000000 unit-address: ''18,1'' class-code: 00060000 revision-id: 00000000 vendor-id: 00001022 device-id: 00001101 name: ''pci1022,1101'' Node 0x000018 reg: 0000c200.00000000.00000000.00000000.00000000 compatible: ''pci1022,1102.0'' + ''pci1022,1102'' + ''pciclass,060000'' + ''pciclass,0600'' model: ''Host bridge'' power-consumption: 00000001.00000001 devsel-speed: 00000000 max-latency: 00000000 min-grant: 00000000 unit-address: ''18,2'' class-code: 00060000 revision-id: 00000000 vendor-id: 00001022 device-id: 00001102 name: ''pci1022,1102'' Node 0x000019 reg: 0000c300.00000000.00000000.00000000.00000000 compatible: ''pci1022,1103.0'' + ''pci1022,1103'' + ''pciclass,060000'' + ''pciclass,0600'' model: ''Host bridge'' power-consumption: 00000001.00000001 devsel-speed: 00000000 max-latency: 00000000 min-grant: 00000000 unit-address: ''18,3'' class-code: 00060000 revision-id: 00000000 vendor-id: 00001022 device-id: 00001103 name: ''pci1022,1103'' -------------- next part -------------- A non-text attachment was scrubbed... Name: messages.0 Type: application/octet-stream Size: 278813 bytes Desc: not available URL: <http://mail.opensolaris.org/pipermail/zfs-discuss/attachments/20081230/01868ea1/attachment.obj>
Richard Elling
2008-Dec-30 18:11 UTC
[zfs-discuss] read/write errors on storage pool (poss. ahci/hw related?)
Jay wrote:> hi *, > > i''m currently playing around with the setup of an opensolaris server as > home nas and am experiencing occasional read/write problems with the > zfs pool. > > the short version (details below/attached): > * 6-disk raidz pool attached to the sata controller on an nvidia MCP78S chipset > * first scrub of the pool with some data on it marks device sd5 as faulted > due to "WARNING: ahci0: watchdog port 5 satapkt 0xffffff01c7d8d660 timed out" > and a plethora of "Error for Command: read(10)" (see attached messages) >Jay, if you search the bugs database for this error message, http://bugs.opensolaris.org you will find a number of hits. Many possibly related bugs have been fixed by b101, but there may be more. You should also ask this question on the drivers-discuss forum as that is where the device driver writers hang out. -- richard> * these messages appeared also for sd1, sd2 and sd3, but only sd5 failed in the end > * replaced the disk, resilvering started > * the same timeouts appear for sd0 and sd1 while resilvering, to prevent the pool from > failing completely, i (rather brute force) rebooted the machine > * resilvering ends eventually, data seems intact > * everything seems normal for a few days, reading/writing is ok, no errors show up, the > data is accessible > > today, i saw the same errors reported for sd4 in the logfile and when trying a ''zpool status'' > it became unresponsive, with timeouts showing up for sd0. after another reboot, everything still looks ok, zpool status is ok, read and write access are ok. > > the disks themselves should be ok, i had them running a burn-in before installing opensolaris and the WD diagnostics passed them - even the faulted one i replaced passed another test as being perfectly ok. > > can anybody shed some light on this? i''m guessing it''s related to the sata controller, but i''d appreciate any help or insight. > > (at the moment, i''m not really worried about data loss as you might guess from the brute > force rebooting, all the data on the pool is also stored on an old linux machine. i''m reacquainting myself with solaris, so it''s more or less a playground for now. but i''d like to replace the old linux server sometime - mainly because of zfs) > > thanks, > jay > >
Jay
2008-Dec-31 11:44 UTC
[zfs-discuss] read/write errors on storage pool (poss. ahci/hw related?)
hi richard, the bugs database ... figures ... now that you said it, it''s really quite obvious :) thanks, and thanks for the hint towards the drivers-discuss forum. bye, jay -- This message posted from opensolaris.org