Hi I've had problems before with GEOM mirror and my SATA drives, and i've posted about it here before too. The solution seemd to be a change of motherboard, this reduced the crash very much (and also the speeds archieved was greatly improved, from 10-15MB/s to 40-50MB/s..). However after the change i had one or two crashes, but now it has been running for well over 50-60 days or so without any problems. Then, 11 days ago I upgraded to 6.1... And now I got these "crashe"s again (the mirror is crashed that is, the system still runs fine): May 21 02:04:58 elfi kernel: ad6: FAILURE - device detached May 21 02:04:58 elfi kernel: subdisk6: detached May 21 02:04:58 elfi kernel: ad6: detached May 21 02:04:58 elfi kernel: GEOM_MIRROR: Device gm0s1: provider ad6s1 disconnected. May 21 02:04:58 elfi kernel: g_vfs_done():mirror/gm0s1f[READ (offset=11006308352, length=2048)]error = 6 May 21 02:04:58 elfi kernel: g_vfs_done():mirror/gm0s1f[READ (offset=164847927296, length=131072)]error = 6 May 21 02:04:58 elfi kernel: g_vfs_done():mirror/gm0s1f[READ (offset=256680296448, length=32768)]error = 6 Some info about the controller and disks: May 9 22:46:52 elfi kernel: ata1: <ATA channel 1> on atapci0 May 9 22:46:52 elfi kernel: atapci1: <nVidia nForce2 Pro SATA150 controller> port 0xec00-0xec07,0xe880-0xe883,0xe800-0xe807,0xe480-0xe483,0x7f00-0x7f0f, 0x7c0 0-0x7c7f irq 22 at device 11.0 on pci0 May 9 22:46:52 elfi kernel: ad4: 286188MB <Maxtor 7L300S0 BANC1G10> at ata2-master SATA150 May 9 22:46:52 elfi kernel: ad6: 286188MB <Maxtor 7L300S0 BANC1G10> at ata3-master SATA150 May 9 22:46:52 elfi kernel: GEOM_MIRROR: Device gm0s1 created (id=4118114647). May 9 22:46:52 elfi kernel: GEOM_MIRROR: Device gm0s1: provider ad4s1 detected. May 9 22:46:52 elfi kernel: GEOM_MIRROR: Device gm0s1: provider ad6s1 detected. May 9 22:46:52 elfi kernel: GEOM_MIRROR: Device gm0s1: provider ad6s1 activated. May 9 22:46:52 elfi kernel: GEOM_MIRROR: Device gm0s1: provider ad4s1 activated. May 9 22:46:52 elfi kernel: GEOM_MIRROR: Device gm0s1: provider mirror/gm0s1 launched. May 9 22:46:52 elfi kernel: Trying to mount root from ufs:/dev/ mirror/gm0s1a Anyone got any new clues? Afaik the disks should be working fine (they are 6 months old and this same problem has occured multiple times...) Hope to solve this ;) Thanks Johan
Hi, Sorry this is only a 'me too' message... On Sun, 21 May 2006 11:16:14 +0200, Johan Str?m <johan@stromnet.org> said:> May 21 02:04:58 elfi kernel: ad6: FAILURE - device detached > May 21 02:04:58 elfi kernel: subdisk6: detached > May 21 02:04:58 elfi kernel: ad6: detached > May 21 02:04:58 elfi kernel: GEOM_MIRROR: Device gm0s1: provider > ad6s1 disconnected.I have a similar problem on a different M/B (Intel D925XECV2). I'm not sure if it is only a coincidence or somewhat related. May 21 07:43:49 elvenbow kernel: ad4: FAILURE - device detached May 21 07:43:49 elvenbow kernel: subdisk4: detached May 21 07:43:49 elvenbow kernel: ad4: detached May 21 07:43:49 elvenbow kernel: GEOM_MIRROR: Device gm0s1: provider ad4s1 disconnected. excerpts from dmesg: atapci0: <Intel ICH6 UDMA100 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xffa0-0xffaf at device 31.1 on pci0 ata0: <ATA channel 0> on atapci0 ata1: <ATA channel 1> on atapci0 atapci1: <Intel ICH6 SATA150 controller> port 0xec00-0xec07,0xe800-0xe803,0xe400-0xe407,0xe000-0xe003,0xdc00-0xdc0f mem 0xf3afbc00-0xf3afbfff irq 19 at device 31.2 on pci0 ata2: <ATA channel 0> on atapci1 ata3: <ATA channel 1> on atapci1 ata4: <ATA channel 2> on atapci1 ata5: <ATA channel 3> on atapci1 ad4: 239372MB <Maxtor 7V250F0 VA111610> at ata2-master SATA150 ad6: 239372MB <Maxtor 7V250F0 VA111610> at ata3-master SATA150 I purchased and started using this new PC last December, and the problem occurred several times by now. Both ad4 and ad6 have been detached (not at a time). 'atacontrol reinit' paused the system for a second, and returned without detecting the detached device. I need a complete power cycle or the device won't recognized by BIOS again. There is no SMART error recorded on these drives. I'm considering to change M/B, but it is difficult right now... dmesg.boot is attached. Ah, the system is running FreeBSD 6.1-STABLE amd64. FreeBSD elvenbow.cc.kyushu-u.ac.jp 6.1-STABLE FreeBSD 6.1-STABLE #0: Mon May 8 16:54:22 JST 2006 root@elvenbow.cc.kyushu-u.ac.jp:/usr/obj/usr/src/sys/ELVENBOW amd64 -- Yoshiaki Kasahara Computing and Communications Center, Kyushu University kasahara@nc.kyushu-u.ac.jp -------------- next part -------------- Copyright (c) 1992-2006 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 6.1-STABLE #0: Mon May 8 16:54:22 JST 2006 root@elvenbow.cc.kyushu-u.ac.jp:/usr/obj/usr/src/sys/ELVENBOW Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: Intel(R) Pentium(R) 4 CPU 3.00GHz (3000.10-MHz K8-class CPU) Origin = "GenuineIntel" Id = 0xf43 Stepping = 3 Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE> Features2=0x649d<SSE3,RSVD2,MON,DS_CPL,EST,CNTX-ID,CX16,<b14>> AMD Features=0x20100800<SYSCALL,NX,LM> Logical CPUs per core: 2 real memory = 2145579008 (2046 MB) avail memory = 2060705792 (1965 MB) ACPI APIC Table: <INTEL D925CV2 > ioapic0 <Version 2.0> irqs 0-23 on motherboard kbd1 at kbdmux0 acpi0: <INTEL D925CV2> on motherboard acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR acpi0: Power Button (fixed) acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 acpi_timer0: <24-bit timer at 3.579545MHz> port 0x408-0x40b on acpi0 cpu0: <ACPI CPU> on acpi0 est0: <Enhanced SpeedStep Frequency Control> on cpu0 est: CPU supports Enhanced Speedstep, but is not recognized. est: Please update driver or contact the maintainer. est: cpu_vendor GenuineIntel, msr f2d00000f2d, bus_clk, 64 device_attach: est0 attach returned 6 p4tcc0: <CPU Frequency Thermal Control> on cpu0 pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0 pci0: <ACPI PCI bus> on pcib0 pcib1: <ACPI PCI-PCI bridge> at device 1.0 on pci0 pci1: <ACPI PCI bus> on pcib1 pci1: <display, VGA> at device 0.0 (no driver attached) pci0: <multimedia> at device 27.0 (no driver attached) pcib2: <ACPI PCI-PCI bridge> at device 28.0 on pci0 pci5: <ACPI PCI bus> on pcib2 pcib3: <ACPI PCI-PCI bridge> at device 28.1 on pci0 pci4: <ACPI PCI bus> on pcib3 pcib4: <ACPI PCI-PCI bridge> at device 28.2 on pci0 pci3: <ACPI PCI bus> on pcib4 pcib5: <ACPI PCI-PCI bridge> at device 28.3 on pci0 pci2: <ACPI PCI bus> on pcib5 uhci0: <Intel 82801FB/FR/FW/FRW (ICH6) USB controller USB-A> port 0xcc00-0xcc1f irq 23 at device 29.0 on pci0 uhci0: [GIANT-LOCKED] usb0: <Intel 82801FB/FR/FW/FRW (ICH6) USB controller USB-A> on uhci0 usb0: USB revision 1.0 uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 2 ports with 2 removable, self powered uhci1: <Intel 82801FB/FR/FW/FRW (ICH6) USB controller USB-B> port 0xd000-0xd01f irq 19 at device 29.1 on pci0 uhci1: [GIANT-LOCKED] usb1: <Intel 82801FB/FR/FW/FRW (ICH6) USB controller USB-B> on uhci1 usb1: USB revision 1.0 uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub1: 2 ports with 2 removable, self powered uhci2: <Intel 82801FB/FR/FW/FRW (ICH6) USB controller USB-C> port 0xd400-0xd41f irq 18 at device 29.2 on pci0 uhci2: [GIANT-LOCKED] usb2: <Intel 82801FB/FR/FW/FRW (ICH6) USB controller USB-C> on uhci2 usb2: USB revision 1.0 uhub2: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub2: 2 ports with 2 removable, self powered uhci3: <Intel 82801FB/FR/FW/FRW (ICH6) USB controller USB-D> port 0xd800-0xd81f irq 16 at device 29.3 on pci0 uhci3: [GIANT-LOCKED] usb3: <Intel 82801FB/FR/FW/FRW (ICH6) USB controller USB-D> on uhci3 usb3: USB revision 1.0 uhub3: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub3: 2 ports with 2 removable, self powered ehci0: <Intel 82801FB (ICH6) USB 2.0 controller> mem 0xf3afb800-0xf3afbbff irq 23 at device 29.7 on pci0 ehci0: [GIANT-LOCKED] usb4: EHCI version 1.0 usb4: companion controllers, 2 ports each: usb0 usb1 usb2 usb3 usb4: <Intel 82801FB (ICH6) USB 2.0 controller> on ehci0 usb4: USB revision 2.0 uhub4: Intel EHCI root hub, class 9/0, rev 2.00/1.00, addr 1 uhub4: 8 ports with 8 removable, self powered pcib6: <ACPI PCI-PCI bridge> at device 30.0 on pci0 pci6: <ACPI PCI bus> on pcib6 vge0: <VIA Networking Gigabit Ethernet> port 0xb800-0xb8ff mem 0xf3b00000-0xf3b000ff irq 22 at device 1.0 on pci6 miibus0: <MII bus> on vge0 ciphy0: <Cicada CS8201 10/100/1000TX PHY> on miibus0 ciphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto vge0: Ethernet address: 00:02:2a:dd:05:41 vge1: <VIA Networking Gigabit Ethernet> port 0xb400-0xb4ff mem 0xf3b00400-0xf3b004ff irq 18 at device 2.0 on pci6 miibus1: <MII bus> on vge1 ciphy1: <Cicada CS8201 10/100/1000TX PHY> on miibus1 ciphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto vge1: Ethernet address: 00:02:2a:dd:0c:31 vge2: <VIA Networking Gigabit Ethernet> port 0xb000-0xb0ff mem 0xf3b00800-0xf3b008ff irq 19 at device 3.0 on pci6 miibus2: <MII bus> on vge2 ciphy2: <Cicada CS8201 10/100/1000TX PHY> on miibus2 ciphy2: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto vge2: Ethernet address: 00:02:2a:dd:00:7d isab0: <PCI-ISA bridge> at device 31.0 on pci0 isa0: <ISA bus> on isab0 atapci0: <Intel ICH6 UDMA100 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xffa0-0xffaf at device 31.1 on pci0 ata0: <ATA channel 0> on atapci0 ata1: <ATA channel 1> on atapci0 atapci1: <Intel ICH6 SATA150 controller> port 0xec00-0xec07,0xe800-0xe803,0xe400-0xe407,0xe000-0xe003,0xdc00-0xdc0f mem 0xf3afbc00-0xf3afbfff irq 19 at device 31.2 on pci0 ata2: <ATA channel 0> on atapci1 ata3: <ATA channel 1> on atapci1 ata4: <ATA channel 2> on atapci1 ata5: <ATA channel 3> on atapci1 ichsmb0: <SMBus controller> port 0xc800-0xc81f irq 19 at device 31.3 on pci0 ichsmb0: [GIANT-LOCKED] smbus0: <System Management Bus> on ichsmb0 smb0: <SMBus generic I/O> on smbus0 acpi_button0: <Power Button> on acpi0 atkbdc0: <Keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0 atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0 kbd0 at atkbd0 atkbd0: [GIANT-LOCKED] fdc0: <floppy drive controller> port 0x3f0-0x3f1,0x3f2-0x3f3,0x3f4-0x3f5,0x3f7 irq 6 drq 2 on acpi0 fdc0: [FAST] fd0: <1440-KB 3.5" drive> on fdc0 drive 0 sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 sio0: type 16550A ppc0: <Standard parallel printer port> port 0x378-0x37f irq 7 on acpi0 ppc0: Generic chipset (EPP/NIBBLE) in COMPATIBLE mode ppbus0: <Parallel port bus> on ppc0 plip0: <PLIP network interface> on ppbus0 lpt0: <Printer> on ppbus0 lpt0: Interrupt-driven port ppi0: <Parallel I/O> on ppbus0 orm0: <ISA Option ROM> at iomem 0xc0000-0xcefff on isa0 sc0: <System console> at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> sio1: configured irq 3 not in bitmap of probed irqs 0 sio1: port may not be enabled vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 ums0: Microsoft Microsoft IntelliMouse\M-. Explorer, rev 2.00/4.24, addr 2, iclass 3/1 ums0: 5 buttons and Z dir and a TILT dir. Timecounter "TSC" frequency 3000098070 Hz quality 800 Timecounters tick every 1.000 msec module_register_init: MOD_LOAD (amr_linux, 0xffffffff8062fd50, 0) error 6 ipfw2 (+ipv6) initialized, divert loadable, rule-based forwarding enabled, default to deny, logging disabled acd0: DVDR <HL-DT-ST DVDRAM GSA-4167B/DL12> at ata0-master UDMA33 ad4: 239372MB <Maxtor 7V250F0 VA111610> at ata2-master SATA150 ad6: 239372MB <Maxtor 7V250F0 VA111610> at ata3-master SATA150 GEOM_MIRROR: Device gm0s1 created (id=400615043). GEOM_MIRROR: Device gm0s1: provider ad4s1 detected. GEOM_MIRROR: Device gm0s1: provider ad6s1 detected. GEOM_MIRROR: Device gm0s1: provider ad6s1 activated. GEOM_MIRROR: Device gm0s1: provider mirror/gm0s1 launched. GEOM_MIRROR: Device gm0s1: rebuilding provider ad4s1. Trying to mount root from ufs:/dev/mirror/gm0s1a
On 21 maj 2006, at 11.16, Johan Str?m wrote:> Hi > > I've had problems before with GEOM mirror and my SATA drives, and > i've posted about it here before too. The solution seemd to be a > change of motherboard, this reduced the crash very much (and also > the speeds archieved was greatly improved, from 10-15MB/s to > 40-50MB/s..). > However after the change i had one or two crashes, but now it has > been running for well over 50-60 days or so without any problems. > Then, 11 days ago I upgraded to 6.1... And now I got these > "crashe"s again (the mirror is crashed that is, the system still > runs fine): > > May 21 02:04:58 elfi kernel: ad6: FAILURE - device detached > May 21 02:04:58 elfi kernel: subdisk6: detached > May 21 02:04:58 elfi kernel: ad6: detached > May 21 02:04:58 elfi kernel: GEOM_MIRROR: Device gm0s1: provider > ad6s1 disconnected. > May 21 02:04:58 elfi kernel: g_vfs_done():mirror/gm0s1f[READ > (offset=11006308352, length=2048)]error = 6 > May 21 02:04:58 elfi kernel: g_vfs_done():mirror/gm0s1f[READ > (offset=164847927296, length=131072)]error = 6 > May 21 02:04:58 elfi kernel: g_vfs_done():mirror/gm0s1f[READ > (offset=256680296448, length=32768)]error = 6 > > > Some info about the controller and disks: > > May 9 22:46:52 elfi kernel: ata1: <ATA channel 1> on atapci0 > May 9 22:46:52 elfi kernel: atapci1: <nVidia nForce2 Pro SATA150 > controller> port > 0xec00-0xec07,0xe880-0xe883,0xe800-0xe807,0xe480-0xe483,0x7f00-0x7f0f, > 0x7c0 > 0-0x7c7f irq 22 at device 11.0 on pci0 > > May 9 22:46:52 elfi kernel: ad4: 286188MB <Maxtor 7L300S0 > BANC1G10> at ata2-master SATA150 > May 9 22:46:52 elfi kernel: ad6: 286188MB <Maxtor 7L300S0 > BANC1G10> at ata3-master SATA150 > May 9 22:46:52 elfi kernel: GEOM_MIRROR: Device gm0s1 created > (id=4118114647). > May 9 22:46:52 elfi kernel: GEOM_MIRROR: Device gm0s1: provider > ad4s1 detected. > May 9 22:46:52 elfi kernel: GEOM_MIRROR: Device gm0s1: provider > ad6s1 detected. > May 9 22:46:52 elfi kernel: GEOM_MIRROR: Device gm0s1: provider > ad6s1 activated. > May 9 22:46:52 elfi kernel: GEOM_MIRROR: Device gm0s1: provider > ad4s1 activated. > May 9 22:46:52 elfi kernel: GEOM_MIRROR: Device gm0s1: provider > mirror/gm0s1 launched. > May 9 22:46:52 elfi kernel: Trying to mount root from ufs:/dev/ > mirror/gm0s1a > > Anyone got any new clues? Afaik the disks should be working fine > (they are 6 months old and this same problem has occured multiple > times...) > > Hope to solve this ;) > > Thanks > Johan >Here we go again Jul 7 16:20:09 elfi kernel: ad4: FAILURE - device detached Jul 7 16:20:09 elfi kernel: subdisk4: detached Jul 7 16:20:09 elfi kernel: ad4: detached Jul 7 16:20:09 elfi kernel: GEOM_MIRROR: Device gm0s1: provider ad4s1 disconnected. Jul 7 16:20:09 elfi kernel: g_vfs_done():mirror/gm0s1f[READ (offset=88896847872, length=32768)]error = 6 However no read read timeouts etc as before, just this. 18 days uptime this time (i've rebooted for other reasons since last mail). It always seems to be ad4 that is disconnecting.. I'm going to do some disk tests on it but i doubt it will give anything since i've had similiar problems from day one (did tests at that time w/o problems) with this gmirror setup (new disks). Johan
Mike Tancsa wrote: [..]> Install the smartmontools from > > /usr/ports/sysutils/smartmontools/ > > and post the output of > smartctl -a /dev/ad8smartmontools was previously installed and running as daemon without any bad reports. I can not run "smartctl -a /dev/ad8" now, because my server housing provider replaced HDD with the new one and after an hour of synchronization "ad8: FAILURE - device detached". So provider replaced whole server, only ad4 is original piece of HW. On new server synchronization was much faster then in previous server (1:30 hour compared to 5 hours in previous server) - so I think it was HW problem. Now I am running stresstest with copying /usr/ports to another partition in infinite loop. I will post results later. (On bad server, test failed after about 30 minutes. On another server the test is running fine second day, so I think if disk will not fail after 1 day, problem is solved) At last - now I think this was not GEOM/gmirror related. I tried remove ad8 provider from gmirror (gm0), boot up system from gm0 with one provider (ad4) and test ad8 mounted separately - ad8 failed again. Miroslav Lachman
On 17 jul 2006, at 17.40, Miroslav Lachman wrote:> Mike Tancsa wrote: > [..] >> Install the smartmontools from >> /usr/ports/sysutils/smartmontools/ >> and post the output of >> smartctl -a /dev/ad8 > > smartmontools was previously installed and running as daemon > without any bad reports. > I can not run "smartctl -a /dev/ad8" now, because my server housing > provider replaced HDD with the new one and after an hour of > synchronization "ad8: FAILURE - device detached". So provider > replaced whole server, only ad4 is original piece of HW. > On new server synchronization was much faster then in previous > server (1:30 hour compared to 5 hours in previous server) - so I > think it was HW problem. > Now I am running stresstest with copying /usr/ports to another > partition in infinite loop. > I will post results later. (On bad server, test failed after about > 30 minutes. On another server the test is running fine second day, > so I think if disk will not fail after 1 day, problem is solved) > > At last - now I think this was not GEOM/gmirror related. I tried > remove ad8 provider from gmirror (gm0), boot up system from gm0 > with one provider (ad4) and test ad8 mounted separately - ad8 > failed again.Just got another one.. Jul 25 13:30:47 elfi kernel: ad4: FAILURE - device detached Jul 25 13:30:47 elfi kernel: subdisk4: detached Jul 25 13:30:47 elfi kernel: ad4: detached Jul 25 13:30:47 elfi kernel: GEOM_MIRROR: Device gm0s1: provider ad4s1 disconnected. Jul 25 13:30:47 elfi kernel: g_vfs_done():mirror/gm0s1f[READ (offset=46318008320, length=2048)]error = 6 Jul 25 13:30:47 elfi kernel: g_vfs_done():mirror/gm0s1f[READ (offset=77269614592, length=16384)]error = 6 6 days uptime when this occured... Both disks are tested with PowerMax without a single problem (same with smartctl), both SATA cables are new. So the only hwproblem that I cant rule out would be the mobo, but that is quite new too... Solutions? Try RELENG_6 as recommended earlier? Thanks Johan
Johan Str?m wrote: [...]> On 17 jul 2006, at 17.40, Miroslav Lachman wrote: > >> Mike Tancsa wrote: >> [..] >> >>> Install the smartmontools from >>> /usr/ports/sysutils/smartmontools/ >>> and post the output of >>> smartctl -a /dev/ad8 >> >> >> smartmontools was previously installed and running as daemon without >> any bad reports. >> I can not run "smartctl -a /dev/ad8" now, because my server housing >> provider replaced HDD with the new one and after an hour of >> synchronization "ad8: FAILURE - device detached". So provider >> replaced whole server, only ad4 is original piece of HW. >> On new server synchronization was much faster then in previous server >> (1:30 hour compared to 5 hours in previous server) - so I think it >> was HW problem. >> Now I am running stresstest with copying /usr/ports to another >> partition in infinite loop. >> I will post results later. (On bad server, test failed after about 30 >> minutes. On another server the test is running fine second day, so I >> think if disk will not fail after 1 day, problem is solved) >> >> At last - now I think this was not GEOM/gmirror related. I tried >> remove ad8 provider from gmirror (gm0), boot up system from gm0 with >> one provider (ad4) and test ad8 mounted separately - ad8 failed again. > > > Just got another one.. > > Jul 25 13:30:47 elfi kernel: ad4: FAILURE - device detached > Jul 25 13:30:47 elfi kernel: subdisk4: detached > Jul 25 13:30:47 elfi kernel: ad4: detached > Jul 25 13:30:47 elfi kernel: GEOM_MIRROR: Device gm0s1: provider ad4s1 > disconnected. > Jul 25 13:30:47 elfi kernel: g_vfs_done():mirror/gm0s1f[READ > (offset=46318008320, length=2048)]error = 6 > Jul 25 13:30:47 elfi kernel: g_vfs_done():mirror/gm0s1f[READ > (offset=77269614592, length=16384)]error = 6 > > 6 days uptime when this occured... Both disks are tested with PowerMax > without a single problem (same with smartctl), both SATA cables are > new. So the only hwproblem that I cant rule out would be the mobo, but > that is quite new too... > > Solutions? Try RELENG_6 as recommended earlier?In my case, server (mobo) replacement solved the problem. In this time, I got same problem on the second server. :( You can try BIOS update first, then RELENG_6 (I do not thing it helps), at last - replace mobo. Please, send me info, if BIOS update solved your problem. Miroslav Lachman
On Sat, Aug 19, 2006 at 04:39:55PM +0200, Miroslav Lachman wrote:> I upgraded to RELENG_6, changed all HW (whole servers and changed > Seagate HHDs to Samsung so every piece of HW is different from time of > my first post), but after one week I got the same error and systemJust a try - have you changed cables too?
Miroslav Lachman wrote:> I upgraded to RELENG_6, changed all HW (whole servers and changed > Seagate HHDs to Samsung so every piece of HW is different from time of > my first post), but after one week I got the same error and system > reboot today: > Aug 19 15:11:20 track ntpd[456]: kernel time sync enabled 2001 > Aug 19 15:15:47 track kernel: ad6: FAILURE - device detached > Aug 19 15:15:47 track kernel: subdisk6: detached > Aug 19 15:15:47 track kernel: ad6: detached > Aug 19 15:15:47 track kernel: GEOM_MIRROR: Device gm0: provider ad6 > disconnected. > Aug 19 15:15:47 track kernel: > g_vfs_done():mirror/gm0s2d[READ(offset=1169260544, leng > th=131072)]error = 6 > Aug 19 15:22:34 track syslogd: kernel boot file is /boot/kernel/kernel > > From my point of view - this is not related to 1 piece of HW, but > general problem of ICH7 chipset or (s)ATA driver in FreeBSD 6.x. As > other poster has different chipsets (ICH6 and nVidia), it seems more > FreeBSD ATA driver related. (7 different machines was tried) >Just a "me too", i have the same problems with ICH7 and disks mysteriously disconnecting. Aug 14 16:54:47 mx1 kernel: ad4: FAILURE - device detached Aug 14 16:54:47 mx1 kernel: ad4: detaGEOM_MIRROR:ched Device gm: provider ad4 disconnected. I think there definitely is a problem with the chipset/driver.