Eric Buchanan
2005-Mar-06 00:25 UTC
Error with CMD 649 ATA100 controller while copying files over NFS on 4.11-Stable
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 I switched one hard drive to a CMD 649 ATA100 controller last summer, and I've had some trouble with it on 4.11-Stable. When I copy a large volume of files over NFS (800 Megabytes, lets say) the add on controller will stop recognizing my hard drive. This time the files were not being copied off a drive handled by this controller. Any ideas? This only occurs when copying files over nfs, and it will happen even when I am not copying anything involving the drive on the controller. I've tried "atacontrol reinit 2" and it won't re-recognize the hard drive (ad5s4f). I have absolutely zero problems when I am not using NFS, especially zero problems with make worlds or building large ports on this hard drive in question. Thanks, Eric Buchanan Here is the dmesg I see after the hard drive is unrecognized by the ata controller: Copyright (c) 1992-2005 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 4.11-STABLE #69: Mon Feb 14 15:38:38 PST 2005 idaho@paulus.org:/backup/src/sys/compile/MYKERNEL Timecounter "i8254" frequency 1193182 Hz CPU: Intel(R) Celeron(TM) CPU 1400MHz (1396.79-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0x6b4 Stepping = 4 Features=0x383f9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PA T,PSE36,MMX,FXSR,SSE> real memory = 436207616 (425984K bytes) avail memory = 413069312 (403388K bytes) Preloaded elf kernel "kernel" at 0xc0abe000. Preloaded elf module "linux.ko" at 0xc0abe09c. Preloaded elf module "nvidia.ko" at 0xc0abe13c. Pentium Pro MTRR support enabled md0: Malloc disk Using $PIR table, 10 entries at 0xc00fdf20 npx0: <math processor> on motherboard npx0: INT 16 interface pcib0: <Intel 82443BX (440 BX) host to PCI bridge> on motherboard pci0: <PCI bus> on pcib0 agp0: <Intel 82443BX (440 BX) host to PCI bridge> mem 0xf8000000-0xfbffffff at d evice 0.0 on pci0 pcib1: <Intel 82443BX (440 BX) PCI-PCI (AGP) bridge> at device 1.0 on pci0 pci1: <PCI bus> on pcib1 nvidia0: <RIVA TNT2 Model 64/Model 64 Pro> mem 0xfc000000-0xfdffffff,0xf5000000- 0xf5ffffff irq 11 at device 0.0 on pci1 isab0: <Intel 82371AB PCI to ISA bridge> at device 7.0 on pci0 isa0: <ISA bus> on isab0 atapci0: <Intel PIIX4 ATA33 controller> port 0x18a0-0x18af at device 7.1 on pci0 ata0: at 0x1f0 irq 14 on atapci0 ata1: at 0x170 irq 15 on atapci0 uhci0: <Intel 82371AB/EB (PIIX4) USB controller> port 0x1880-0x189f irq 9 at dev ice 7.2 on pci0 usb0: <Intel 82371AB/EB (PIIX4) USB controller> on uhci0 usb0: USB revision 1.0 uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 2 ports with 2 removable, self powered ums0: U22_B003 iFeel Mouse, rev 1.10/98.03, addr 2, iclass 3/1 ums0: 3 buttons and Z dir. chip0: <Intel 82371AB Power management controller> port 0x7000-0x700f at device 7.3 on pci0 xl0: <3Com 3c905C-TX Fast Etherlink XL> port 0x1800-0x187f mem 0xf4001000-0xf400 107f irq 11 at device 13.0 on pci0 xl0: Ethernet address: 00:01:02:3e:6d:64 miibus0: <MII bus> on xl0 xlphy0: <3c905C 10/100 internal PHY> on miibus0 xlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto pcm0: <CMedia CMI8738> port 0x1000-0x10ff irq 10 at device 14.0 on pci0 ahc0: <Adaptec aic7880 Ultra SCSI adapter> port 0x1400-0x14ff mem 0xf4000000-0xf 4000fff irq 3 at device 15.0 on pci0 aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs atapci1: <CMD 649 ATA100 controller> port 0x18b0-0x18bf,0x18c0-0x18c3,0x18c8-0x1 8cf,0x18c4-0x18c7,0x18d0-0x18d7 irq 9 at device 16.0 on pci0 ata2: at 0x18d0 on atapci1 ata3: at 0x18c8 on atapci1 orm0: <Option ROMs> at iomem 0xc0000-0xcafff,0xcb000-0xcb7ff,0xcb800-0xcdfff,0xe 0000-0xe3fff,0xe4000-0xeffff on isa0 pmtimer0 on isa0 fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0 fdc0: FIFO enabled, 8 bytes threshold fd0: <1440-KB 3.5" drive> on fdc0 drive 0 atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0 atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0 kbd0 at atkbd0 vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 sc0: <System console> at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0 sio0: type 16550A sio1: configured irq 3 not in bitmap of probed irqs 0 ppc0: <Parallel port> at port 0x378-0x37f irq 7 on isa0 ppc0: Generic chipset (EPP/NIBBLE) in COMPATIBLE mode plip0: <PLIP network interface> on ppbus0 lpt0: <Printer> on ppbus0 lpt0: Interrupt-driven port ppi0: <Parallel I/O> on ppbus0 pca0 at port 0x40 on isa0 ahc0: Someone reset channel A IP packet filtering initialized, divert enabled, rule-based forwarding enabled, default to deny, logging limited to 10000 packets/entry by default ad0: 16479MB <Maxtor 91728D8> [33483/16/63] at ata0-master UDMA33 ad5: 57241MB <OEM60-54A 60GB 5400RPM> [116301/16/63] at ata2-slave UDMA100 acd0: CD-RW <_NEC CD-RW NR-9200A> at ata1-master PIO4 Mounting root from ufs:/dev/ad0s1a da0 at ahc0 bus 0 target 0 lun 0 da0: <SEAGATE SX19171W 9D23> Fixed Direct Access SCSI-2 device da0: 40.000MB/s transfers (20.000MHz, offset 8, 16bit), Tagged Queueing Enabled da0: 8683MB (17783112 512 byte sectors: 255H 63S/T 1106C) ad5s4f: soft error (ECC corrected) reading fsbn 185641109 of 84275072-84275075 ( ad5s4 bn 185641109; cn 11555 tn 159 sn 17)ad5s4f: hard error reading fsbn 185641 109 of 84275072-84275075 (ad5s4 bn 185641109; cn 11555 tn 159 sn 17) trying PIO mode ad5: READ command timeout tag=0 serv=0 - resetting ata2: resetting devices .. ad5: removed from configuration done spec_getpages:(#ad/0x5002d) I/O read failure: (error=6) bp 0xca673d88 vp 0xd72e3 780 size: 0, resid: 0, a_count: 20480, valid: 0x0 nread: 0, reqpage: 0, pindex: 1, pcount: 5 vm_fault: pager read error, pid 786 (kdeinit) spec_getpages:(#ad/0x5002d) I/O read failure: (error=6) bp 0xca673d88 vp 0xd72e3 780 size: 0, resid: 0, a_count: 883, valid: 0x0 nread: 0, reqpage: 0, pindex: 0, pcount: 1 vm_fault: pager read error, pid 788 (kdeinit) spec_getpages:(#ad/0x5002d) I/O read failure: (error=6) bp 0xca673d88 vp 0xd72e3 780 size: 0, resid: 0, a_count: 883, valid: 0x0 nread: 0, reqpage: 0, pindex: 0, pcount: 1 vm_fault: pager read error, pid 788 (kdeinit) pid 788 (kdeinit), uid 1000: exited on signal 11 paulus# -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.0 (FreeBSD) iD8DBQFCKk3c//GaROrFlAkRAvI/AJ9l8kHMfYRpHzogr1mG+yoq6hKBOQCffmT6 aJL4RBNQCJITXSM8RC1FWgg=Xo2M -----END PGP SIGNATURE-----
Doug White
2005-Mar-08 18:38 UTC
Error with CMD 649 ATA100 controller while copying files over NFS on 4.11-Stable
On Sat, 5 Mar 2005, Eric Buchanan wrote:> -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > I switched one hard drive to a CMD 649 ATA100 controller last summer, and I've > had some trouble with it on 4.11-Stable. When I copy a large volume of files > over NFS (800 Megabytes, lets say) the add on controller will stop > recognizing my hard drive. This time the files were not being copied off a > drive handled by this controller. > > Any ideas? This only occurs when copying files over nfs, and it will happen > even when I am not copying anything involving the drive on the controller. > I've tried "atacontrol reinit 2" and it won't re-recognize the hard drive > (ad5s4f). I have absolutely zero problems when I am not using NFS, > especially zero problems with make worlds or building large ports on this > hard drive in question.the CMD controller chips are known to be poor quality, you may be better off switching to a different controller. And change the cables out while you're at it. The messages below indicate that the drive itself is getting confused and ceasing to respond, but its hard to tell.> ad5: 57241MB <OEM60-54A 60GB 5400RPM> [116301/16/63] at ata2-slave UDMA100What brand/model is this drive, actually? It looks like its lost its identity data.> ad5s4f: soft error (ECC corrected) reading fsbn 185641109 of 84275072-84275075 > ( > ad5s4 bn 185641109; cn 11555 tn 159 sn 17)ad5s4f: hard error reading fsbn > 185641 109 of > 84275072-84275075 (ad5s4 bn 185641109; cn 11555 tn 159 sn 17) trying PIO > mode > ad5: READ command timeout tag=0 serv=0 - resetting > ata2: resetting devices .. > ad5: removed from configuration > done-- Doug White | FreeBSD: The Power to Serve dwhite@gumbysoft.com | www.FreeBSD.org