Eric Buchanan
2005-Mar-06 00:25 UTC
Error with CMD 649 ATA100 controller while copying files over NFS on 4.11-Stable
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
I switched one hard drive to a CMD 649 ATA100 controller last summer, and
I've
had some trouble with it on 4.11-Stable. When I copy a large volume of files
over NFS (800 Megabytes, lets say) the add on controller will stop
recognizing my hard drive. This time the files were not being copied off a
drive handled by this controller.
Any ideas? This only occurs when copying files over nfs, and it will happen
even when I am not copying anything involving the drive on the controller.
I've tried "atacontrol reinit 2" and it won't re-recognize the
hard drive
(ad5s4f). I have absolutely zero problems when I am not using NFS,
especially zero problems with make worlds or building large ports on this
hard drive in question.
Thanks,
Eric Buchanan
Here is the dmesg I see after the hard drive is unrecognized by the ata
controller:
Copyright (c) 1992-2005 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 4.11-STABLE #69: Mon Feb 14 15:38:38 PST 2005
idaho@paulus.org:/backup/src/sys/compile/MYKERNEL
Timecounter "i8254" frequency 1193182 Hz
CPU: Intel(R) Celeron(TM) CPU 1400MHz (1396.79-MHz 686-class
CPU)
Origin = "GenuineIntel" Id = 0x6b4 Stepping = 4
Features=0x383f9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PA
T,PSE36,MMX,FXSR,SSE>
real memory = 436207616 (425984K bytes)
avail memory = 413069312 (403388K bytes)
Preloaded elf kernel "kernel" at 0xc0abe000.
Preloaded elf module "linux.ko" at 0xc0abe09c.
Preloaded elf module "nvidia.ko" at 0xc0abe13c.
Pentium Pro MTRR support enabled
md0: Malloc disk
Using $PIR table, 10 entries at 0xc00fdf20
npx0: <math processor> on motherboard
npx0: INT 16 interface
pcib0: <Intel 82443BX (440 BX) host to PCI bridge> on motherboard
pci0: <PCI bus> on pcib0
agp0: <Intel 82443BX (440 BX) host to PCI bridge> mem
0xf8000000-0xfbffffff at
d
evice 0.0 on pci0
pcib1: <Intel 82443BX (440 BX) PCI-PCI (AGP) bridge> at device 1.0 on pci0
pci1: <PCI bus> on pcib1
nvidia0: <RIVA TNT2 Model 64/Model 64 Pro> mem
0xfc000000-0xfdffffff,0xf5000000-
0xf5ffffff irq 11 at device 0.0 on pci1
isab0: <Intel 82371AB PCI to ISA bridge> at device 7.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <Intel PIIX4 ATA33 controller> port 0x18a0-0x18af at device 7.1
on
pci0
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
uhci0: <Intel 82371AB/EB (PIIX4) USB controller> port 0x1880-0x189f irq 9
at
dev
ice 7.2 on pci0
usb0: <Intel 82371AB/EB (PIIX4) USB controller> on uhci0
usb0: USB revision 1.0
uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
ums0: U22_B003 iFeel Mouse, rev 1.10/98.03, addr 2, iclass 3/1
ums0: 3 buttons and Z dir.
chip0: <Intel 82371AB Power management controller> port 0x7000-0x700f at
device
7.3 on pci0
xl0: <3Com 3c905C-TX Fast Etherlink XL> port 0x1800-0x187f mem
0xf4001000-0xf400
107f irq 11 at device 13.0 on pci0
xl0: Ethernet address: 00:01:02:3e:6d:64
miibus0: <MII bus> on xl0
xlphy0: <3c905C 10/100 internal PHY> on miibus0
xlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
pcm0: <CMedia CMI8738> port 0x1000-0x10ff irq 10 at device 14.0 on pci0
ahc0: <Adaptec aic7880 Ultra SCSI adapter> port 0x1400-0x14ff mem
0xf4000000-0xf
4000fff irq 3 at device 15.0 on pci0
aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs
atapci1: <CMD 649 ATA100 controller> port
0x18b0-0x18bf,0x18c0-0x18c3,0x18c8-0x1
8cf,0x18c4-0x18c7,0x18d0-0x18d7 irq 9 at device 16.0 on pci0
ata2: at 0x18d0 on atapci1
ata3: at 0x18c8 on atapci1
orm0: <Option ROMs> at iomem
0xc0000-0xcafff,0xcb000-0xcb7ff,0xcb800-0xcdfff,0xe
0000-0xe3fff,0xe4000-0xeffff on isa0
pmtimer0 on isa0
fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
kbd0 at atkbd0
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio1: configured irq 3 not in bitmap of probed irqs 0
ppc0: <Parallel port> at port 0x378-0x37f irq 7 on isa0
ppc0: Generic chipset (EPP/NIBBLE) in COMPATIBLE mode
plip0: <PLIP network interface> on ppbus0
lpt0: <Printer> on ppbus0
lpt0: Interrupt-driven port
ppi0: <Parallel I/O> on ppbus0
pca0 at port 0x40 on isa0
ahc0: Someone reset channel A
IP packet filtering initialized, divert enabled, rule-based forwarding
enabled,
default to deny, logging limited to 10000 packets/entry by default
ad0: 16479MB <Maxtor 91728D8> [33483/16/63] at ata0-master UDMA33
ad5: 57241MB <OEM60-54A 60GB 5400RPM> [116301/16/63] at ata2-slave UDMA100
acd0: CD-RW <_NEC CD-RW NR-9200A> at ata1-master PIO4
Mounting root from ufs:/dev/ad0s1a
da0 at ahc0 bus 0 target 0 lun 0
da0: <SEAGATE SX19171W 9D23> Fixed Direct Access SCSI-2 device
da0: 40.000MB/s transfers (20.000MHz, offset 8, 16bit), Tagged Queueing
Enabled
da0: 8683MB (17783112 512 byte sectors: 255H 63S/T 1106C)
ad5s4f: soft error (ECC corrected) reading fsbn 185641109 of 84275072-84275075
(
ad5s4 bn 185641109; cn 11555 tn 159 sn 17)ad5s4f: hard error reading fsbn
185641 109 of
84275072-84275075 (ad5s4 bn 185641109; cn 11555 tn 159 sn 17) trying PIO
mode
ad5: READ command timeout tag=0 serv=0 - resetting
ata2: resetting devices ..
ad5: removed from configuration
done
spec_getpages:(#ad/0x5002d) I/O read failure: (error=6) bp 0xca673d88 vp
0xd72e3 780
size: 0, resid: 0, a_count: 20480, valid: 0x0
nread: 0, reqpage: 0, pindex: 1, pcount: 5
vm_fault: pager read error, pid 786 (kdeinit)
spec_getpages:(#ad/0x5002d) I/O read failure: (error=6) bp 0xca673d88 vp
0xd72e3 780
size: 0, resid: 0, a_count: 883, valid: 0x0
nread: 0, reqpage: 0, pindex: 0, pcount: 1
vm_fault: pager read error, pid 788 (kdeinit)
spec_getpages:(#ad/0x5002d) I/O read failure: (error=6) bp 0xca673d88 vp
0xd72e3 780
size: 0, resid: 0, a_count: 883, valid: 0x0
nread: 0, reqpage: 0, pindex: 0, pcount: 1
vm_fault: pager read error, pid 788 (kdeinit)
pid 788 (kdeinit), uid 1000: exited on signal 11
paulus#
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.0 (FreeBSD)
iD8DBQFCKk3c//GaROrFlAkRAvI/AJ9l8kHMfYRpHzogr1mG+yoq6hKBOQCffmT6
aJL4RBNQCJITXSM8RC1FWgg=Xo2M
-----END PGP SIGNATURE-----
Doug White
2005-Mar-08 18:38 UTC
Error with CMD 649 ATA100 controller while copying files over NFS on 4.11-Stable
On Sat, 5 Mar 2005, Eric Buchanan wrote:> -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > I switched one hard drive to a CMD 649 ATA100 controller last summer, and I've > had some trouble with it on 4.11-Stable. When I copy a large volume of files > over NFS (800 Megabytes, lets say) the add on controller will stop > recognizing my hard drive. This time the files were not being copied off a > drive handled by this controller. > > Any ideas? This only occurs when copying files over nfs, and it will happen > even when I am not copying anything involving the drive on the controller. > I've tried "atacontrol reinit 2" and it won't re-recognize the hard drive > (ad5s4f). I have absolutely zero problems when I am not using NFS, > especially zero problems with make worlds or building large ports on this > hard drive in question.the CMD controller chips are known to be poor quality, you may be better off switching to a different controller. And change the cables out while you're at it. The messages below indicate that the drive itself is getting confused and ceasing to respond, but its hard to tell.> ad5: 57241MB <OEM60-54A 60GB 5400RPM> [116301/16/63] at ata2-slave UDMA100What brand/model is this drive, actually? It looks like its lost its identity data.> ad5s4f: soft error (ECC corrected) reading fsbn 185641109 of 84275072-84275075 > ( > ad5s4 bn 185641109; cn 11555 tn 159 sn 17)ad5s4f: hard error reading fsbn > 185641 109 of > 84275072-84275075 (ad5s4 bn 185641109; cn 11555 tn 159 sn 17) trying PIO > mode > ad5: READ command timeout tag=0 serv=0 - resetting > ata2: resetting devices .. > ad5: removed from configuration > done-- Doug White | FreeBSD: The Power to Serve dwhite@gumbysoft.com | www.FreeBSD.org