Anthony Volodkin
2006-Jun-12 20:27 UTC
Intermittent Kernel Panics on Disk Activity/cvsup/make on 6.1-RELEASE
Hi,

On my Athlon XP 1800 / Abit KX7-333R system, I've been encountering intermittent kernel panics during periods of high disk activity or when running cvsup or make buildworld.

What's notable is that at this point the motherboard, CPU, and power supply have been replaced, and the system has no difficulties running Memtest86 for several hours. Additionally, this problem does not happen EACH time I generate a lot of disk load or run cvsup, only occasionally.

Per the handbook, I've built a debug kernel and captured crash data. Below are my dmesg and the output of several kgdb commands (a list at the faulting instruction pointer and a backtrace).

What are some of the next steps I can take to help resolve this or find the cause of these crashes? Any help is highly appreciated.

Please CC me when responding as I am not subscribed to this list.

Thank you,
Anthony Volodkin

----------------------------------------
kgdb kernel.debug /var/crash/vmcore.1
----------------------------------------
[GDB will not be able to debug user-mode threads: /usr/lib/libthread_db.so: Undefined symbol "ps_pglobal_lookup"]
GNU gdb 6.1.1 [FreeBSD]
... [snip] ...
This GDB was configured as "i386-marcel-freebsd".

Unread portion of the kernel message buffer:

Fatal trap 9: general protection fault while in kernel mode
instruction pointer = 0x20:0xc06aacfd
stack pointer = 0x28:0xe981ac10
frame pointer = 0x28:0xe981acdc
code segment = base 0x0, limit 0xfffff, type 0x1b
 = DPL 0, pres 1, def32 1, gran 1
processor eflags = interrupt enabled, resume, IOPL = 0
current process = 18257 (cvsup)
trap number = 9
panic: general protection fault
Uptime: 3d15h50m31s
Dumping 2047 MB (2 chunks)
 chunk 0: 1MB (159 pages) ... ok
 chunk 1: 2047MB (524016 pages) 2031 2015 1999 1983 1967 1951 1935 1919 1903 1887 1871 1855 1839 1823 1807 1791 1775 1759 1743 1727 1711 1695 1679 1663 1647 1631 1615 1599 1583 1567 1551 1535 1519 1503 1487 1471 1455 1439 1423 1407 1391 1375 1359 1343 1327 1311 1295 1279 1263 1247 1231 1215 1199 1183 1167 1151 1135 1119 1103 1087 1071 1055 1039 1023 1007 991 975 959 943 927 911 895 879 863 847 831 815 799 783 767 751 735 719 703 687 671 655 639 623 607 591 575 559 543 527 511 495 479 463 447 431 415 399 383 367 351 335 319 303 287 271 255 239 223 207 191 175 159 143 127 111 95 79 63 47 31 15

#0 doadump () at pcpu.h:165
165 __asm __volatile("movl %%fs:0,%0" : "=r" (td));

----------------------------------------
(kgdb) list *0xc06aacfd
----------------------------------------
0xc06aacfd is in lseek (/usr/src/sys/kern/vfs_syscalls.c:1788).
1783            goto drop;
1784        fp->f_offset = offset;
1785        *(off_t *)(td->td_retval) = fp->f_offset;
1786    drop:
1787        fdrop(fp, td);
1788        VFS_UNLOCK_GIANT(vfslocked);
1789        return (error);
1790    }
1791
1792    #if defined(COMPAT_43)

----------------------------------------
(kgdb) bt
----------------------------------------
#0 doadump () at pcpu.h:165
#1 0xc064dee1 in boot (howto=260) at /usr/src/sys/kern/kern_shutdown.c:402
#2 0xc064e178 in panic (fmt=0xc088cb0e "%s") at /usr/src/sys/kern/kern_shutdown.c:558
#3 0xc0841d94 in trap_fatal (frame=0xe981abd0, eva=0) at /usr/src/sys/i386/i386/trap.c:836
#4 0xc08418bc in trap (frame {tf_fs = 8, tf_es = 40, tf_ds = 40, tf_edi = -377377532, tf_esi = -948068912, tf_ebp = -377377572, tf_isp = -377377796, tf_ebx = 0, tf_edx = -969420352, tf_ecx = -911557632, tf_eax = 0, tf_trapno = 9, tf_err = 48128, tf_eip = -1066750723, tf_cs = 32, tf_eflags = 66050, tf_esp = -960093848, tf_ss = -911557632})
    at /usr/src/sys/i386/i386/trap.c:631
#5 0xc0830c9a in calltrap () at /usr/src/sys/i386/i386/exception.s:139
#6 0xc06aacfd in lseek (td=0xc9aabc00, uap=0xe981ad04) at /usr/src/sys/kern/vfs_syscalls.c:1787
#7 0xc08420ab in syscall (frame {tf_fs = 59, tf_es = 59, tf_ds = 59, tf_edi = 672019104, tf_esi = -1077940896, tf_ebp = 136321580, tf_isp = -377377436, tf_ebx = 673236232, tf_edx = 0, tf_ecx = 118, tf_eax = 198, tf_trapno = 0, tf_err = 2, tf_eip = 673179699, tf_cs = 51, tf_eflags = 514, tf_esp = 136321536, tf_ss = 59})
    at /usr/src/sys/i386/i386/trap.c:981
#8 0xc0830cef in Xint0x80_syscall () at /usr/src/sys/i386/i386/exception.s:200
#9 0x00000033 in ?? ()
Previous frame inner to this frame (corrupt stack?)

----------------------------------------
dmesg
----------------------------------------
Copyright (c) 1992-2006 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
    The Regents of the University of California. All rights reserved.
FreeBSD 6.1-RELEASE #0: Fri May 26 03:01:47 EDT 2006
    root@superior.local.non-standard.net:/usr/obj/usr/src/sys/SUPERIOR
mptable_probe: MP Config Table has bad signature: \M^D\^A\^A
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: AMD Athlon(tm) XP 1800+ (1533.99-MHz 686-class CPU)
  Origin = "AuthenticAMD" Id = 0x662 Stepping = 2
  Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
  AMD Features=0xc0480800<SYSCALL,MP,MMX+,3DNow+,3DNow>
real memory = 2147418112 (2047 MB)
avail memory = 2096558080 (1999 MB)
kbd1 at kbdmux0
acpi0: <VIA694 AWRDACPI> on motherboard
acpi0: Power Button (fixed)
Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
acpi_timer0: <24-bit timer at 3.579545MHz> port 0x4008-0x400b on acpi0
cpu0: <ACPI CPU> on acpi0
acpi_throttle0: <ACPI CPU Throttling> on cpu0
acpi_button0: <Power Button> on acpi0
pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff,0x4000-0x407f,0x4080-0x40ff,0x5000-0x500f on acpi0
pci0: <ACPI PCI bus> on pcib0
agp0: <VIA 8367 (KT266/KY266x/KT333) host to PCI bridge> mem 0xe4000000-0xe47fffff at device 0.0 on pci0
pcib1: <PCI-PCI bridge> at device 1.0 on pci0
pci1: <PCI bus> on pcib1
pci1: <display, VGA> at device 0.0 (no driver attached)
fxp0: <Intel 82559 Pro/100 Ethernet> port 0xb000-0xb03f mem 0xe4904000-0xe4904fff,0xe4800000-0xe48fffff irq 10 at device 9.0 on pci0
miibus0: <MII bus> on fxp0
inphy0: <i82555 10/100 media interface> on miibus0
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp0: Ethernet address: 00:90:27:a7:39:6e
atapci0: <Promise PDC20269 UDMA133 controller> port 0xb400-0xb407,0xb800-0xb803,0xbc00-0xbc07,0xc000-0xc003,0xc400-0xc40f mem 0xe4900000-0xe4903fff irq 5 at device 15.0 on pci0
ata2: <ATA channel 0> on atapci0
ata3: <ATA channel 1> on atapci0
isab0: <PCI-ISA bridge> at device 17.0 on pci0
isa0: <ISA bus> on isab0
atapci1: <VIA 8233A UDMA133 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xc800-0xc80f at device 17.1 on pci0
ata0: <ATA channel 0> on atapci1
ata1: <ATA channel 1> on atapci1
uhci0: <VIA 83C572 USB controller> port 0xcc00-0xcc1f irq 11 at device 17.2 on pci0
uhci0: [GIANT-LOCKED]
usb0: <VIA 83C572 USB controller> on uhci0
usb0: USB revision 1.0
uhub0: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhci1: <VIA 83C572 USB controller> port 0xd000-0xd01f irq 11 at device 17.3 on pci0
uhci1: [GIANT-LOCKED]
usb1: <VIA 83C572 USB controller> on uhci1
usb1: USB revision 1.0
uhub1: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
atapci2: <HighPoint HighPoint UDMA133 controller> port 0xd400-0xd407,0xd800-0xd803,0xdc00-0xdc07,0xe000-0xe003,0xe400-0xe4ff irq 10 at device 19.0 on pci0
ata4: <ATA channel 0> on atapci2
ata5: <ATA channel 1> on atapci2
acpi_tz0: <Thermal Zone> on acpi0
fdc0: <floppy drive controller> port 0x3f2-0x3f5,0x3f7 irq 6 drq 2 on acpi0
fdc0: [FAST]
sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
sio0: type 16550A
sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0
sio1: type 16550A
ppc0: <Standard parallel printer port> port 0x378-0x37f,0x778-0x77b irq 7 on acpi0
ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode
ppbus0: <Parallel port bus> on ppc0
plip0: <PLIP network interface> on ppbus0
lpt0: <Printer> on ppbus0
lpt0: Interrupt-driven port
ppi0: <Parallel I/O> on ppbus0
atkbdc0: <Keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
pmtimer0 on isa0
orm0: <ISA Option ROMs> at iomem 0xc0000-0xc7fff,0xc8000-0xca7ff,0xcb000-0xcc7ff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Timecounter "TSC" frequency 1533986228 Hz quality 800
Timecounters tick every 1.000 msec
ad0: 57259MB <MAXTOR 6L060J3 A93.0500> at ata0-master UDMA133
ad2: 190782MB <Seagate ST3200822A 3.01> at ata1-master UDMA100
ad3: 78167MB <Maxtor 6Y080L0 YAR41VW0> at ata1-slave UDMA133
ad4: 239372MB <Maxtor 6B250R0 BAH41BM0> at ata2-master UDMA133
ad5: 238475MB <WDC WD2500JB-32FUA0 15.05R15> at ata2-slave UDMA100
ad6: 194481MB <Maxtor 6L200P0 BAJ41G20> at ata3-master UDMA133
ad7: 76345MB <MAXTOR 6L080L4 A93.0500> at ata3-slave UDMA133
Trying to mount root from ufs:/dev/ad0s1a
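
A note for anyone reading the trap frames in the backtrace: kgdb prints the saved registers as signed 32-bit decimals, so a value such as tf_eip = -1066750723 is hard to compare with the hex addresses in the panic message. The following is a minimal Python sketch for converting them; the helper name to_u32_hex is hypothetical and not part of kgdb or FreeBSD, and the dictionary only holds a few values copied from frame #4.

```python
def to_u32_hex(value: int) -> str:
    """Render a signed 32-bit decimal as an unsigned 32-bit hex string."""
    # Masking with 0xFFFFFFFF maps negative values to their two's-complement form.
    return f"0x{value & 0xFFFFFFFF:08x}"


# A few registers copied from frame #4 (trap) of the backtrace.
trap_frame = {
    "tf_eip": -1066750723,
    "tf_ebp": -377377572,
    "tf_ecx": -911557632,
    "tf_err": 48128,
}

for name, value in trap_frame.items():
    print(f"{name} = {to_u32_hex(value)}")
```

With these inputs, tf_eip converts to 0xc06aacfd and tf_ebp to 0xe981acdc, which match the instruction pointer and frame pointer reported in the panic message, and tf_ecx converts to 0xc9aabc00, the td argument shown in frame #6.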