I'm having some trouble with a SuperMicro SuperServer 6022L-6 that
previously ran 7.0-BETA4 without problems. Today, I updated this machine to
7.0-PRERELEASE and now it will not fully boot unless I disable ACPI. A quick
search of the PR database didn't turn up anything similar involving sysctl
and ACPI.
I can boot to single-user mode, but issuing sysctl -a there also crashes
the machine.
I have two vmcore files: one from booting to single-user mode and issuing
sysctl -a, the other from an attempted normal boot.
When running sysctl -a in single-user mode, the last three lines before the
crash are (transcribed by hand, no serial console available):
dev.pcib.3.%location: handle=\_SB_.PCI3
dev.pcib.3.%pnpinfo: _HID=PNP0A03 UID=3
dev.pcib.3.%parent: acpi0
========================================================================
Kernel config is GENERIC, with ULE scheduler and "options ASR_COMPAT"
========================================================================
[root@test1 /usr/obj/usr/src/sys/TEST]# uname -a
FreeBSD test1.hpcisp.com 7.0-PRERELEASE FreeBSD 7.0-PRERELEASE #1: Thu Feb 14 14:08:02 EST 2008 root@test1.hpcisp.com:/usr/obj/usr/src/sys/TEST i386
========================================================================
[root@test1 /usr/obj/usr/src/sys/TEST]# kgdb kernel.debug /var/crash/vmcore.1
Fatal trap 12: page fault while in kernel mode
cpuid = 3; apic id = 03
fault virtual address = 0x2043455c
fault code = supervisor read, page not present
instruction pointer = 0x20:0xc0743036
stack pointer = 0x28:0xe8cb3a0c
frame pointer = 0x28:0xe8cb3a38
code segment = base 0x0, limit 0xfffff, type 0x1b
= DPL 0, pres 1, def32 1, gran 1
processor eflags = interrupt enabled, resume, IOPL = 0
current process = 67 (sysctl)
trap number = 12
panic: page fault
cpuid = 3
Uptime: 6s
Physical memory: 2035 MB
Dumping 65 MB: 50 34 18 2
#0 doadump () at pcpu.h:195
195 __asm __volatile("movl %%fs:0,%0" : "=r" (td));
(kgdb) bt
#0 doadump () at pcpu.h:195
#1 0xc073aa38 in boot (howto=260) at /usr/src/sys/kern/kern_shutdown.c:409
#2 0xc073acf1 in panic (fmt=Variable "fmt" is not available.
) at /usr/src/sys/kern/kern_shutdown.c:563
#3 0xc0a1fd00 in trap_fatal (frame=0xe8cb39cc, eva=541279580) at /usr/src/sys/i386/i386/trap.c:899
#4 0xc0a1ff70 in trap_pfault (frame=0xe8cb39cc, usermode=0, eva=541279580) at /usr/src/sys/i386/i386/trap.c:812
#5 0xc0a208ed in trap (frame=0xe8cb39cc) at /usr/src/sys/i386/i386/trap.c:490
#6 0xc0a07bdb in calltrap () at /usr/src/sys/i386/i386/exception.s:139
#7 0xc0743036 in sysctl_sysctl_next_ls (lsp=Variable "lsp" is not available.
) at /usr/src/sys/kern/kern_sysctl.c:630
#8 0xc07430f6 in sysctl_sysctl_next_ls (lsp=Variable "lsp" is not available.
) at /usr/src/sys/kern/kern_sysctl.c:618
#9 0xc0743133 in sysctl_sysctl_next_ls (lsp=Variable "lsp" is not available.
) at /usr/src/sys/kern/kern_sysctl.c:630
#10 0xc0743133 in sysctl_sysctl_next_ls (lsp=Variable "lsp" is not available.
) at /usr/src/sys/kern/kern_sysctl.c:630
#11 0xc0743196 in sysctl_sysctl_next (oidp=0xc0b53280, arg1=0xe8cb3c1c, arg2=4, req=0xe8cb3ba4)
at /usr/src/sys/kern/kern_sysctl.c:651
#12 0xc0743aa2 in sysctl_root (oidp=Variable "oidp" is not available.
) at /usr/src/sys/kern/kern_sysctl.c:1306
#13 0xc0743bde in userland_sysctl (td=0xc5479660, name=0xe8cb3c14, namelen=6, old=0xbfbfe4e8, oldlenp=0xbfbfe598, inkernel=0, new=0x0, newlen=0, retval=0xe8cb3c10, flags=0) at /usr/src/sys/kern/kern_sysctl.c:1401
#14 0xc0744812 in __sysctl (td=0xc5479660, uap=0xe8cb3cfc) at /usr/src/sys/kern/kern_sysctl.c:1336
#15 0xc0a202b8 in syscall (frame=0xe8cb3d38) at /usr/src/sys/i386/i386/trap.c:1035
#16 0xc0a07c40 in Xint0x80_syscall () at /usr/src/sys/i386/i386/exception.s:196
#17 0x00000033 in ?? ()
Previous frame inner to this frame (corrupt stack?)
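
A cross-check on the trap data above, offered as an observation rather than
a diagnosis: the "fault virtual address" in the trap message and the eva=
value in frames #3 and #4 should be the same number in hex and decimal, and
it can be useful to look at the address as raw bytes. The short Python
sketch below does both. It uses only the two numbers printed above; the
little-endian byte order is an assumption that holds for i386. If the bytes
turn out to be printable ASCII, that might hint that string data ended up
where a pointer was expected, but it proves nothing by itself.

```python
import struct

# Values copied from the trap message and from the eva= argument in kgdb.
fault_hex = 0x2043455c
eva_decimal = 541279580

# Both notations should describe the same address.
same_address = (fault_hex == eva_decimal)

# i386 is little-endian, so pack the 32-bit value low byte first to see
# the bytes in the order they would sit in memory.
raw = struct.pack("<I", fault_hex)

# Decode as ASCII; this raises UnicodeDecodeError if any byte is >= 0x80.
as_text = raw.decode("ascii")

print("hex and decimal agree:", same_address)
print("bytes in memory order:", raw.hex(" "))
print("as ASCII text:", repr(as_text))
```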
========================================================================
dmesg is attached, but it is from a non-ACPI boot.
========================================================================
Anyone have any ideas on what might be the cause or a possible fix?
I'll keep the crash dumps around. This is a test box on which I'm
evaluating 7.0 for possible production use on similar hardware. It has no
other planned use, so anything goes in terms of possible debugging.
If I get some time next week I might try a binary search of commits between
BETA4 and now, to pinpoint where it stopped working.
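
The binary search mentioned above could be organized roughly as follows.
This is a minimal Python sketch, not a tested procedure: bisect_dates and
boots_ok are hypothetical names, boots_ok stands in for the manual step of
checking out the sources as of a given date, building a kernel, and seeing
whether it boots with ACPI enabled, and the start date used in the example
is a placeholder rather than the real BETA4 date. It also assumes the
breakage was introduced once and never fixed in between.

```python
from datetime import date, timedelta


def bisect_dates(good, bad, boots_ok):
    """Return the earliest date in (good, bad] whose kernel fails to boot.

    Assumes the kernel from `good` boots, the kernel from `bad` does not,
    and every date after the breaking change also fails.
    """
    while (bad - good).days > 1:
        # Pick the midpoint of the remaining window.
        mid = good + timedelta(days=(bad - good).days // 2)
        if boots_ok(mid):
            good = mid   # still works: the break is later
        else:
            bad = mid    # already broken: the break is here or earlier
    return bad


# Example with a simulated answer; a real run would build and boot a
# kernel for each date instead of calling this stand-in.
placeholder_start = date(2007, 12, 1)   # placeholder, not the BETA4 date
known_bad = date(2008, 2, 14)           # build date of the failing kernel
simulated_break = date(2008, 1, 10)     # made up for the demonstration

first_bad = bisect_dates(placeholder_start, known_bad,
                         lambda d: d < simulated_break)
print(first_bad)
```

With a window of about two and a half months, this needs roughly seven
build-and-boot cycles to narrow the change down to a single day.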
Thanks,
Jim
-------------- next part --------------
Copyright (c) 1992-2008 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 7.0-PRERELEASE #1: Thu Feb 14 14:08:02 EST 2008
root@test1.hpcisp.com:/usr/obj/usr/src/sys/TEST
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Intel(R) XEON(TM) CPU 2.00GHz (1999.95-MHz 686-class CPU)
Origin = "GenuineIntel" Id = 0xf24 Stepping = 4
Features=0x3febfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM>
Logical CPUs per core: 2
real memory = 2147418112 (2047 MB)
avail memory = 2091892736 (1994 MB)
MPTable: <AMI GCHE >
FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
cpu0 (BSP): APIC ID: 0
cpu1 (AP): APIC ID: 1
cpu2 (AP): APIC ID: 2
cpu3 (AP): APIC ID: 3
ioapic0: Assuming intbase of 0
ioapic1: Assuming intbase of 16
ioapic2: Assuming intbase of 32
ioapic3: Assuming intbase of 48
ioapic0 <Version 1.1> irqs 0-15 on motherboard
ioapic1 <Version 1.1> irqs 16-31 on motherboard
ioapic2 <Version 1.1> irqs 32-47 on motherboard
ioapic3 <Version 1.1> irqs 48-63 on motherboard
kbd1 at kbdmux0
ath_hal: 0.9.20.3 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, RF5413)
cpu0 on motherboard
p4tcc0: <CPU Frequency Thermal Control> on cpu0
cpu1 on motherboard
p4tcc1: <CPU Frequency Thermal Control> on cpu1
cpu2 on motherboard
p4tcc2: <CPU Frequency Thermal Control> on cpu2
cpu3 on motherboard
p4tcc3: <CPU Frequency Thermal Control> on cpu3
pcib0: <MPTable Host-PCI bridge> pcibus 0 on motherboard
pci0: <PCI bus> on pcib0
vgapci0: <VGA-compatible display> port 0xa800-0xa8ff mem 0xfd000000-0xfdffffff,0xfe5ff000-0xfe5fffff irq 18 at device 2.0 on pci0
fxp0: <Intel 82550 Pro/100 Ethernet> port 0xae80-0xaebf mem 0xfe5fc000-0xfe5fcfff,0xfe580000-0xfe59ffff irq 17 at device 4.0 on pci0
miibus0: <MII bus> on fxp0
inphy0: <i82555 10/100 media interface> PHY 1 on miibus0
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp0: Ethernet address: 00:30:48:20:a3:9e
fxp0: [ITHREAD]
fxp1: <Intel 82550 Pro/100 Ethernet> port 0xaf00-0xaf3f mem 0xfe5fd000-0xfe5fdfff,0xfe5a0000-0xfe5bffff irq 19 at device 5.0 on pci0
miibus1: <MII bus> on fxp1
inphy1: <i82555 10/100 media interface> PHY 1 on miibus1
inphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp1: Ethernet address: 00:30:48:20:a3:9f
fxp1: [ITHREAD]
isab0: <PCI-ISA bridge> at device 15.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <ServerWorks CSB5 UDMA100 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xffa0-0xffaf at device 15.1 on pci0
ata0: <ATA channel 0> on atapci0
ata0: [ITHREAD]
ata1: <ATA channel 1> on atapci0
ata1: [ITHREAD]
ohci0: <OHCI (generic) USB controller> mem 0xfe5fe000-0xfe5fefff irq 10 at device 15.2 on pci0
ohci0: [GIANT-LOCKED]
ohci0: [ITHREAD]
usb0: OHCI version 1.0, legacy support
usb0: SMM does not respond, resetting
usb0: <OHCI (generic) USB controller> on ohci0
usb0: USB revision 1.0
uhub0: <(0x1166) OHCI root hub, class 9/0, rev 1.00/1.00, addr 1> on usb0
uhub0: 4 ports with 4 removable, self powered
pcib1: <ServerWorks host to PCI bridge(unknown chipset)> pcibus 1 on motherboard
pir0: <PCI Interrupt Routing Table: 9 Entries> on motherboard
$PIR: Ignoring invalid BIOS IRQ 18 from 0.2.INTA for link 0x12
$PIR: Ignoring invalid BIOS IRQ 17 from 0.4.INTA for link 0x11
$PIR: Ignoring invalid BIOS IRQ 19 from 0.5.INTA for link 0x13
pci1: <PCI bus> on pcib1
pcib2: <ServerWorks host to PCI bridge(unknown chipset)> pcibus 2 on motherboard
pci2: <PCI bus> on pcib2
pcib3: <ServerWorks host to PCI bridge(unknown chipset)> pcibus 3 on motherboard
pci3: <PCI bus> on pcib3
pcib4: <MPTable Host-PCI bridge> pcibus 4 on motherboard
pci4: <PCI bus> on pcib4
asr0: <Adaptec Caching SCSI RAID> mem 0xfeb00000-0xfebfffff,0xfb000000-0xfbffffff,0xf8000000-0xf9ffffff irq 29 at device 3.0 on pci4
asr0: [GIANT-LOCKED]
asr0: [ITHREAD]
asr0: ADAPTEC 2005S FW Rev. 380E, 2 channel, 2000 CCBs, Protocol I2O
pmtimer0 on isa0
orm0: <ISA Option ROMs> at iomem 0xc0000-0xc7fff,0xc8000-0xcdfff,0xce000-0xcefff,0xcf000-0xcffff pnpid ORM0000 on isa0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
atkbd0: [ITHREAD]
psm0: <PS/2 Mouse> irq 12 on atkbdc0
psm0: [GIANT-LOCKED]
psm0: [ITHREAD]
psm0: model NetMouse/NetScroll Optical, device ID 0
fdc0: <Enhanced floppy controller> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: [FILTER]
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
ppc0: <Parallel port> at port 0x378-0x37f irq 7 on isa0
ppc0: Generic chipset (ECP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/8 bytes threshold
ppbus0: <Parallel port bus> on ppc0
ppbus0: [ITHREAD]
plip0: <PLIP network interface> on ppbus0
lpt0: <Printer> on ppbus0
lpt0: Interrupt-driven port
ppi0: <Parallel I/O> on ppbus0
ppc0: [GIANT-LOCKED]
ppc0: [ITHREAD]
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio0: [FILTER]
sio1 at port 0x2f8-0x2ff irq 3 on isa0
sio1: type 16550A
sio1: [FILTER]
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
unknown: <PNP0303> can't assign resources (port)
unknown: <PNP0f13> can't assign resources (irq)
unknown: <PNP0501> can't assign resources (port)
unknown: <PNP0501> can't assign resources (port)
unknown: <PNP0401> can't assign resources (port)
unknown: <PNP0700> can't assign resources (port)
Timecounters tick every 1.000 msec
acd0: CDROM <MATSHITA CR-177/7T0D> at ata1-master UDMA33
ses0 at asr0 bus 0 target 6 lun 0
ses0: <SUPER GEM318 0> Fixed Processor SCSI-2 device
SMP: AP CPU #1 Launched!
SMP: AP CPU #2 Launched!
SMP: AP CPU #3 Launched!
da0 at asr0 bus 0 target 0 lun 0
da0: <ADAPTEC RAID-5 380E> Fixed Direct Access SCSI-2 device
Trying to mount root from ufs:/dev/da0s1a
WARNING: / was not properly dismounted