Hello everybody
I have a big problem
There is one FreeBSD server in our company. The server platform is: Supermicro
SuperServer 6014V-T2B (2x Intel Xeon 2.8, 1Gb RAM, 3WARE 3W-8006-2LP
RAID-Controller).
The server works as:
- a gateway between LAN and Internet
- an Intranet web- and database server (Apache + MySQL + PHP)
- a firewall (OpenBSD pf)
- a transparent proxy server (Squid)
A mounthly traffic through this server is about 100Gb. There is about 200
internet users in our conpany.
Here is a part of my dmesg-listing:
Copyright (c) 1992-2007 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 6.2-RELEASE-p8 #2: Thu Oct 11 19:51:25 MSD 2007
sa@gateway.konliga.ru:/usr/obj/usr/src/sys/KERNEL01_NOSMP
module_register: module pci/em already exists!
Module pci/em failed to register: 17
ACPI APIC Table: <A M I OEMAPIC >
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Intel(R) Xeon(TM) CPU 2.80GHz (2800.12-MHz 686-class CPU)
Origin = "GenuineIntel" Id = 0xf43 Stepping = 3
Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE>
Features2=0x641d<SSE3,RSVD2,MON,DS_CPL,CNTX-ID,CX16,<b14>>
AMD Features=0x20000000<LM>
Logical CPUs per core: 2
real memory = 1073479680 (1023 MB)
avail memory = 1041465344 (993 MB)
ioapic0 <Version 2.0> irqs 0-23 on motherboard
ioapic1 <Version 2.0> irqs 24-47 on motherboard
ichwd module loaded
kbd1 at kbdmux0
ath_hal: 0.9.17.2 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, RF5413)
acpi0: <A M I OEMRSDT> on motherboard
acpi0: Power Button (fixed)
Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
acpi_timer0: <24-bit timer at 3.579545MHz> port 0x408-0x40b on acpi0
cpu0: <ACPI CPU> on acpi0
acpi_throttle0: <ACPI CPU Throttling> on cpu0
pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
pci0: <ACPI PCI bus> on pcib0
pcib1: <ACPI PCI-PCI bridge> irq 16 at device 2.0 on pci0
pci1: <ACPI PCI bus> on pcib1
pcib2: <ACPI PCI-PCI bridge> irq 16 at device 3.0 on pci0
pci2: <ACPI PCI bus> on pcib2
pcib3: <ACPI PCI-PCI bridge> at device 28.0 on pci0
pci3: <ACPI PCI bus> on pcib3
twe0: <3ware Storage Controller. Driver version 1.50.01.002> port
0xbc00-0xbc0f mem 0xfc9ffc00-0xfc9ffc0f,0xfc000000-0xfc7fffff irq 24 at device
1.0 on pci3
twe0: [GIANT-LOCKED]
twe0: 2 ports, Firmware FE8S 1.05.00.068, BIOS BE7X 1.08.00.048
em0: <Intel(R) PRO/1000 Network Connection Version - 6.6.6> port
0xb800-0xb83f mem 0xfc9c0000-0xfc9dffff irq 26 at device 3.0 on pci3
em0: Ethernet address: 00:30:48:58:4d:2a
em0: [FAST]
em1: <Intel(R) PRO/1000 Network Connection Version - 6.6.6> port
0xb400-0xb43f mem 0xfc9a0000-0xfc9bffff irq 27 at device 4.0 on pci3
em1: Ethernet address: 00:30:48:58:4d:2b
em1: [FAST]
uhci0: <UHCI (generic) USB controller> port 0xe800-0xe81f irq 16 at device
29.0 on pci0
uhci0: [GIANT-LOCKED]
usb0: <UHCI (generic) USB controller> on uhci0
usb0: USB revision 1.0
uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhci1: <UHCI (generic) USB controller> port 0xec00-0xec1f irq 19 at device
29.1 on pci0
uhci1: [GIANT-LOCKED]
usb1: <UHCI (generic) USB controller> on uhci1
usb1: USB revision 1.0
uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
pci0: <base peripheral> at device 29.4 (no driver attached)
pci0: <base peripheral, interrupt controller> at device 29.5 (no driver
attached)
ehci0: <Intel 6300ESB USB 2.0 controller> mem 0xfebffc00-0xfebfffff irq 23
at device 29.7 on pci0
ehci0: [GIANT-LOCKED]
usb2: EHCI version 1.0
usb2: companion controllers, 2 ports each: usb0 usb1
usb2: <Intel 6300ESB USB 2.0 controller> on ehci0
usb2: USB revision 2.0
uhub2: Intel EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub2: 4 ports with 4 removable, self powered
pcib4: <ACPI PCI-PCI bridge> at device 30.0 on pci0
pci4: <ACPI PCI bus> on pcib4
pci4: <display, VGA> at device 5.0 (no driver attached)
isab0: <PCI-ISA bridge> at device 31.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <Intel 6300ESB UDMA100 controller> port
0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xfc00-0xfc0f at device 31.1 on pci0
ata0: <ATA channel 0> on atapci0
ata1: <ATA channel 1> on atapci0
pci0: <serial bus, SMBus> at device 31.3 (no driver attached)
acpi_button0: <Power Button> on acpi0
acpi_button1: <Sleep Button> on acpi0
sio0: configured irq 4 not in bitmap of probed irqs 0
sio0: port may not be enabled
sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on
acpi0
sio0: type 16550A
sio1: configured irq 3 not in bitmap of probed irqs 0
sio1: port may not be enabled
sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0
sio1: type 16550A
fdc0: <floppy drive controller (FDE)> port 0x3f0-0x3f5,0x3f7 irq 6 drq 2
on acpi0
fdc0: [FAST]
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
ppc0: <ECP parallel printer port> port 0x378-0x37f,0x778-0x77f irq 7 drq 3
on acpi0
ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/9 bytes threshold
ppbus0: <Parallel port bus> on ppc0
plip0: <PLIP network interface> on ppbus0
lpt0: <Printer> on ppbus0
lpt0: Interrupt-driven port
ppi0: <Parallel I/O> on ppbus0
atkbdc0: <Keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
psm0: <PS/2 Mouse> irq 12 on atkbdc0
psm0: [GIANT-LOCKED]
psm0: model IntelliMouse, device ID 3
ichwd0: <Intel 6300ESB watchdog timer> on isa0
pmtimer0 on isa0
orm0: <ISA Option ROMs> at iomem
0xc0000-0xc7fff,0xc8000-0xc8fff,0xc9800-0xca7ff,0xca800-0xcb7ff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Timecounter "TSC" frequency 2800118202 Hz quality 800
Timecounters tick every 1.000 msec
acd0: CDROM <CD-224E-N/1.AA> at ata0-master UDMA33
twed0: <Unit 0, TwinStor, Normal> on twe0
twed0: 152626MB (312579760 sectors)
Trying to mount root from ufs:/dev/twed0s1a
ext0: link state changed to UP
int0: link state changed to UP
vlan0: link state changed to UP
This server hangs up every day without any messages in the log files and on the
system console. A keyboard dosen't work too. I can make only hard reset and
after restart coredump files are not appearing.
Here is my kernel configuration file:
include GENERIC
ident KERNEL01_NOSMP
device ichwd # Intel ICH watchdog timer
#options SMP
options ALTQ
options ALTQ_CBQ
options ALTQ_RED
options ALTQ_RIO
options ALTQ_HFSC
options ALTQ_PRIQ
#options ALTQ_NOPCC
options SC_DISABLE_REBOOT
options MP_WATCHDOG
options SW_WATCHDOG
If I make and install a kernel with SMP options the system under working load
begins hang up every two hours.
The two days "Memtest" gave no result.
I tried to install the newest Intel ethernet adapter driver, but without any
results.
As an experiment I tried also to plug a system HDD to another sever platform
(SuperServer 6015V-TB), but system hanging didn't stop.
I think that it is not only hardware problem.
Linux (Gentoo) and Windows server 2003 on this hardware were working fine.
Please help me to find a solution and solve a problem.
Your faithfully
Dmitry Komaleev
IT Manager
"EDIPRESSE-KONLIGA" http://www.konliga.ru
Russia, Moscow
tel.: +7 (495) 775-14-35, ext. 169
fax: +7 (495) 775-14-34
P.S. I have written the Bug Report on my problem but have received only one
advice to turn off ACPI-option.
If I disable ACPI, then the RAID-controller and both of the ethernet controllers
on my server recieve the same IRQ. I believe this is not good.
A system failure of this sort (one which leaves no log entries of any kind) is generally a hardware fault; memory stick failures tend to cause kernel panics and easy repeatability. I would suggest examining the hardware components, the motherboard could have some faulty capacitors (burst, leaking, or swollen); the fans on the processors could be failing causing a lockup, the power supply fans could be failing causing an undervolt and lockup, but this usually makes the system reset. You get the idea, your symptoms are pointing to hardware issues in my opinion. David On 10/31/07, ??????? ???????? <d.komaleev@konliga.ru> wrote:> Hello everybody > > I have a big problem > > There is one FreeBSD server in our company. The server platform is: Supermicro SuperServer 6014V-T2B (2x Intel Xeon 2.8, 1Gb RAM, 3WARE 3W-8006-2LP RAID-Controller). > The server works as: > - a gateway between LAN and Internet > - an Intranet web- and database server (Apache + MySQL + PHP) > - a firewall (OpenBSD pf) > - a transparent proxy server (Squid) > A mounthly traffic through this server is about 100Gb. There is about 200 internet users in our conpany. > Here is a part of my dmesg-listing: > > Copyright (c) 1992-2007 The FreeBSD Project. > Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 > The Regents of the University of California. All rights reserved. > FreeBSD is a registered trademark of The FreeBSD Foundation. > FreeBSD 6.2-RELEASE-p8 #2: Thu Oct 11 19:51:25 MSD 2007 > sa@gateway.konliga.ru:/usr/obj/usr/src/sys/KERNEL01_NOSMP > module_register: module pci/em already exists! > Module pci/em failed to register: 17 > ACPI APIC Table: <A M I OEMAPIC > > Timecounter "i8254" frequency 1193182 Hz quality 0 > CPU: Intel(R) Xeon(TM) CPU 2.80GHz (2800.12-MHz 686-class CPU) > Origin = "GenuineIntel" Id = 0xf43 Stepping = 3 > Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE> > Features2=0x641d<SSE3,RSVD2,MON,DS_CPL,CNTX-ID,CX16,<b14>> > AMD Features=0x20000000<LM> > Logical CPUs per core: 2 > real memory = 1073479680 (1023 MB) > avail memory = 1041465344 (993 MB) > ioapic0 <Version 2.0> irqs 0-23 on motherboard > ioapic1 <Version 2.0> irqs 24-47 on motherboard > ichwd module loaded > kbd1 at kbdmux0 > ath_hal: 0.9.17.2 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, RF5413) > acpi0: <A M I OEMRSDT> on motherboard > acpi0: Power Button (fixed) > Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 > acpi_timer0: <24-bit timer at 3.579545MHz> port 0x408-0x40b on acpi0 > cpu0: <ACPI CPU> on acpi0 > acpi_throttle0: <ACPI CPU Throttling> on cpu0 > pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0 > pci0: <ACPI PCI bus> on pcib0 > pcib1: <ACPI PCI-PCI bridge> irq 16 at device 2.0 on pci0 > pci1: <ACPI PCI bus> on pcib1 > pcib2: <ACPI PCI-PCI bridge> irq 16 at device 3.0 on pci0 > pci2: <ACPI PCI bus> on pcib2 > pcib3: <ACPI PCI-PCI bridge> at device 28.0 on pci0 > pci3: <ACPI PCI bus> on pcib3 > twe0: <3ware Storage Controller. Driver version 1.50.01.002> port 0xbc00-0xbc0f mem 0xfc9ffc00-0xfc9ffc0f,0xfc000000-0xfc7fffff irq 24 at device 1.0 on pci3 > twe0: [GIANT-LOCKED] > twe0: 2 ports, Firmware FE8S 1.05.00.068, BIOS BE7X 1.08.00.048 > em0: <Intel(R) PRO/1000 Network Connection Version - 6.6.6> port 0xb800-0xb83f mem 0xfc9c0000-0xfc9dffff irq 26 at device 3.0 on pci3 > em0: Ethernet address: 00:30:48:58:4d:2a > em0: [FAST] > em1: <Intel(R) PRO/1000 Network Connection Version - 6.6.6> port 0xb400-0xb43f mem 0xfc9a0000-0xfc9bffff irq 27 at device 4.0 on pci3 > em1: Ethernet address: 00:30:48:58:4d:2b > em1: [FAST] > uhci0: <UHCI (generic) USB controller> port 0xe800-0xe81f irq 16 at device 29.0 on pci0 > uhci0: [GIANT-LOCKED] > usb0: <UHCI (generic) USB controller> on uhci0 > usb0: USB revision 1.0 > uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > uhub0: 2 ports with 2 removable, self powered > uhci1: <UHCI (generic) USB controller> port 0xec00-0xec1f irq 19 at device 29.1 on pci0 > uhci1: [GIANT-LOCKED] > usb1: <UHCI (generic) USB controller> on uhci1 > usb1: USB revision 1.0 > uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > uhub1: 2 ports with 2 removable, self powered > pci0: <base peripheral> at device 29.4 (no driver attached) > pci0: <base peripheral, interrupt controller> at device 29.5 (no driver attached) > ehci0: <Intel 6300ESB USB 2.0 controller> mem 0xfebffc00-0xfebfffff irq 23 at device 29.7 on pci0 > ehci0: [GIANT-LOCKED] > usb2: EHCI version 1.0 > usb2: companion controllers, 2 ports each: usb0 usb1 > usb2: <Intel 6300ESB USB 2.0 controller> on ehci0 > usb2: USB revision 2.0 > uhub2: Intel EHCI root hub, class 9/0, rev 2.00/1.00, addr 1 > uhub2: 4 ports with 4 removable, self powered > pcib4: <ACPI PCI-PCI bridge> at device 30.0 on pci0 > pci4: <ACPI PCI bus> on pcib4 > pci4: <display, VGA> at device 5.0 (no driver attached) > isab0: <PCI-ISA bridge> at device 31.0 on pci0 > isa0: <ISA bus> on isab0 > atapci0: <Intel 6300ESB UDMA100 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xfc00-0xfc0f at device 31.1 on pci0 > ata0: <ATA channel 0> on atapci0 > ata1: <ATA channel 1> on atapci0 > pci0: <serial bus, SMBus> at device 31.3 (no driver attached) > acpi_button0: <Power Button> on acpi0 > acpi_button1: <Sleep Button> on acpi0 > sio0: configured irq 4 not in bitmap of probed irqs 0 > sio0: port may not be enabled > sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 > sio0: type 16550A > sio1: configured irq 3 not in bitmap of probed irqs 0 > sio1: port may not be enabled > sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0 > sio1: type 16550A > fdc0: <floppy drive controller (FDE)> port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0 > fdc0: [FAST] > fd0: <1440-KB 3.5" drive> on fdc0 drive 0 > ppc0: <ECP parallel printer port> port 0x378-0x37f,0x778-0x77f irq 7 drq 3 on acpi0 > ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode > ppc0: FIFO with 16/16/9 bytes threshold > ppbus0: <Parallel port bus> on ppc0 > plip0: <PLIP network interface> on ppbus0 > lpt0: <Printer> on ppbus0 > lpt0: Interrupt-driven port > ppi0: <Parallel I/O> on ppbus0 > atkbdc0: <Keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0 > atkbd0: <AT Keyboard> irq 1 on atkbdc0 > kbd0 at atkbd0 > atkbd0: [GIANT-LOCKED] > psm0: <PS/2 Mouse> irq 12 on atkbdc0 > psm0: [GIANT-LOCKED] > psm0: model IntelliMouse, device ID 3 > ichwd0: <Intel 6300ESB watchdog timer> on isa0 > pmtimer0 on isa0 > orm0: <ISA Option ROMs> at iomem 0xc0000-0xc7fff,0xc8000-0xc8fff,0xc9800-0xca7ff,0xca800-0xcb7ff on isa0 > sc0: <System console> at flags 0x100 on isa0 > sc0: VGA <16 virtual consoles, flags=0x300> > vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 > Timecounter "TSC" frequency 2800118202 Hz quality 800 > Timecounters tick every 1.000 msec > acd0: CDROM <CD-224E-N/1.AA> at ata0-master UDMA33 > twed0: <Unit 0, TwinStor, Normal> on twe0 > twed0: 152626MB (312579760 sectors) > Trying to mount root from ufs:/dev/twed0s1a > ext0: link state changed to UP > int0: link state changed to UP > vlan0: link state changed to UP > > This server hangs up every day without any messages in the log files and on the system console. A keyboard dosen't work too. I can make only hard reset and after restart coredump files are not appearing. > Here is my kernel configuration file: > > include GENERIC > ident KERNEL01_NOSMP > device ichwd # Intel ICH watchdog timer > #options SMP > options ALTQ > options ALTQ_CBQ > options ALTQ_RED > options ALTQ_RIO > options ALTQ_HFSC > options ALTQ_PRIQ > #options ALTQ_NOPCC > options SC_DISABLE_REBOOT > options MP_WATCHDOG > options SW_WATCHDOG > > If I make and install a kernel with SMP options the system under working load begins hang up every two hours. > > The two days "Memtest" gave no result. > I tried to install the newest Intel ethernet adapter driver, but without any results. > As an experiment I tried also to plug a system HDD to another sever platform (SuperServer 6015V-TB), but system hanging didn't stop. > I think that it is not only hardware problem. > Linux (Gentoo) and Windows server 2003 on this hardware were working fine. > > Please help me to find a solution and solve a problem. > > Your faithfully > Dmitry Komaleev > IT Manager > "EDIPRESSE-KONLIGA" http://www.konliga.ru > Russia, Moscow > tel.: +7 (495) 775-14-35, ext. 169 > fax: +7 (495) 775-14-34 > > P.S. I have written the Bug Report on my problem but have received only one advice to turn off ACPI-option. > If I disable ACPI, then the RAID-controller and both of the ethernet controllers on my server recieve the same IRQ. I believe this is not good. > _______________________________________________ > freebsd-stable@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-stable > To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org" >
On Wednesday 31 October 2007, ??????? ???????? wrote:> Hello everybody > > I have a big problem > > There is one FreeBSD server in our company. The server platform is: > Supermicro SuperServer 6014V-T2B (2x Intel Xeon 2.8, 1Gb RAM, 3WARE > 3W-8006-2LP RAID-Controller). The server works as: > - a gateway between LAN and Internet > - an Intranet web- and database server (Apache + MySQL + PHP) > - a firewall (OpenBSD pf)Do you use any user or group rules as part of your pf.conf? If so, you should take a look at the pf.conf(5) man page - specificly the BUGS section.> - a transparent proxy server (Squid) > A mounthly traffic through this server is about 100Gb. There is about > 200 internet users in our conpany. Here is a part of my dmesg-listing: > > Copyright (c) 1992-2007 The FreeBSD Project. > Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, > 1994 The Regents of the University of California. All rights reserved. > FreeBSD is a registered trademark of The FreeBSD Foundation. > FreeBSD 6.2-RELEASE-p8 #2: Thu Oct 11 19:51:25 MSD 2007 > sa@gateway.konliga.ru:/usr/obj/usr/src/sys/KERNEL01_NOSMP > module_register: module pci/em already exists! > Module pci/em failed to register: 17 > ACPI APIC Table: <A M I OEMAPIC > > Timecounter "i8254" frequency 1193182 Hz quality 0 > CPU: Intel(R) Xeon(TM) CPU 2.80GHz (2800.12-MHz 686-class CPU) > Origin = "GenuineIntel" Id = 0xf43 Stepping = 3 > > Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PG >E,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE> > Features2=0x641d<SSE3,RSVD2,MON,DS_CPL,CNTX-ID,CX16,<b14>> > AMD Features=0x20000000<LM> > Logical CPUs per core: 2 > real memory = 1073479680 (1023 MB) > avail memory = 1041465344 (993 MB) > ioapic0 <Version 2.0> irqs 0-23 on motherboard > ioapic1 <Version 2.0> irqs 24-47 on motherboard > ichwd module loaded > kbd1 at kbdmux0 > ath_hal: 0.9.17.2 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, > RF5413) acpi0: <A M I OEMRSDT> on motherboard > acpi0: Power Button (fixed) > Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 > acpi_timer0: <24-bit timer at 3.579545MHz> port 0x408-0x40b on acpi0 > cpu0: <ACPI CPU> on acpi0 > acpi_throttle0: <ACPI CPU Throttling> on cpu0 > pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0 > pci0: <ACPI PCI bus> on pcib0 > pcib1: <ACPI PCI-PCI bridge> irq 16 at device 2.0 on pci0 > pci1: <ACPI PCI bus> on pcib1 > pcib2: <ACPI PCI-PCI bridge> irq 16 at device 3.0 on pci0 > pci2: <ACPI PCI bus> on pcib2 > pcib3: <ACPI PCI-PCI bridge> at device 28.0 on pci0 > pci3: <ACPI PCI bus> on pcib3 > twe0: <3ware Storage Controller. Driver version 1.50.01.002> port > 0xbc00-0xbc0f mem 0xfc9ffc00-0xfc9ffc0f,0xfc000000-0xfc7fffff irq 24 at > device 1.0 on pci3 twe0: [GIANT-LOCKED] > twe0: 2 ports, Firmware FE8S 1.05.00.068, BIOS BE7X 1.08.00.048 > em0: <Intel(R) PRO/1000 Network Connection Version - 6.6.6> port > 0xb800-0xb83f mem 0xfc9c0000-0xfc9dffff irq 26 at device 3.0 on pci3 > em0: Ethernet address: 00:30:48:58:4d:2a > em0: [FAST] > em1: <Intel(R) PRO/1000 Network Connection Version - 6.6.6> port > 0xb400-0xb43f mem 0xfc9a0000-0xfc9bffff irq 27 at device 4.0 on pci3 > em1: Ethernet address: 00:30:48:58:4d:2b > em1: [FAST] > uhci0: <UHCI (generic) USB controller> port 0xe800-0xe81f irq 16 at > device 29.0 on pci0 uhci0: [GIANT-LOCKED] > usb0: <UHCI (generic) USB controller> on uhci0 > usb0: USB revision 1.0 > uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > uhub0: 2 ports with 2 removable, self powered > uhci1: <UHCI (generic) USB controller> port 0xec00-0xec1f irq 19 at > device 29.1 on pci0 uhci1: [GIANT-LOCKED] > usb1: <UHCI (generic) USB controller> on uhci1 > usb1: USB revision 1.0 > uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > uhub1: 2 ports with 2 removable, self powered > pci0: <base peripheral> at device 29.4 (no driver attached) > pci0: <base peripheral, interrupt controller> at device 29.5 (no driver > attached) ehci0: <Intel 6300ESB USB 2.0 controller> mem > 0xfebffc00-0xfebfffff irq 23 at device 29.7 on pci0 ehci0: > [GIANT-LOCKED] > usb2: EHCI version 1.0 > usb2: companion controllers, 2 ports each: usb0 usb1 > usb2: <Intel 6300ESB USB 2.0 controller> on ehci0 > usb2: USB revision 2.0 > uhub2: Intel EHCI root hub, class 9/0, rev 2.00/1.00, addr 1 > uhub2: 4 ports with 4 removable, self powered > pcib4: <ACPI PCI-PCI bridge> at device 30.0 on pci0 > pci4: <ACPI PCI bus> on pcib4 > pci4: <display, VGA> at device 5.0 (no driver attached) > isab0: <PCI-ISA bridge> at device 31.0 on pci0 > isa0: <ISA bus> on isab0 > atapci0: <Intel 6300ESB UDMA100 controller> port > 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xfc00-0xfc0f at device 31.1 on > pci0 ata0: <ATA channel 0> on atapci0 > ata1: <ATA channel 1> on atapci0 > pci0: <serial bus, SMBus> at device 31.3 (no driver attached) > acpi_button0: <Power Button> on acpi0 > acpi_button1: <Sleep Button> on acpi0 > sio0: configured irq 4 not in bitmap of probed irqs 0 > sio0: port may not be enabled > sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on > acpi0 sio0: type 16550A > sio1: configured irq 3 not in bitmap of probed irqs 0 > sio1: port may not be enabled > sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0 > sio1: type 16550A > fdc0: <floppy drive controller (FDE)> port 0x3f0-0x3f5,0x3f7 irq 6 drq > 2 on acpi0 fdc0: [FAST] > fd0: <1440-KB 3.5" drive> on fdc0 drive 0 > ppc0: <ECP parallel printer port> port 0x378-0x37f,0x778-0x77f irq 7 > drq 3 on acpi0 ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in > COMPATIBLE mode ppc0: FIFO with 16/16/9 bytes threshold > ppbus0: <Parallel port bus> on ppc0 > plip0: <PLIP network interface> on ppbus0 > lpt0: <Printer> on ppbus0 > lpt0: Interrupt-driven port > ppi0: <Parallel I/O> on ppbus0 > atkbdc0: <Keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0 > atkbd0: <AT Keyboard> irq 1 on atkbdc0 > kbd0 at atkbd0 > atkbd0: [GIANT-LOCKED] > psm0: <PS/2 Mouse> irq 12 on atkbdc0 > psm0: [GIANT-LOCKED] > psm0: model IntelliMouse, device ID 3 > ichwd0: <Intel 6300ESB watchdog timer> on isa0 > pmtimer0 on isa0 > orm0: <ISA Option ROMs> at iomem > 0xc0000-0xc7fff,0xc8000-0xc8fff,0xc9800-0xca7ff,0xca800-0xcb7ff on isa0 > sc0: <System console> at flags 0x100 on isa0 > sc0: VGA <16 virtual consoles, flags=0x300> > vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on > isa0 Timecounter "TSC" frequency 2800118202 Hz quality 800 > Timecounters tick every 1.000 msec > acd0: CDROM <CD-224E-N/1.AA> at ata0-master UDMA33 > twed0: <Unit 0, TwinStor, Normal> on twe0 > twed0: 152626MB (312579760 sectors) > Trying to mount root from ufs:/dev/twed0s1a > ext0: link state changed to UP > int0: link state changed to UP > vlan0: link state changed to UP > > This server hangs up every day without any messages in the log files > and on the system console. A keyboard dosen't work too. I can make only > hard reset and after restart coredump files are not appearing. Here is > my kernel configuration file: > > include GENERIC > ident KERNEL01_NOSMP > device ichwd # Intel ICH watchdog timer > #options SMP > options ALTQ > options ALTQ_CBQ > options ALTQ_RED > options ALTQ_RIO > options ALTQ_HFSC > options ALTQ_PRIQ > #options ALTQ_NOPCC > options SC_DISABLE_REBOOT > options MP_WATCHDOG > options SW_WATCHDOG > > If I make and install a kernel with SMP options the system under > working load begins hang up every two hours. > > The two days "Memtest" gave no result. > I tried to install the newest Intel ethernet adapter driver, but > without any results. As an experiment I tried also to plug a system HDD > to another sever platform (SuperServer 6015V-TB), but system hanging > didn't stop. I think that it is not only hardware problem. > Linux (Gentoo) and Windows server 2003 on this hardware were working > fine. > > Please help me to find a solution and solve a problem. > > Your faithfully > Dmitry Komaleev > IT Manager > "EDIPRESSE-KONLIGA" http://www.konliga.ru > Russia, Moscow > tel.: +7 (495) 775-14-35, ext. 169 > fax: +7 (495) 775-14-34 > > P.S. I have written the Bug Report on my problem but have received only > one advice to turn off ACPI-option. If I disable ACPI, then the > RAID-controller and both of the ethernet controllers on my server > recieve the same IRQ. I believe this is not good.-- /"\ Best regards, | mlaier@freebsd.org \ / Max Laier | ICQ #67774661 X http://pf4freebsd.love2party.net/ | mlaier@EFnet / \ ASCII Ribbon Campaign | Against HTML Mail and News -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 187 bytes Desc: This is a digitally signed message part. Url : http://lists.freebsd.org/pipermail/freebsd-stable/attachments/20071101/de958345/attachment.pgp
> > A system failure of this sort (one which leaves no log entries of any > kind) is generally a hardware fault; memory stick failures tend to > cause kernel panics and easy repeatability. > > I would suggest examining the hardware components, the motherboard > could have some faulty capacitors (burst, leaking, or swollen); the > fans on the processors could be failing causing a lockup, the power > supply fans could be failing causing an undervolt and lockup, but this > usually makes the system reset. > > You get the idea, your symptoms are pointing to hardware > issues in my opinion.I have written already, that I tried to plug a system HDD to another sever with the same configuration; on the new platform the system hanging didn't stop. The RAID controller remained the same, but it has the own error log and it is clear.> > On 10/31/07, ??????? ???????? <d.komaleev@konliga.ru> wrote: > > Hello everybody > > > > I have a big problem > > > > There is one FreeBSD server in our company. The server > platform is: Supermicro SuperServer 6014V-T2B (2x Intel Xeon > 2.8, 1Gb RAM, 3WARE 3W-8006-2LP RAID-Controller). > > The server works as: > > - a gateway between LAN and Internet > > - an Intranet web- and database server (Apache + MySQL + PHP) > > - a firewall (OpenBSD pf) > > - a transparent proxy server (Squid) > > A mounthly traffic through this server is about 100Gb. > There is about 200 internet users in our conpany. > > Here is a part of my dmesg-listing: > > > > Copyright (c) 1992-2007 The FreeBSD Project. > > Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, > 1992, 1993, 1994 > > The Regents of the University of California. All > rights reserved. > > FreeBSD is a registered trademark of The FreeBSD Foundation. > > FreeBSD 6.2-RELEASE-p8 #2: Thu Oct 11 19:51:25 MSD 2007 > > sa@gateway.konliga.ru:/usr/obj/usr/src/sys/KERNEL01_NOSMP > > module_register: module pci/em already exists! > > Module pci/em failed to register: 17 > > ACPI APIC Table: <A M I OEMAPIC > > > Timecounter "i8254" frequency 1193182 Hz quality 0 > > CPU: Intel(R) Xeon(TM) CPU 2.80GHz (2800.12-MHz 686-class CPU) > > Origin = "GenuineIntel" Id = 0xf43 Stepping = 3 > > > Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,> SS,HTT,TM,PBE>> > Features2=0x641d<SSE3,RSVD2,MON,DS_CPL,CNTX-ID,CX16,<b14>> > > AMD Features=0x20000000<LM> > > Logical CPUs per core: 2 > > real memory = 1073479680 (1023 MB) > > avail memory = 1041465344 (993 MB) > > ioapic0 <Version 2.0> irqs 0-23 on motherboard > > ioapic1 <Version 2.0> irqs 24-47 on motherboard > > ichwd module loaded > > kbd1 at kbdmux0 > > ath_hal: 0.9.17.2 (AR5210, AR5211, AR5212, RF5111, RF5112, > RF2413, RF5413) > > acpi0: <A M I OEMRSDT> on motherboard > > acpi0: Power Button (fixed) > > Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 > > acpi_timer0: <24-bit timer at 3.579545MHz> port 0x408-0x40b on acpi0 > > cpu0: <ACPI CPU> on acpi0 > > acpi_throttle0: <ACPI CPU Throttling> on cpu0 > > pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0 > > pci0: <ACPI PCI bus> on pcib0 > > pcib1: <ACPI PCI-PCI bridge> irq 16 at device 2.0 on pci0 > > pci1: <ACPI PCI bus> on pcib1 > > pcib2: <ACPI PCI-PCI bridge> irq 16 at device 3.0 on pci0 > > pci2: <ACPI PCI bus> on pcib2 > > pcib3: <ACPI PCI-PCI bridge> at device 28.0 on pci0 > > pci3: <ACPI PCI bus> on pcib3 > > twe0: <3ware Storage Controller. Driver version > 1.50.01.002> port 0xbc00-0xbc0f mem > 0xfc9ffc00-0xfc9ffc0f,0xfc000000-0xfc7fffff irq 24 at device > 1.0 on pci3 > > twe0: [GIANT-LOCKED] > > twe0: 2 ports, Firmware FE8S 1.05.00.068, BIOS BE7X 1.08.00.048 > > em0: <Intel(R) PRO/1000 Network Connection Version - 6.6.6> > port 0xb800-0xb83f mem 0xfc9c0000-0xfc9dffff irq 26 at device > 3.0 on pci3 > > em0: Ethernet address: 00:30:48:58:4d:2a > > em0: [FAST] > > em1: <Intel(R) PRO/1000 Network Connection Version - 6.6.6> > port 0xb400-0xb43f mem 0xfc9a0000-0xfc9bffff irq 27 at device > 4.0 on pci3 > > em1: Ethernet address: 00:30:48:58:4d:2b > > em1: [FAST] > > uhci0: <UHCI (generic) USB controller> port 0xe800-0xe81f > irq 16 at device 29.0 on pci0 > > uhci0: [GIANT-LOCKED] > > usb0: <UHCI (generic) USB controller> on uhci0 > > usb0: USB revision 1.0 > > uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > > uhub0: 2 ports with 2 removable, self powered > > uhci1: <UHCI (generic) USB controller> port 0xec00-0xec1f > irq 19 at device 29.1 on pci0 > > uhci1: [GIANT-LOCKED] > > usb1: <UHCI (generic) USB controller> on uhci1 > > usb1: USB revision 1.0 > > uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > > uhub1: 2 ports with 2 removable, self powered > > pci0: <base peripheral> at device 29.4 (no driver attached) > > pci0: <base peripheral, interrupt controller> at device > 29.5 (no driver attached) > > ehci0: <Intel 6300ESB USB 2.0 controller> mem > 0xfebffc00-0xfebfffff irq 23 at device 29.7 on pci0 > > ehci0: [GIANT-LOCKED] > > usb2: EHCI version 1.0 > > usb2: companion controllers, 2 ports each: usb0 usb1 > > usb2: <Intel 6300ESB USB 2.0 controller> on ehci0 > > usb2: USB revision 2.0 > > uhub2: Intel EHCI root hub, class 9/0, rev 2.00/1.00, addr 1 > > uhub2: 4 ports with 4 removable, self powered > > pcib4: <ACPI PCI-PCI bridge> at device 30.0 on pci0 > > pci4: <ACPI PCI bus> on pcib4 > > pci4: <display, VGA> at device 5.0 (no driver attached) > > isab0: <PCI-ISA bridge> at device 31.0 on pci0 > > isa0: <ISA bus> on isab0 > > atapci0: <Intel 6300ESB UDMA100 controller> port > 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xfc00-0xfc0f at device > 31.1 on pci0 > > ata0: <ATA channel 0> on atapci0 > > ata1: <ATA channel 1> on atapci0 > > pci0: <serial bus, SMBus> at device 31.3 (no driver attached) > > acpi_button0: <Power Button> on acpi0 > > acpi_button1: <Sleep Button> on acpi0 > > sio0: configured irq 4 not in bitmap of probed irqs 0 > > sio0: port may not be enabled > > sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 > flags 0x10 on acpi0 > > sio0: type 16550A > > sio1: configured irq 3 not in bitmap of probed irqs 0 > > sio1: port may not be enabled > > sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0 > > sio1: type 16550A > > fdc0: <floppy drive controller (FDE)> port > 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0 > > fdc0: [FAST] > > fd0: <1440-KB 3.5" drive> on fdc0 drive 0 > > ppc0: <ECP parallel printer port> port > 0x378-0x37f,0x778-0x77f irq 7 drq 3 on acpi0 > > ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode > > ppc0: FIFO with 16/16/9 bytes threshold > > ppbus0: <Parallel port bus> on ppc0 > > plip0: <PLIP network interface> on ppbus0 > > lpt0: <Printer> on ppbus0 > > lpt0: Interrupt-driven port > > ppi0: <Parallel I/O> on ppbus0 > > atkbdc0: <Keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0 > > atkbd0: <AT Keyboard> irq 1 on atkbdc0 > > kbd0 at atkbd0 > > atkbd0: [GIANT-LOCKED] > > psm0: <PS/2 Mouse> irq 12 on atkbdc0 > > psm0: [GIANT-LOCKED] > > psm0: model IntelliMouse, device ID 3 > > ichwd0: <Intel 6300ESB watchdog timer> on isa0 > > pmtimer0 on isa0 > > orm0: <ISA Option ROMs> at iomem > 0xc0000-0xc7fff,0xc8000-0xc8fff,0xc9800-0xca7ff,0xca800-0xcb7f > f on isa0 > > sc0: <System console> at flags 0x100 on isa0 > > sc0: VGA <16 virtual consoles, flags=0x300> > > vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem > 0xa0000-0xbffff on isa0 > > Timecounter "TSC" frequency 2800118202 Hz quality 800 > > Timecounters tick every 1.000 msec > > acd0: CDROM <CD-224E-N/1.AA> at ata0-master UDMA33 > > twed0: <Unit 0, TwinStor, Normal> on twe0 > > twed0: 152626MB (312579760 sectors) > > Trying to mount root from ufs:/dev/twed0s1a > > ext0: link state changed to UP > > int0: link state changed to UP > > vlan0: link state changed to UP > > > > This server hangs up every day without any messages in the > log files and on the system console. A keyboard dosen't work > too. I can make only hard reset and after restart coredump > files are not appearing. > > Here is my kernel configuration file: > > > > include GENERIC > > ident KERNEL01_NOSMP > > device ichwd # Intel ICH watchdog timer > > #options SMP > > options ALTQ > > options ALTQ_CBQ > > options ALTQ_RED > > options ALTQ_RIO > > options ALTQ_HFSC > > options ALTQ_PRIQ > > #options ALTQ_NOPCC > > options SC_DISABLE_REBOOT > > options MP_WATCHDOG > > options SW_WATCHDOG > > > > If I make and install a kernel with SMP options the system > under working load begins hang up every two hours. > > > > The two days "Memtest" gave no result. > > I tried to install the newest Intel ethernet adapter > driver, but without any results. > > As an experiment I tried also to plug a system HDD to > another sever platform (SuperServer 6015V-TB), but system > hanging didn't stop. > > I think that it is not only hardware problem. > > Linux (Gentoo) and Windows server 2003 on this hardware > were working fine. > > > > Please help me to find a solution and solve a problem. > > > > Your faithfully > > Dmitry Komaleev > > IT Manager > > "EDIPRESSE-KONLIGA" http://www.konliga.ru > > Russia, Moscow > > tel.: +7 (495) 775-14-35, ext. 169 > > fax: +7 (495) 775-14-34 > > > > P.S. I have written the Bug Report on my problem but have > received only one advice to turn off ACPI-option. > > If I disable ACPI, then the RAID-controller and both of the > ethernet controllers on my server recieve the same IRQ. I > believe this is not good. > > _______________________________________________ > > freebsd-stable@freebsd.org mailing list > > http://lists.freebsd.org/mailman/listinfo/freebsd-stable > > To unsubscribe, send any mail to > "freebsd-stable-unsubscribe@freebsd.org" > > >
> > > I have written already, that I tried to plug a system HDD > to another sever with the same configuration; on the new > platform the system hanging didn't stop. The RAID controller > remained the same, but it has the own error log and it is clear. > > > > em0: <Intel(R) PRO/1000 Network Connection Version - 6.6.6> > > :)) > > >> > options MP_WATCHDOG > >> > options SW_WATCHDOG > > Try to compile the GENERIC SMP kernel with all ALTQ options > but without > watchdog options. Maybe this will work.The watchdog options whare turned off initially and the server froze often. I enabled watchdog in order not to reset my server manually. It works. :)