Geoff Roberts
2007-Jan-06 00:46 UTC
System freeze on 6.1/2 when running makeworld and dump
Hi,
I can consistently make my system freeze by running buildworld and
dump at the same time. The system locks up completely - I have to
hit the reset switch to bring it back to life.
I also get a core dump on current.
I have mounted the file systems as follows:
/
/tmp
/usr
/usr/obj
/var
The freeze always happens when using dump on the /usr mount point. Most
of the writes are going to /usr/obj.
I have tried both SCSI and IDE hardware. I have completely swapped the RAM
(a few times), so I am as confident as one can be that it is not a
hardware issue :) I've also run various CPU burn-in and RAM testers to
make sure there is no issue there.
I can also duplicate the issue dumping to /dev/null as opposed to an
actual tape drive through the SCSI card.
If I run dump without a buildworld running in the background, the system
backs up fine. If I run buildworld by itself, without dump, all is fine
as well.
Below is some information about my system configuration:
GENERIC 6.1 or 6.2 kernel (or current without SMP)
(note in the dmesg output it does not say GENERIC, but the CUSTOM config
is an exact copy of GENERIC).
I can freeze the system with the command below, using either /dev/null or
/dev/nsa0 as the output device:
dump -C 32 -0 -a -L -u -f /dev/null /usr
The freeze does not happen until dump starts to write the files.
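The reproduction described above could be scripted roughly as follows. This is only a sketch: the dump command line is the one quoted in the report, but the exact build invocation (`cd /usr/src && make buildworld`), the variable names and the dry-run switch are assumptions added for illustration. By default it only prints what it would run.

```shell
#!/bin/sh
# Hypothetical reproduction helper. DRY_RUN=1 (the default) only prints the
# commands; set DRY_RUN=0 and run as root to actually execute them.
DRY_RUN="${DRY_RUN:-1}"
SRC_DIR="${SRC_DIR:-/usr/src}"            # assumed location of the source tree
DUMP_TARGET="${DUMP_TARGET:-/dev/null}"   # or /dev/nsa0 for the tape drive
DUMP_FS="${DUMP_FS:-/usr}"                # file system that triggers the freeze

# The dump invocation quoted in the report.
dump_cmd() {
    echo "dump -C 32 -0 -a -L -u -f $DUMP_TARGET $DUMP_FS"
}

# Assumption: the report does not give the exact build command line.
build_cmd() {
    echo "cd $SRC_DIR && make buildworld"
}

reproduce() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "would run in background: $(build_cmd)"
        echo "would run in foreground: $(dump_cmd)"
        return 0
    fi
    # Start the build first so that /usr/obj is being written heavily ...
    sh -c "$(build_cmd)" &
    build_pid=$!
    # ... then dump /usr while the build is still running.
    sh -c "$(dump_cmd)"
    wait "$build_pid"
}

reproduce
```

Keeping the dry run as the default is deliberate: the real run is expected to lock the machine, and `-u` updates /etc/dumpdates.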
Has anyone else experienced this?
Kind regards,
Geoff
-------------- next part --------------
Copyright (c) 1992-2006 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 6.1-RELEASE #0: Sun May 7 04:32:43 UTC 2006
root@opus.cse.buffalo.edu:/usr/obj/usr/src/sys/GENERIC
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: AMD Athlon(tm) Processor (908.09-MHz 686-class CPU)
Origin = "AuthenticAMD" Id = 0x642 Stepping = 2
Features=0x183f9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR>
AMD Features=0xc0440800<SYSCALL,<b18>,MMX+,3DNow+,3DNow>
real memory = 1073659904 (1023 MB)
avail memory = 1041719296 (993 MB)
kbd1 at kbdmux0
acpi0: <ASUS A7V> on motherboard
acpi0: Power Button (fixed)
Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
acpi_timer0: <24-bit timer at 3.579545MHz> port 0xe408-0xe40b on acpi0
cpu0: <ACPI CPU> on acpi0
acpi_throttle0: <ACPI CPU Throttling> on cpu0
acpi_button0: <Power Button> on acpi0
pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
pci0: <ACPI PCI bus> on pcib0
agp0: <VIA 82C8363 (Apollo KT133x/KM133) host to PCI bridge> mem
0xe7800000-0xe7bfffff at device 0.0 on pci0
pcib1: <PCI-PCI bridge> at device 1.0 on pci0
pci1: <PCI bus> on pcib1
pci1: <display, VGA> at device 0.0 (no driver attached)
isab0: <PCI-ISA bridge> at device 4.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <VIA 82C686A UDMA66 controller> port
0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xd800-0xd80f at device 4.1 on pci0
ata0: <ATA channel 0> on atapci0
ata1: <ATA channel 1> on atapci0
uhci0: <VIA 83C572 USB controller> port 0xd400-0xd41f irq 7 at device 4.2
on pci0
uhci0: [GIANT-LOCKED]
usb0: <VIA 83C572 USB controller> on uhci0
usb0: USB revision 1.0
uhub0: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhci1: <VIA 83C572 USB controller> port 0xd000-0xd01f irq 7 at device 4.3
on pci0
uhci1: [GIANT-LOCKED]
usb1: <VIA 83C572 USB controller> on uhci1
usb1: USB revision 1.0
uhub1: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
pci0: <bridge> at device 4.4 (no driver attached)
pcib2: <PCI-PCI bridge> at device 10.0 on pci0
pci2: <PCI bus> on pcib2
amr0: <LSILogic MegaRAID 1.53> mem 0xe3000000-0xe300ffff irq 3 at device
10.1 on pci0
amr0: <Series 438> Firmware GH8E, BIOS 1.46, 16MB RAM
fxp0: <Intel 82550 Pro/100 Ethernet> port 0x9800-0x983f mem
0xe0800000-0xe0800fff,0xe0000000-0xe001ffff irq 11 at device 12.0 on pci0
miibus0: <MII bus> on fxp0
inphy0: <i82555 10/100 media interface> on miibus0
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp0: Ethernet address: 00:02:b3:b7:14:52
atapci1: <Promise PDC20265 UDMA100 controller> port
0x9400-0x9407,0x9000-0x9003,0x8800-0x8807,0x8400-0x8403,0x8000-0x803f mem
0xdf800000-0xdf81ffff irq 10 at device 17.0 on pci0
ata2: <ATA channel 0> on atapci1
ata3: <ATA channel 1> on atapci1
fdc0: <floppy drive controller> port 0x3f2-0x3f5,0x3f7 irq 6 drq 2 on
acpi0
fdc0: [FAST]
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on
acpi0
sio0: type 16550A
atkbdc0: <Keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
psm0: <PS/2 Mouse> irq 12 on atkbdc0
psm0: [GIANT-LOCKED]
psm0: model IntelliMouse, device ID 3
pmtimer0 on isa0
orm0: <ISA Option ROMs> at iomem 0xc0000-0xcafff,0xd0000-0xd17ff on isa0
ppc0: parallel port not found.
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio1: configured irq 3 not in bitmap of probed irqs 0
sio1: port may not be enabled
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
uhub2: ALCOR Generic USB Hub, class 9/0, rev 1.10/1.00, addr 2
uhub2: 4 ports with 4 removable, self powered
Timecounter "TSC" frequency 908090887 Hz quality 800
Timecounters tick every 1.000 msec
acd0: CDRW <AOPEN CRW1232/1.31> at ata1-master PIO4
amrd0: <LSILogic MegaRAID logical drive> on amr0
amrd0: 21552MB (44138496 sectors) RAID 0 (optimal)
Trying to mount root from ufs:/dev/amrd0s1a
fxp0: link state changed to UP
-------------- next part --------------
/dev/ad0s1a on / (ufs, local, soft-updates)
devfs on /dev (devfs, local)
/dev/ad0s1e on /tmp (ufs, local, soft-updates)
/dev/ad0s2e on /usr (ufs, local, soft-updates)
/dev/ad0s2d on /usr/obj (ufs, local, soft-updates)
/dev/ad0s1d on /var (ufs, local, soft-updates)
Adrian Wontroba
2007-Jan-06 05:44 UTC
System freeze on 6.1/2 when running makeworld and dump
On Sat, Jan 06, 2007 at 07:14:41PM +1100, Geoff Roberts wrote:
> I can consistantly make my system freeze when building makeworld and
> running dump at the same time. The system actually locks - I have to
> hit the reset switch to bring the system back to life.
I have an old SMP machine at work which sometimes does something similar
during its daily housekeeping, where Apache and Nagios are bounced and a
small MySQL database dumped. It sometimes appears to hang during the
database dump. The debugger shows many processes waiting for UFS. I
suspect that the problem starts several minutes earlier.
All of the following help to keep the problem away:
Upgrading from a several months old 5-STABLE to 6-STABLE.
Inserting 60 second delays at various points.
Disabling SMP.
No core dump available (Mylex disk controller).
My next diagnostic step will be a serial console.
--
Adrian Wontroba
The biggest mistake you can make is to believe that you are working for
someone else.
Kris Kennaway
2007-Jan-11 07:33 UTC
System freeze on 6.1/2 when running makeworld and dump
On Sat, Jan 06, 2007 at 07:14:41PM +1100, Geoff Roberts wrote:
> Hi,
>
> I can consistantly make my system freeze when building makeworld and
> running dump at the same time. The system actually locks - I have to
> hit the reset switch to bring the system back to life.
>
> I also get a core dump on current.
Don't keep us in suspense, when you have additional relevant details
please provide them in your mail!
As for the freeze, it might be snapshot-related but we have no way to
tell until you provide additional debugging details, as outlined in the
developers handbook chapter on kernel debugging.
Thanks,
Kris
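As a rough illustration of the preparation this reply asks for, the sketch below reports which commonly used kernel debugging options are absent from a kernel configuration file. The option list (KDB, DDB, INVARIANTS, INVARIANT_SUPPORT, WITNESS, WITNESS_SKIPSPIN) and the CUSTOM config path are assumptions drawn from general FreeBSD practice, not from this thread. The handbook chapter remains the authoritative list.

```shell
#!/bin/sh
# Print the kernel debugging options that a config file does not enable.
# Assumption: this option list is illustrative, not taken from the thread.
DEBUG_OPTIONS="KDB DDB INVARIANTS INVARIANT_SUPPORT WITNESS WITNESS_SKIPSPIN"

missing_debug_options() {
    conf="$1"
    for opt in $DEBUG_OPTIONS; do
        # Match lines such as "options<TAB>DDB", allowing a trailing comment.
        if ! grep -Eq "^[[:space:]]*options[[:space:]]+${opt}([[:space:]]|#|\$)" "$conf"; then
            echo "$opt"
        fi
    done
}

# Hypothetical default path: the report says CUSTOM is a copy of GENERIC,
# and the dmesg output shows a 686-class CPU, so i386 is assumed here.
CONF="${1:-/usr/src/sys/i386/conf/CUSTOM}"
if [ -r "$CONF" ]; then
    missing_debug_options "$CONF"
else
    echo "cannot read $CONF" >&2
fi
```

Any option it prints would be added to the config as an `options` line before rebuilding the kernel. WITNESS in particular slows the system noticeably, so it is usually enabled only while chasing a problem like this one.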