Geoff Roberts
2007-Jan-06 00:46 UTC
System freeze on 6.1/2 when running makeworld and dump
Hi, I can consistantly make my system freeze when building makeworld and running dump at the same time. The system actually locks - I have to hit the reset switch to bring the system back to life. I also get a core dump on current. I have mounted the file systems as follows: / /tmp /usr /usr/obj /var The freeze always happens when using dump on the /usr mount point. Most of the writes are going to /usr/obj. I have tried both SCSI hardware and IDE. I have completely swapped RAM (a few times) so I am only as confident as you can be it is not a hardware issue :) I've also tried various CPU burn and RAM testers to make sure there is no issue there. I can also duplicate the issue dumping to /dev/null as opposed to an actual tape drive through the SCSI card. If I run dump without doing a makeworld in the background the system backs up fine. If I run buildworld by itself without dump all is fine as well. Below is some information about my system configuration: GENERIC 6.1 or 6.2 kernel (or current without SMP) (note in the dmesg output it does not say GENERIC, but the CUSTOM config is an exact copy of GENERIC). I can freeze the system using either /dev/null or /dev/nsa0 below: dump -C 32 -0 -a -L -u -f /dev/null /usr The freeze does not happen till dump starts to write the files. Has anyone else experienced this? Kind regards, Geoff -------------- next part -------------- Copyright (c) 1992-2006 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 6.1-RELEASE #0: Sun May 7 04:32:43 UTC 2006 root@opus.cse.buffalo.edu:/usr/obj/usr/src/sys/GENERIC Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: AMD Athlon(tm) Processor (908.09-MHz 686-class CPU) Origin = "AuthenticAMD" Id = 0x642 Stepping = 2 Features=0x183f9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR> AMD Features=0xc0440800<SYSCALL,<b18>,MMX+,3DNow+,3DNow> real memory = 1073659904 (1023 MB) avail memory = 1041719296 (993 MB) kbd1 at kbdmux0 acpi0: <ASUS A7V> on motherboard acpi0: Power Button (fixed) Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 acpi_timer0: <24-bit timer at 3.579545MHz> port 0xe408-0xe40b on acpi0 cpu0: <ACPI CPU> on acpi0 acpi_throttle0: <ACPI CPU Throttling> on cpu0 acpi_button0: <Power Button> on acpi0 pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0 pci0: <ACPI PCI bus> on pcib0 agp0: <VIA 82C8363 (Apollo KT133x/KM133) host to PCI bridge> mem 0xe7800000-0xe7bfffff at device 0.0 on pci0 pcib1: <PCI-PCI bridge> at device 1.0 on pci0 pci1: <PCI bus> on pcib1 pci1: <display, VGA> at device 0.0 (no driver attached) isab0: <PCI-ISA bridge> at device 4.0 on pci0 isa0: <ISA bus> on isab0 atapci0: <VIA 82C686A UDMA66 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xd800-0xd80f at device 4.1 on pci0 ata0: <ATA channel 0> on atapci0 ata1: <ATA channel 1> on atapci0 uhci0: <VIA 83C572 USB controller> port 0xd400-0xd41f irq 7 at device 4.2 on pci0 uhci0: [GIANT-LOCKED] usb0: <VIA 83C572 USB controller> on uhci0 usb0: USB revision 1.0 uhub0: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 2 ports with 2 removable, self powered uhci1: <VIA 83C572 USB controller> port 0xd000-0xd01f irq 7 at device 4.3 on pci0 uhci1: [GIANT-LOCKED] usb1: <VIA 83C572 USB controller> on uhci1 usb1: USB revision 1.0 uhub1: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub1: 2 ports with 2 removable, self powered pci0: <bridge> at device 4.4 (no driver attached) pcib2: <PCI-PCI bridge> at device 10.0 on pci0 pci2: <PCI bus> on pcib2 amr0: <LSILogic MegaRAID 1.53> mem 0xe3000000-0xe300ffff irq 3 at device 10.1 on pci0 amr0: <Series 438> Firmware GH8E, BIOS 1.46, 16MB RAM fxp0: <Intel 82550 Pro/100 Ethernet> port 0x9800-0x983f mem 0xe0800000-0xe0800fff,0xe0000000-0xe001ffff irq 11 at device 12.0 on pci0 miibus0: <MII bus> on fxp0 inphy0: <i82555 10/100 media interface> on miibus0 inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto fxp0: Ethernet address: 00:02:b3:b7:14:52 atapci1: <Promise PDC20265 UDMA100 controller> port 0x9400-0x9407,0x9000-0x9003,0x8800-0x8807,0x8400-0x8403,0x8000-0x803f mem 0xdf800000-0xdf81ffff irq 10 at device 17.0 on pci0 ata2: <ATA channel 0> on atapci1 ata3: <ATA channel 1> on atapci1 fdc0: <floppy drive controller> port 0x3f2-0x3f5,0x3f7 irq 6 drq 2 on acpi0 fdc0: [FAST] fd0: <1440-KB 3.5" drive> on fdc0 drive 0 sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 sio0: type 16550A atkbdc0: <Keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0 atkbd0: <AT Keyboard> irq 1 on atkbdc0 kbd0 at atkbd0 atkbd0: [GIANT-LOCKED] psm0: <PS/2 Mouse> irq 12 on atkbdc0 psm0: [GIANT-LOCKED] psm0: model IntelliMouse, device ID 3 pmtimer0 on isa0 orm0: <ISA Option ROMs> at iomem 0xc0000-0xcafff,0xd0000-0xd17ff on isa0 ppc0: parallel port not found. sc0: <System console> at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> sio1: configured irq 3 not in bitmap of probed irqs 0 sio1: port may not be enabled vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 uhub2: ALCOR Generic USB Hub, class 9/0, rev 1.10/1.00, addr 2 uhub2: 4 ports with 4 removable, self powered Timecounter "TSC" frequency 908090887 Hz quality 800 Timecounters tick every 1.000 msec acd0: CDRW <AOPEN CRW1232/1.31> at ata1-master PIO4 amrd0: <LSILogic MegaRAID logical drive> on amr0 amrd0: 21552MB (44138496 sectors) RAID 0 (optimal) Trying to mount root from ufs:/dev/amrd0s1a fxp0: link state changed to UP -------------- next part -------------- /dev/ad0s1a on / (ufs, local, soft-updates) devfs on /dev (devfs, local) /dev/ad0s1e on /tmp (ufs, local, soft-updates) /dev/ad0s2e on /usr (ufs, local, soft-updates) /dev/ad0s2d on /usr/obj (ufs, local, soft-updates) /dev/ad0s1d on /var (ufs, local, soft-updates)
Adrian Wontroba
2007-Jan-06 05:44 UTC
System freeze on 6.1/2 when running makeworld and dump
On Sat, Jan 06, 2007 at 07:14:41PM +1100, Geoff Roberts wrote:> I can consistantly make my system freeze when building makeworld and > running dump at the same time. The system actually locks - I have to > hit the reset switch to bring the system back to life.I have an old SMP machine at work which sometimes does something similar during its daily housekeeping, where Apache and Nagios are bounced and a small MySQL database dumped. It sometimes appears to hang during the database dump. The debugger shows many processes waiting for UFS. I suspect that the problem starts several minutes earlier. All of the following help to keep the problem away: Upgrading from a several months old 5-STABLE to 6-STABLE. Inserting 60 second delays at various points. Disabling SMP. No core dump available (Mylex disk controller). My next diagnostic step will be a serial console. -- Adrian Wontroba The biggest mistake you can make is to believe that you are working for someone else.
Kris Kennaway
2007-Jan-11 07:33 UTC
System freeze on 6.1/2 when running makeworld and dump
On Sat, Jan 06, 2007 at 07:14:41PM +1100, Geoff Roberts wrote:> Hi, > > I can consistantly make my system freeze when building makeworld and > running dump at the same time. The system actually locks - I have to > hit the reset switch to bring the system back to life. > > I also get a core dump on current.Don't keep us in suspense, when you have additional relevant details please provide them in your mail! As for the freeze, it might be snapshot-related but we have no way to tell until you provide additional debugging details, as outlined in the developers handbook chapter on kernel debugging. Thanks, Kris -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 187 bytes Desc: not available Url : lists.freebsd.org/pipermail/freebsd-stable/attachments/20070111/605b88fc/attachment.pgp