Hi all, I'm rather new to freebsd so forgive me if I am saying things that don't make sense. I have a dual opteron server running freebsd 6.1/amd64 updated 4 weeks ago. Since I moved to 6 I've been having tho following problem. When I put heavy load on the server (python scripts that take 20 minutes to complete) the server sometimes hangs. I can then not ping it anymore and it does not respond to anything (no keyboard power button etc.). Mostly there are no error messages or anything so I have no clue to what the problem is. One time it did give an error but I did not wrote it down, but as I recall it was something like SMP spin lock timed out. It hangs every three weeks Please give me some info on how I can get it to give more info or how to resolve this problem. thanks in advance Bram below are the first lines of dmesg FreeBSD 6.1-RELEASE-p2 #3: Tue Aug 1 16:30:28 CEST 2006 bram@SIMBA.jeansgallery.be:/usr/obj/usr/src/sys/SIMBA ACPI APIC Table: <A M I OEMAPIC > Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: AMD Opteron(tm) Processor 246 (2004.56-MHz K8-class CPU) Origin = "AuthenticAMD" Id = 0xf5a Stepping = 10 Features=0x78bfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2> AMD Features=0xe0500800<SYSCALL,NX,MMX+,LM,3DNow+,3DNow> real memory = 2147418112 (2047 MB) avail memory = 2062467072 (1966 MB) FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
In response to bram <bram@diomedia.be>:> Hi all, > > I'm rather new to freebsd so forgive me if I am saying things that don't > make sense. > > I have a dual opteron server running freebsd 6.1/amd64 updated 4 weeks ago. > > Since I moved to 6 I've been having tho following problem. > > When I put heavy load on the server (python scripts that take 20 minutes > to complete) the server sometimes hangs. > > I can then not ping it anymore and it does not respond to anything (no > keyboard power button etc.). > Mostly there are no error messages or anything so I have no clue to what > the problem is. > One time it did give an error but I did not wrote it down, but as I > recall it was something like SMP spin lock timed out. > > It hangs every three weeks > > Please give me some info on how I can get it to give more info or how to > resolve this problem.I'm no expert, but this is probably where you'll need to start: http://www.freebsd.org/doc/en_US.ISO8859-1/books/developers-handbook/kerneldebug-online-ddb.html -- Bill Moran Collaborative Fusion Inc.
On Fri, Oct 27, 2006 at 09:35:09AM +0200, bram wrote:> Hi all, > > I'm rather new to freebsd so forgive me if I am saying things that don't > make sense. > > I have a dual opteron server running freebsd 6.1/amd64 updated 4 weeks ago. > > Since I moved to 6 I've been having tho following problem. > > When I put heavy load on the server (python scripts that take 20 minutes > to complete) the server sometimes hangs. > > I can then not ping it anymore and it does not respond to anything (no > keyboard power button etc.). > Mostly there are no error messages or anything so I have no clue to what > the problem is. > One time it did give an error but I did not wrote it down, but as I > recall it was something like SMP spin lock timed out. > > It hangs every three weeks > > Please give me some info on how I can get it to give more info or how to > resolve this problem.See the chapter on kernel debugging in the develpers handbook Kris -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 187 bytes Desc: not available Url : http://lists.freebsd.org/pipermail/freebsd-stable/attachments/20061027/e7c47d08/attachment.pgp
Hi all, I turned kernel debuging on and tried the key combinations when the machine was stuck but it did not really help because the machine was not responding to this. I do think that I might have found the cause of the problem. I was running powerd and the cpufreq module was loaded each I had problems. I now turned powerd of and did not load the cpufreq module and the machine is now running for 23 days in a row. Does anybody know wether this is possible/makes sense ? kind regards Kris Kennaway schreef:> On Fri, Oct 27, 2006 at 09:35:09AM +0200, bram wrote: > >> Hi all, >> >> I'm rather new to freebsd so forgive me if I am saying things that don't >> make sense. >> >> I have a dual opteron server running freebsd 6.1/amd64 updated 4 weeks ago. >> >> Since I moved to 6 I've been having tho following problem. >> >> When I put heavy load on the server (python scripts that take 20 minutes >> to complete) the server sometimes hangs. >> >> I can then not ping it anymore and it does not respond to anything (no >> keyboard power button etc.). >> Mostly there are no error messages or anything so I have no clue to what >> the problem is. >> One time it did give an error but I did not wrote it down, but as I >> recall it was something like SMP spin lock timed out. >> >> It hangs every three weeks >> >> Please give me some info on how I can get it to give more info or how to >> resolve this problem. >> > > See the chapter on kernel debugging in the develpers handbook > > Kris >