Please help me!
I know that I am getting few responses to my emails - I am guessing that
my situation is difficult. If you could offer any ideas how to help
with further diagnostics.
I am regularly getting panics with instruction pointer equal to
0xc0611c69. I am not able to get any dumps - the dumpon directive is
simply ignored.
(I did get one dump (for some reason), but that was with a kernel that
was not made with config -g, and new kernels made afterwards seem
significantly different, despite having exactly the same size.)
The code at this instruction pointer is
(kgdb) list *0xc0611c69
0xc0611c69 is in fill_kinfo_thread (../../../kern/kern_proc.c:748).
743 }
744
745 kg = td->td_ksegrp;
746
747 /* things in the KSE GROUP */
748 kp->ki_estcpu = kg->kg_estcpu;
749 kp->ki_slptime = kg->kg_slptime;
750 kp->ki_pri.pri_user = kg->kg_user_pri;
751 kp->ki_pri.pri_class = kg->kg_pri_class;
752
so I'm guessing that kp is not correct.
Because of the consistency of the instruction pointer value from panic
to panic, I really am thinking that this is not a hardware issue.
I will try any reasonable test you guys have for me. Right now I am
switching off HTT to see if that is the issue. This is a dual Xeon system.
I am willing to provide a copy of the program that I'm guessing is
causing the problem. It is a multithreaded program that is very CPU
instensive, although most of the inners of the code are from the fftw3 port.
One interesting thing about this program is that when I run it, top says
that about 45% CPU is being used (which with 4 logical CPU's means that
almost 2 CPU's are being used), but that actual program is registered at
running with about 80% CPU time (which I am guessing means 0.8 of one
CPU is being used). It seems to me that there is some disparity in the
accounting.
Maybe it is a problem with the math/fftw3 code. But is still shouldn't
causes crashes.
Please help me. I am sure that this is a difficult problem, but I just
don't know how to provide you any further decent diagnostic information.
Thanks, Stephen