I'm running Red Hat 7.3 with kernel 2.4.18-10 and all errata patches
installed. My system has been running for over three years without any
problems. All I've done to it in that time is add a mirror set of two
80GB drives, a Promise IDE controller, and upgrade Red Hat through the 7.x
series. Right now I have three drives. A 13GB system drive off the
motherboard, and two 80GB drives software mirrored on the Promise
controller. Everything is running ext3.
Recently, one of the drives in the mirror set died. I decided to replace
both the drives so that I could have a mirror set with two identical
drives. That was about two weeks ago. This past Saturday (Sep 28), I
woke up to find the error below on the console. It wasn't written to
/var/log/messages. The machine was locked cold so I had to reboot it.
It ran fine until this morning (Oct 1) when I woke up and turned the
monitor on to see the same message. This time I wrote down everything on
the screen and typed it back in for this email.
Does anyone know how I might go about debugging this? As I said, the only
thing that has changed recently (the last four months) is the removal of
the old mirrored drives and then the addition of these two new drives.
Thanks in advance for any help.
Here is the error message in full:
Assertion failure in __journal_remove_journal_head() at journal.c:1772:
------------[ cut here ]------------
kernel BUG at journal.c:1772!
invalid operand: 0000
nfsd lockd sunrpc autofs tulip ide-cd cdrom usb-uhci usbcore ext3 jbd raid1
EIP: 0010:[<e0818b59>] Not tainted
EIP is at __journal_remove_journal_head [jbd] 0xa9 (2.4.18-10)
eax: 0000001e ebx: d47ebf60 ecx: 00000001 edx: 000028a5
esi: d85d9ee0 edi: d85d9f10 ebp: c2483850 esp: df623e64
ds: 0018 es: 0018 ss: 0018
Process kjournald (pid: 16, stackpage=df623000)
Stack: e081b42f 000006ec d47ebf60 d85d9ee0 e0814533 d47ebf60 d85d9ee0 df622000
00000000 00000000 00000000 00000019 dfc5dba0 d7cdefd0 0005b9ce d29b281e
db6bf760 de956000 c01db16b d4955f80 d4955f20 d4955ec0 d4955e60 d4955e00
Call Trace: [<e081b42f>] .rodata.str1.1 [jdb] 0x4ef
[<e0814533>] journal_commit_transaction [jbd] 0x293
[<c01db16b>] ip_rcv [kernel] 0x31b
[<e0817106>] kjournald [jbd] 0x116
[<e0816fd0>] commit_timeout [jbd] 0x0
[<c0107136>] kernel_thread [kernel] 0x26
[<e0816ff0>] kjournald [jbd] 0x0
Code: 0f 0b 58 5a 39 1e 74 34 68 39 b6 81 e0 68 ed 06 00 00 68 2f