See earlier post - May 10th "Node Panic"

Can anyone tell me what might be happening here? I have a 3-node cluster running under RH AS 4 (2.6.9-34.ELsmp) with ocfs2 v. 1.2.1. I've upgraded to 1.2.1 as suggested in the previous post, but one or more of my nodes continues to panic weekly:

Aug 16 15:29:02 linux96 kernel: (6670,2):ocfs2_extend_file:787 ERROR: bug expression: i_size_read(inode) != (le64_to_cpu(fe->i_size) - *bytes_extended)
Aug 16 15:29:02 linux96 kernel: (6670,2):ocfs2_extend_file:787 ERROR: Inode 1067290 i_size = 6073937, dinode i_size = 6074345, bytes_extended = 0, new_i_size = 6074037
Aug 16 15:29:02 linux96 kernel: ------------[ cut here ]------------
Aug 16 15:29:02 linux96 kernel: kernel BUG at /rpmbuild/smushran/BUILD/ocfs2-1.2.1/fs/ocfs2/file.c:787!
Aug 16 15:29:02 linux96 kernel: invalid operand: 0000 [#1]
Aug 16 15:29:02 linux96 kernel: SMP
Aug 16 15:29:02 linux96 kernel: Modules linked in: nfs lockd nfs_acl md5 ipv6 parport_pc lp parport autofs4 i2c_dev i2c_core ocfs2(U) debugfs(U) ocfs2_dlmfs(U) ocfs2_dlm(U) ocfs2_nodemanager(U) configfs(U) sunrpc dm_mirror dm_mod emcphr(U) emcpmpap(U) emcpmpaa(U) emcpmpc(U) emcpmp(U) emcp(U) emcplib(U) button battery ac uhci_hcd ehci_hcd hw_random e1000 bonding(U) floppy sg ext3 jbd lpfc(U) scsi_transport_fc megaraid_mbox megaraid_mm sd_mod scsi_mod
Aug 16 15:29:02 linux96 kernel: CPU: 2
Aug 16 15:29:02 linux96 kernel: EIP: 0060:[<f8f5f081>] Tainted: P VLI
Aug 16 15:29:02 linux96 kernel: EFLAGS: 00010292 (2.6.9-34.ELsmp)
Aug 16 15:29:02 linux96 kernel: EIP is at ocfs2_extend_file+0x380/0xf25 [ocfs2]
Aug 16 15:29:02 linux96 kernel: eax: 0000008b ebx: 00000000 ecx: ce929e6c edx: f8f8826f
Aug 16 15:29:02 linux96 kernel: esi: e7f8de24 edi: ce929f18 ebp: f325d000 esp: ce929ea4
Aug 16 15:29:02 linux96 kernel: ds: 007b es: 007b ss: 0068
Aug 16 15:29:02 linux96 kernel: Process perl (pid: 6670, threadinfo=ce929000 task=e722ed30)
Aug 16 15:29:02 linux96 kernel: Stack: f5803ac0 00000000 00000000 00000000 e7f8de24 f5f82880 ce929f58 00000000
Aug 16 15:29:02 linux96 kernel:        00000000 d53c51e8 c56a8800 ce929f68 00000000 ce929f68 00000000 ce929f68
Aug 16 15:29:02 linux96 kernel:        00000000 e7f8de24 f8f6d213 005caeb5 00000000 ce929f18 005cae51 00000000
Aug 16 15:29:02 linux96 kernel: Call Trace:
Aug 16 15:29:02 linux96 kernel: [<f8f6d213>] ocfs2_write_lock_maybe_extend+0x731/0xad5 [ocfs2]
Aug 16 15:29:02 linux96 kernel: [<f8f5d0d0>] ocfs2_file_write+0x11f/0x254 [ocfs2]
Aug 16 15:29:02 linux96 kernel: [<c015a5e8>] vfs_write+0xb6/0xe2
Aug 16 15:29:02 linux96 kernel: [<c015a6b2>] sys_write+0x3c/0x62
Aug 16 15:29:02 linux96 kernel: [<c02d2657>] syscall_call+0x7/0xb
Aug 16 15:29:02 linux96 kernel: Code: b1 e0 fd ff ff ff b1 dc fd ff ff 68 13 03 00 00 68 b5 28 f8 f8 ff 70 10 ff b2 94 00 00 00 68 6f 82 f8 f8 e8 bc 35 1c c7 83 c4 3c <0f> 0b 13 03 d3 7f f8 f8 8b 5c 24 10 8b 83 54 01 00 00 0f ae e8
Aug 16 15:29:02 linux96 kernel: <0>Fatal exception: panic in 5 seconds
Aug 16 15:34:40 linux96 syslogd 1.4.1: restart.
Aug 16 15:34:40 linux96 syslog: syslogd startup succeeded
Aug 16 15:34:40 linux96 kernel: klogd 1.4.1, log source = /proc/kmsg started.
Aug 16 15:34:40 linux96 kernel: Linux version 2.6.9-34.ELsmp (bhcompile@hs20-bc1-7.build.redhat.com) (gcc version 3.4.5 20051201 (Red Hat 3.4.5-2)) #1 SMP Fri Feb 24 16:54:53 EST 2006
Aug 16 15:34:40 linux96 kernel: BIOS-provided physical RAM map:
Aug 16 15:34:40 linux96 kernel: BIOS-e820: 0000000000000000 - 00000000000a0000 (usable)
Aug 16 15:34:40 linux96 kernel: BIOS-e820: 0000000000100000 - 00000000dffc0000 (usable)
Aug 16 15:34:40 linux96 kernel: BIOS-e820: 00000000dffc0000 - 00000000dffcfc00 (ACPI data)
Aug 16 15:34:40 linux96 kernel: BIOS-e820: 00000000dffcfc00 - 00000000dffff000 (reserved)
Aug 16 15:34:40 linux96 kernel: BIOS-e820: 00000000e0000000 - 00000000f0000000 (reserved)
Aug 16 15:34:40 linux96 kernel: BIOS-e820: 00000000fec00000 - 00000000fec90000 (reserved)
Aug 16 15:34:40 linux96 kernel: BIOS-e820: 00000000fed00000 - 00000000fed00400 (reserved)
Aug 16 15:34:40 linux96 kernel: BIOS-e820: 00000000fee00000 - 00000000fee10000 (reserved)
Aug 16 15:34:40 linux96 kernel: BIOS-e820: 00000000ffb00000 - 0000000100000000 (reserved)
Aug 16 15:34:40 linux96 kernel: BIOS-e820: 0000000100000000 - 00000001ffffe000 (usable)
Aug 16 15:34:40 linux96 kernel: BIOS-e820: 00000001ffffe000 - 0000000200000000 (reserved)
Aug 16 15:34:40 linux96 kernel: BIOS-e820: 0000000200000000 - 0000000220000000 (usable)

Any suggestions are welcome. Please let me know if there's any debugging I can follow up with. Thanks.
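For reference, the check that fired can be re-evaluated by hand from the two ERROR lines in the log above. The following is only a minimal Python sketch, assuming the logged fields map onto the bug expression as their names suggest (i_size is what i_size_read(inode) returned, dinode i_size is le64_to_cpu(fe->i_size)). The names parse_sizes and bug_expression are made up for illustration and are not part of ocfs2.

```python
import re

# The second ERROR line from the log, minus the syslog prefix.
LOG_LINE = ("ERROR: Inode 1067290 i_size = 6073937, dinode i_size = 6074345, "
            "bytes_extended = 0, new_i_size = 6074037")


def parse_sizes(line):
    """Pull the numeric fields out of the ocfs2_extend_file ERROR line."""
    m = re.search(
        r"Inode (\d+) i_size = (\d+), dinode i_size = (\d+), "
        r"bytes_extended = (\d+), new_i_size = (\d+)",
        line,
    )
    if m is None:
        raise ValueError("unrecognized log line")
    keys = ("inode", "i_size", "dinode_i_size", "bytes_extended", "new_i_size")
    return dict(zip(keys, map(int, m.groups())))


def bug_expression(f):
    # Mirrors the logged check (assumed field mapping):
    # i_size_read(inode) != (le64_to_cpu(fe->i_size) - *bytes_extended)
    return f["i_size"] != f["dinode_i_size"] - f["bytes_extended"]


fields = parse_sizes(LOG_LINE)
print(bug_expression(fields))                       # True: the assertion trips
print(fields["dinode_i_size"] - fields["i_size"])   # 408
```

With these values the size held in the in-memory inode is 408 bytes smaller than the size recorded in the dinode, and bytes_extended is 0, so the expression is true and the BUG at file.c:787 is hit.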
This specific problem was fixed in 1.2.2. The latest release is 1.2.3. Upgrade to 1.2.3.

Jim Erb wrote:
> See earlier post - May 10th "Node Panic"
>
> Can anyone tell me what might be happening here? I have a 3 node
> cluster running under RH AS 4 (2.6.9-34.ELsmp) with ocfs2 v.
> 1.2.1. I've upgraded to 1.2.1 as suggested in the previous post,
> but one or more of my nodes continues to panic weekly:
> [...]
Hello,

This was fixed some time ago, I think in 1.2.2. Please see http://oss.oracle.com/projects/ocfs2/files/RedHat/RHEL4/ to upgrade to the latest (1.2.3).

Thanks
-kurt

Kurt C. Hackel
Oracle