Hi all, I am using OCFS2-1.4.7 for 2 servers which is running Red hat enterprise 5.7 kernel 2.6.18-274.el5. OCFS2 I use for drdb for replicating master-master. My 2 servers was installed HA-Proxy. Yesterday, server web1 was down with the log kernel panic. And today, web2 was down too. After that, I trace the log file on these server and found that the reason from ocfs2. The log like this: Jul 3 10:58:37 web1 kernel: d-con r0: PingAck did not arrive in time. Jul 3 10:58:37 web1 kernel: d-con r0: peer( Primary -> Unknown ) conn( Connected -> NetworkFailure ) pdsk( UpToDate -> DUnknown ) Jul 3 10:58:37 web1 kernel: d-con r0: asender terminated Jul 3 10:58:37 web1 kernel: d-con r0: Terminating asender thread Jul 3 10:58:37 web1 kernel: d-con r0: error receiving Data, e: -5 l: 4096! Jul 3 10:58:37 web1 kernel: block drbd0: new current UUID A69EE0FA8CB9B85D:C9BABEF0844508EB:2F0151CEDDA9713A:2F0051CEDDA9713B Jul 3 10:58:37 web1 kernel: d-con r0: Connection closed Jul 3 10:58:37 web1 kernel: d-con r0: conn( NetworkFailure -> Unconnected ) Jul 3 10:58:37 web1 kernel: d-con r0: receiver terminated Jul 3 10:58:37 web1 kernel: d-con r0: Restarting receiver thread Jul 3 10:58:37 web1 kernel: d-con r0: receiver (re)started Jul 3 10:58:37 web1 kernel: d-con r0: conn( Unconnected -> WFConnection ) Jul 3 10:58:53 web1 kernel: d-con r0: Handshake successful: Agreed network protocol version 100 Jul 3 10:58:53 web1 kernel: d-con r0: Peer authenticated using 20 bytes HMAC Jul 3 10:58:53 web1 kernel: d-con r0: conn( WFConnection -> WFReportParams ) Jul 3 10:58:53 web1 kernel: d-con r0: Starting asender thread (from drbd_r_r0 [1164]) Jul 3 10:58:53 web1 kernel: block drbd0: drbd_sync_handshake: Jul 3 10:58:53 web1 kernel: block drbd0: self A69EE0FA8CB9B85D:C9BABEF0844508EB:2F0151CEDDA9713A:2F0051CEDDA9713B bits:466 flags:0 Jul 3 10:58:53 web1 kernel: block drbd0: peer 3ED53D15A1945AAF:C9BABEF0844508EB:2F0151CEDDA9713B:2F0051CEDDA9713B bits:49 flags:0 Jul 3 10:58:53 web1 kernel: block drbd0: uuid_compare()=100 by rule 90 Jul 3 10:58:53 web1 kernel: block drbd0: helper command: /sbin/drbdadm initial-split-brain minor-0 Jul 3 10:58:53 web1 kernel: block drbd0: helper command: /sbin/drbdadm initial-split-brain minor-0 exit code 0 (0x0) Jul 3 10:58:53 web1 kernel: block drbd0: Split-Brain detected but unresolved, dropping connection! Jul 3 10:58:53 web1 kernel: block drbd0: helper command: /sbin/drbdadm split-brain minor-0 Jul 3 10:58:53 web1 kernel: block drbd0: helper command: /sbin/drbdadm split-brain minor-0 exit code 0 (0x0) Jul 3 10:58:53 web1 kernel: d-con r0: conn( WFReportParams -> Disconnecting ) Jul 3 10:58:53 web1 kernel: d-con r0: error receiving ReportState, e: -5 l: 0! Jul 3 10:58:53 web1 kernel: d-con r0: asender terminated Jul 3 10:58:53 web1 kernel: d-con r0: Terminating asender thread Jul 3 10:58:53 web1 kernel: d-con r0: Connection closed Jul 3 10:58:53 web1 kernel: d-con r0: conn( Disconnecting -> StandAlone ) Jul 3 10:58:53 web1 kernel: d-con r0: receiver terminated Jul 3 10:58:53 web1 kernel: d-con r0: Terminating receiver thread Jul 3 10:58:54 web1 kernel: (httpd,11395,3):ocfs2_truncate_file:425 ERROR: bug expression: le64_to_cpu(fe->i_size) != i_size_read(inode) Jul 3 10:58:54 web1 kernel: (httpd,11395,3):ocfs2_truncate_file:425 ERROR: Inode 389752, inode i_size = 28059 != di i_size = 17004, i_flags = 0x1 Jul 3 10:58:54 web1 kernel: ----------- [cut here ] --------- [please bite here ] --------- Jul 3 10:58:54 web1 kernel: Kernel BUG at ...rpmbuild/xiaowei/BUILD/ocfs2-1.4.7/fs/ocfs2/file.c:425 Is there anyone meet the same situation? Please help me Thanks and Regards, Namldp -------------- next part -------------- An HTML attachment was scrubbed... URL: http://oss.oracle.com/pipermail/ocfs2-users/attachments/20120704/b965b6a3/attachment-0001.html