This is from one of the surviving node. As to why a node died will
be known by looking at the netconsole logs of the dead node.
On 03/04/2011 02:01 PM, Garcia, Raymundo wrote:>
> Hello... I wonder if someone have had similar problem like this... a node
evicts almost in a weekly basis and I have not found the root cause yet....
>
> Mar 2 10:20:57 xirisoas3 kernel: ocfs2_dlm: Node 1 joins domain
129859624F7042EAB9829B18CA65FC88
>
> Mar 2 10:20:57 xirisoas3 kernel: ocfs2_dlm: Nodes in domain
("129859624F7042EAB9829B18CA65FC88"): 1 2 3 4
>
> Mar 3 16:18:02 xirisoas3 kernel: o2net: no longer connected to node
XIRISOAS2 (num 2) at 10.0.0.5:9999
>
> Mar 3 16:18:04 xirisoas3 kernel: (23344,2):dlm_get_lock_resource:921
129859624F7042EAB9829B18CA65FC88:$RECOVERY: at least one node (2) torecover
before lock mastery can begin
>
> Mar 3 16:18:04 xirisoas3 kernel: (23344,2):dlm_get_lock_resource:955
129859624F7042EAB9829B18CA65FC88: recovery map is not empty, but must master
$RECOVERY lock now
>
> Mar 3 16:18:04 xirisoas3 kernel: (23344,2):dlm_do_recovery:519 (23344) Node
3 is the Recovery Master for the Dead Node 2 for Domain
129859624F7042EAB9829B18CA65FC88
>
> Mar 3 16:20:48 xirisoas3 kernel: (22790,2):o2net_connect_expired:1585
ERROR: no connection established with node 2 after 10.0 seconds, giving up and
returning errors.
>
> Mar 3 16:20:59 xirisoas3 kernel: o2net: connected to node XIRISOAS2 (num 2)
at 10.0.0.5:9999
>
> Mar 3 16:20:59 xirisoas3 kernel: ocfs2_dlm: Node 2 joins domain
129859624F7042EAB9829B18CA65FC88
>
> Mar 3 16:20:59 xirisoas3 kernel: ocfs2_dlm: Nodes in domain
("129859624F7042EAB9829B18CA65FC88"): 1 2 3 4
>
> Maybe someone has some light in this problem... I appreciate any help.
>
> Thanks
>
> Raymundo Garcia
>
>
>
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
> The information contained in this message may be confidential and legally
protected under applicable law. The message is intended solely for the
addressee(s). If you are not the intended recipient, you are hereby notified
that any use, forwarding, dissemination, or reproduction of this message is
strictly prohibited and may be unlawful. If you are not the intended recipient,
please contact the sender by return e-mail and destroy all copies of the
original message.
>
>
> _______________________________________________
> Ocfs2-users mailing list
> Ocfs2-users at oss.oracle.com
> http://oss.oracle.com/mailman/listinfo/ocfs2-users
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
http://oss.oracle.com/pipermail/ocfs2-users/attachments/20110304/503dee98/attachment.html