I am receiving the following error when I attempt to add a node to the cluster:

o2net: connected to node bapp02 (num 5) at 10.10.16.12:7777
o2net: connected to node bapp04 (num 7) at 10.10.16.14:7777
o2net: connected to node bapp05 (num 8) at 10.10.16.15:7777
o2net: connected to node bdb02 (num 10) at 10.10.20.12:7777
o2net: connected to node bdb01 (num 9) at 10.10.20.11:7777
o2net: connected to node bapp03 (num 6) at 10.10.16.13:7777
o2net: connected to node bapp06x (num 13) at 10.10.16.16:7777
o2net: connected to node bsip01x (num 15) at 10.10.16.17:7777
o2net: connected to node bnosey (num 11) at 10.10.8.20:7777
o2net: connected to node bws01 (num 1) at 10.10.12.11:7777
o2net: connected to node bws02 (num 2) at 10.10.12.12:7777
o2net: connected to node bws03 (num 3) at 10.10.12.13:7777
OCFS2 1.4.1 Wed Jul 23 12:06:08 PDT 2008 (build 3fc82af4b5669945497b322b6aabd031)
(1279,0):o2net_connect_expired:1637 ERROR: no connection established with node 0 after 30.0 seconds, giving up and returning errors.
(3303,0):dlm_request_join:1033 ERROR: status = -107
(3303,0):dlm_try_to_join_domain:1207 ERROR: status = -107
(3303,0):dlm_join_domain:1485 ERROR: status = -107
(3303,0):dlm_register_domain:1732 ERROR: status = -107
(3303,0):ocfs2_dlm_init:2662 ERROR: status = -107
(3303,0):ocfs2_mount_volume:1251 ERROR: status = -107
ocfs2: Unmounting device (202,33) on (node 16)
o2net: no longer connected to node bws01 (num 1) at 10.10.12.11:7777
o2net: no longer connected to node bws02 (num 2) at 10.10.12.12:7777
o2net: no longer connected to node bws03 (num 3) at 10.10.12.13:7777
o2net: no longer connected to node bapp02 (num 5) at 10.10.16.12:7777
o2net: no longer connected to node bapp03 (num 6) at 10.10.16.13:7777
o2net: no longer connected to node bapp04 (num 7) at 10.10.16.14:7777
o2net: no longer connected to node bapp05 (num 8) at 10.10.16.15:7777
o2net: no longer connected to node bdb01 (num 9) at 10.10.20.11:7777
o2net: no longer connected to node bdb02 (num 10) at 10.10.20.12:7777
o2net: no longer connected to node bnosey (num 11) at 10.10.8.20:7777
o2net: no longer connected to node bapp06x (num 13) at 10.10.16.16:7777
o2net: no longer connected to node bsip01x (num 15) at 10.10.16.17:7777

Other nodes in the cluster can mount the shared file system without any issues. I have checked /etc/ocfs2/cluster.conf and the file is correct, with no typos or other mistakes. When I run o2cb_ctl to get information about the node, it shows the following:

bash-3.2# o2cb_ctl -I -n bftp01x
node:
        name = bftp01x
        cluster = cfi_cl01
        number = 16
        ip_address = 10.10.12.14
        ip_port = 7777
        status = configured

I am a little stumped as to what could be going on. Is there a way I can delete the node and start from scratch on a running cluster? I attempted to use o2cb_ctl to do this, but I get the following error:

bash-3.2# o2cb_ctl -D -n bftp01x -u
o2cb_ctl: Not yet supported
bash-3.2#

Any help on this would be great.

Thanks
Andrew

Node 0 does not know about this node. Run the o2cb_ctl command on node 0 to add the node to the live cluster, as listed in the 1.4 user's guide.

Andrew Deagman wrote:
> I am receiving the following error when I attempt to add a node to the
> cluster:
>
> o2net: connected to node bapp02 (num 5) at 10.10.16.12:7777
> o2net: connected to node bapp04 (num 7) at 10.10.16.14:7777
> o2net: connected to node bapp05 (num 8) at 10.10.16.15:7777
> o2net: connected to node bdb02 (num 10) at 10.10.20.12:7777
> o2net: connected to node bdb01 (num 9) at 10.10.20.11:7777
> o2net: connected to node bapp03 (num 6) at 10.10.16.13:7777
> o2net: connected to node bapp06x (num 13) at 10.10.16.16:7777
> o2net: connected to node bsip01x (num 15) at 10.10.16.17:7777
> o2net: connected to node bnosey (num 11) at 10.10.8.20:7777
> o2net: connected to node bws01 (num 1) at 10.10.12.11:7777
> o2net: connected to node bws02 (num 2) at 10.10.12.12:7777
> o2net: connected to node bws03 (num 3) at 10.10.12.13:7777
> OCFS2 1.4.1 Wed Jul 23 12:06:08 PDT 2008 (build
> 3fc82af4b5669945497b322b6aabd031)
> (1279,0):o2net_connect_expired:1637 ERROR: no connection established
> with node 0 after 30.0 seconds, giving up and returning errors.
> (3303,0):dlm_request_join:1033 ERROR: status = -107
> (3303,0):dlm_try_to_join_domain:1207 ERROR: status = -107
> (3303,0):dlm_join_domain:1485 ERROR: status = -107
> (3303,0):dlm_register_domain:1732 ERROR: status = -107
> (3303,0):ocfs2_dlm_init:2662 ERROR: status = -107
> (3303,0):ocfs2_mount_volume:1251 ERROR: status = -107
> ocfs2: Unmounting device (202,33) on (node 16)
> o2net: no longer connected to node bws01 (num 1) at 10.10.12.11:7777
> o2net: no longer connected to node bws02 (num 2) at 10.10.12.12:7777
> o2net: no longer connected to node bws03 (num 3) at 10.10.12.13:7777
> o2net: no longer connected to node bapp02 (num 5) at 10.10.16.12:7777
> o2net: no longer connected to node bapp03 (num 6) at 10.10.16.13:7777
> o2net: no longer connected to node bapp04 (num 7) at 10.10.16.14:7777
> o2net: no longer connected to node bapp05 (num 8) at 10.10.16.15:7777
> o2net: no longer connected to node bdb01 (num 9) at 10.10.20.11:7777
> o2net: no longer connected to node bdb02 (num 10) at 10.10.20.12:7777
> o2net: no longer connected to node bnosey (num 11) at 10.10.8.20:7777
> o2net: no longer connected to node bapp06x (num 13) at 10.10.16.16:7777
> o2net: no longer connected to node bsip01x (num 15) at 10.10.16.17:7777
>
> Other nodes in the cluster can mount the shared file system without
> any issues. I have checked over /etc/ocfs2/cluster.conf and the file
> is correct, no typos's or anything else. When I run o2cb_ctl to get
> information about the node it shows the following:
>
> bash-3.2# o2cb_ctl -I -n bftp01x
> node:
>         name = bftp01x
>         cluster = cfi_cl01
>         number = 16
>         ip_address = 10.10.12.14
>         ip_port = 7777
>         status = configured
>
> I am a little stumped as to what can be going on. Is there a way I can
> delete the node and start from scratch on a running cluster? I
> attempted to use o2cb_ctl to do this however I get the following error:
>
> bash-3.2# o2cb_ctl -D -n bftp01x -u
> o2cb_ctl: Not yet supported
> bash-3.2#
>
> Any help on this would be great.
>
> Thanks
> Andrew
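
For anyone following the suggestion in the reply above, here is a rough sketch of what the command on node 0 might look like. It assumes the usual `o2cb_ctl -C -i ... -t node` form for adding a node to a running cluster, and fills in the values from the `o2cb_ctl -I` output in the original post. The helper name `build_add_node_cmd` is made up for illustration, and the script only prints the command (a dry run) so it can be checked against the user's guide for your version before running it as root.

```shell
#!/bin/sh
# Dry-run helper (hypothetical name): prints the o2cb_ctl command that would
# add a node to the live cluster. It does not run o2cb_ctl itself.
# Assumed flags: -C create, -i also install into the running cluster,
# -n object name, -t object type, -a attribute=value.
build_add_node_cmd() {
    name=$1; number=$2; ip=$3; port=$4; cluster=$5
    printf 'o2cb_ctl -C -i -n %s -t node -a number=%s -a ip_address=%s -a ip_port=%s -a cluster=%s\n' \
        "$name" "$number" "$ip" "$port" "$cluster"
}

# Values taken from the "o2cb_ctl -I -n bftp01x" output in the original post.
build_add_node_cmd bftp01x 16 10.10.12.14 7777 cfi_cl01
```

If node 0 already has an entry for the new node in its /etc/ocfs2/cluster.conf, the create step may complain that the node exists; in that case the guide's instructions for that situation should take precedence over this sketch.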