Carl J. Benson
2009-Jan-30 22:44 UTC
[Ocfs2-users] one node rejects connection from new node
Hello. I have three systems that share an ocfs2 filesystem, and I'm trying to add a fourth system. These are all openSUSE 11.1, x86_64, kernel 2.6.27.7-9-default. All have RPMs ocfs2-tools-1.4.1-6.9 and ocfs2console-1.4.1-6.9 cluster.conf looks like this: node: ip_port = 7777 ip_address = 140.107.170.116 number = 0 name = merlot1 cluster = ocfs2 node: ip_port = 7777 ip_address = 140.107.158.54 number = 1 name = merlot2 cluster = ocfs2 node: ip_port = 7777 ip_address = 140.107.158.82 number = 2 name = wilson1 cluster = ocfs2 node: ip_port = 7778 ip_address = 140.107.170.108 number = 3 name = gladstone cluster = ocfs2 cluster: node_count = 4 name = ocfs2 gladstone is the new node. I edited the cluster.conf on wilson1 using ocfs2console, and propagated it to the other systems from there. When I try to bring my ocfs2 online with /etc/init.d/o2cb online ocfs2, merlot1 accepts the connection from gladstone, as does merlot2. However, wilson1 rejects it as an unknown node! For example: Jan 30 14:11:46 wilson1 kernel: (4447,3):o2net_accept_one:1795 attempt to connect from unknown node at 140.107.170.108:37795 Why would this happen? -- Carl Benson, PHS Linux SysAdmin (206-667-4862, cbenson at fhcrc.org)
Sunil Mushran
2009-Jan-30 22:57 UTC
[Ocfs2-users] one node rejects connection from new node
Could be that the cluster was already online on wilson1 when you propagated the cluster.conf to all nodes. If so, restart the cluster on that node. To add a node to an online cluster, you need to use the o2cb_ctl command. Details are in the 1.4 user's guide. Carl J. Benson wrote:> Hello. > > I have three systems that share an ocfs2 filesystem, and I'm > trying to add a fourth system. > > These are all openSUSE 11.1, x86_64, kernel 2.6.27.7-9-default. > All have RPMs ocfs2-tools-1.4.1-6.9 and ocfs2console-1.4.1-6.9 > > cluster.conf looks like this: > node: > ip_port = 7777 > ip_address = 140.107.170.116 > number = 0 > name = merlot1 > cluster = ocfs2 > > node: > ip_port = 7777 > ip_address = 140.107.158.54 > number = 1 > name = merlot2 > cluster = ocfs2 > > node: > ip_port = 7777 > ip_address = 140.107.158.82 > number = 2 > name = wilson1 > cluster = ocfs2 > > node: > ip_port = 7778 > ip_address = 140.107.170.108 > number = 3 > name = gladstone > cluster = ocfs2 > > cluster: > node_count = 4 > name = ocfs2 > > gladstone is the new node. > > I edited the cluster.conf on wilson1 using ocfs2console, and > propagated it to the other systems from there. > > When I try to bring my ocfs2 online with /etc/init.d/o2cb online ocfs2, > merlot1 accepts the connection from gladstone, as does merlot2. > However, wilson1 rejects it as an unknown node! For example: > > Jan 30 14:11:46 wilson1 kernel: (4447,3):o2net_accept_one:1795 attempt > to connect from unknown node at 140.107.170.108:37795 > > Why would this happen? > >