Carl J. Benson
2009-Jan-30 22:44 UTC
[Ocfs2-users] one node rejects connection from new node
Hello.
I have three systems that share an ocfs2 filesystem, and I'm
trying to add a fourth system.
These are all openSUSE 11.1, x86_64, kernel 2.6.27.7-9-default.
All have RPMs ocfs2-tools-1.4.1-6.9 and ocfs2console-1.4.1-6.9
cluster.conf looks like this:
node:
ip_port = 7777
ip_address = 140.107.170.116
number = 0
name = merlot1
cluster = ocfs2
node:
ip_port = 7777
ip_address = 140.107.158.54
number = 1
name = merlot2
cluster = ocfs2
node:
ip_port = 7777
ip_address = 140.107.158.82
number = 2
name = wilson1
cluster = ocfs2
node:
ip_port = 7778
ip_address = 140.107.170.108
number = 3
name = gladstone
cluster = ocfs2
cluster:
node_count = 4
name = ocfs2
gladstone is the new node.
I edited the cluster.conf on wilson1 using ocfs2console, and
propagated it to the other systems from there.
When I try to bring my ocfs2 online with /etc/init.d/o2cb online ocfs2,
merlot1 accepts the connection from gladstone, as does merlot2.
However, wilson1 rejects it as an unknown node! For example:
Jan 30 14:11:46 wilson1 kernel: (4447,3):o2net_accept_one:1795 attempt
to connect from unknown node at 140.107.170.108:37795
Why would this happen?
--
Carl Benson, PHS Linux SysAdmin (206-667-4862, cbenson at fhcrc.org)
Sunil Mushran
2009-Jan-30 22:57 UTC
[Ocfs2-users] one node rejects connection from new node
Could be that the cluster was already online on wilson1 when you propagated the cluster.conf to all nodes. If so, restart the cluster on that node. To add a node to an online cluster, you need to use the o2cb_ctl command. Details are in the 1.4 user's guide. Carl J. Benson wrote:> Hello. > > I have three systems that share an ocfs2 filesystem, and I'm > trying to add a fourth system. > > These are all openSUSE 11.1, x86_64, kernel 2.6.27.7-9-default. > All have RPMs ocfs2-tools-1.4.1-6.9 and ocfs2console-1.4.1-6.9 > > cluster.conf looks like this: > node: > ip_port = 7777 > ip_address = 140.107.170.116 > number = 0 > name = merlot1 > cluster = ocfs2 > > node: > ip_port = 7777 > ip_address = 140.107.158.54 > number = 1 > name = merlot2 > cluster = ocfs2 > > node: > ip_port = 7777 > ip_address = 140.107.158.82 > number = 2 > name = wilson1 > cluster = ocfs2 > > node: > ip_port = 7778 > ip_address = 140.107.170.108 > number = 3 > name = gladstone > cluster = ocfs2 > > cluster: > node_count = 4 > name = ocfs2 > > gladstone is the new node. > > I edited the cluster.conf on wilson1 using ocfs2console, and > propagated it to the other systems from there. > > When I try to bring my ocfs2 online with /etc/init.d/o2cb online ocfs2, > merlot1 accepts the connection from gladstone, as does merlot2. > However, wilson1 rejects it as an unknown node! For example: > > Jan 30 14:11:46 wilson1 kernel: (4447,3):o2net_accept_one:1795 attempt > to connect from unknown node at 140.107.170.108:37795 > > Why would this happen? > >