Martin Moravcik
2014-Jan-13 13:52 UTC
[CentOS] Fwd: HA cluster - strange communication between nodes
Hi,
For a testing purposes I'm trying to create two node HA environment for
running some service (openvpn and haproxy). I installed two CentOS 6.4
KVM guests.
I was able to create a cluster and some resources. I followed the document
https://access.redhat.com/site/documentation/en-US/Red_Hat_Enterprise_Linux/6/html/Configuring_the_Red_Hat_High_Availability_Add-On_with_Pacemaker/index.html
But my cluster behaves not as expected:
After start of cluster sw on both nodes, they can see each other.
----------------------------------------
[root at lb1 ~]# pcs status
Cluster name: LB.STK
Last updated: Mon Jan 13 15:34:21 2014
Last change: Mon Jan 13 15:24:47 2014 via cibadmin on lb1.asol.local
Stack: cman
Current DC: lb1.asol.local - partition with quorum
Version: 1.1.10-14.el6_5.1-368c726
2 Nodes configured
2 Resources configured
Online: [ lb1.asol.local lb2.asol.local ]
Full list of resources:
Resource Group: LB
LAN.VIP (ocf::heartbeat:IPaddr2): Started lb2.asol.local
WAN.VIP (ocf::heartbeat:IPaddr2): Started lb2.asol.local
----------------------------------------
After manual shutdown of one node 2 (pcs cluster stop), the node 1
doesn't get this information and still believes node 2 is up and
running. In the log of corosync @lb2 these lines are repeating:
Jan 13 15:38:43 [1712] lb2.asol.local cib: info:
crm_client_new: Connecting 0x25a3810 for uid=0 gid=0 pid=10763
id=2b06a195-11f6-452d-992b-5ea0c69be21a
Jan 13 15:38:43 [1712] lb2.asol.local cib: info:
cib_process_request: Completed cib_query operation for section 'all':
OK (rc=0, origin=local/crm_resource/2, version=0.7.4)
Jan 13 15:38:43 [1712] lb2.asol.local cib: info:
crm_client_destroy: Destroying 0 events
Jan 13 17:24:24 corosync [TOTEM ] Retransmit List: 9a 9b 9c
The firewall on both nodes is open for incomming traffic from these
nodes and stonith-enabled is set to false. I created keys for root user,
so I can make ssh back and forth without using password. The pacemaker's
version is 1.1.10-14.
Do you have any idea, where might be a problem?
thanks
martin
Patrick Lists
2014-Jan-13 14:17 UTC
[CentOS] Fwd: HA cluster - strange communication between nodes
On 13-01-14 14:52, Martin Moravcik wrote:> Hi, > > For a testing purposes I'm trying to create two node HA environment for > running some service (openvpn and haproxy). I installed two CentOS 6.4 > KVM guests.Iirc CentOS 6.5 came with several updates to cluster related packages so you may want to investigate and update to 6.5. Regards, Patrick