Vaidya, Sachin
2006-Mar-30 00:45 UTC
[Ocfs2-users] Getting error when mounting shared OCFS2 device.
Hi,

I am using RHEL4 2.6.9-34.ELsmp with OCFS2 1.2. The hardware for this 2-node cluster is connected correctly. After loading ocfs2 on both nodes, the shared device can be mounted on only one node. When I try to mount the same shared device on the second node, I get the following error:

    mount.ocfs2: Transport endpoint is not connected while mounting /dev/md0 on /crs1

Any idea why this is happening? Any help will be highly appreciated.

Thanks,
Sachin Vaidya
Sunil Mushran
2006-Mar-30 01:16 UTC
[Ocfs2-users] Getting error when mounting shared OCFS2 device.
Connection failure. Check dmesg.

Mount triggers the heartbeat thread, which triggers o2net to make a connection to all heartbeating nodes. If this connection fails, the mount fails. (The node with the larger node number initiates the connection to the node with the lower node number.)

The obvious cause would be an incorrect IP address specified in cluster.conf. Error messages in /var/log/messages on both nodes will provide more clues.
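The connection rule described above can be checked by hand from the node that fails to mount. The following is a minimal sketch, not part of o2net or the OCFS2 tools: `connection_plan` and `can_connect` are hypothetical helper names, and the sketch only assumes that o2net listens on a TCP port on each node. It lists which node dials which, and tests whether a plain TCP connection to a peer succeeds.

```python
import socket
from itertools import combinations


def connection_plan(node_numbers):
    """Return (initiator, target) pairs for every pair of nodes.

    Follows the rule stated in the reply: the node with the larger
    node number initiates the connection to the lower one.
    """
    return [(max(a, b), min(a, b)) for a, b in combinations(sorted(node_numbers), 2)]


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout.

    This is only a reachability probe; it does not speak the o2net protocol.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

For a two-node cluster, `connection_plan([0, 1])` yields a single pair in which node 1 dials node 0, so the probe is most useful when run on node 1 with the `ip_address` and `ip_port` that cluster.conf lists for node 0. A failed probe points at addressing, routing, or firewall problems rather than at the file system itself.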
Vaidya, Sachin
2006-Mar-30 23:27 UTC
[Ocfs2-users] Getting error when mounting shared OCFS2 device.
Hi,

I tried both the public and private IP addresses, but I am still not able to mount the device on both nodes. Here are my configuration details.

The hosts file is the same on both nodes:

    127.0.0.1 localhost.localdomain localhost
    172.18.11.12 acspittdw001 acspittdw001.servicemetrics.net
    172.18.22.1 priv-acspittdw001
    172.18.11.24 vip-acspittdw001
    172.18.11.13 acspittdw002 acspittdw002.servicemetrics.net
    172.18.22.2 priv-acspittdw002
    172.18.11.25 vip-acspittdw002

The cluster.conf is the same on both nodes:

    node:
        ip_port = 7777
        ip_address = 172.18.11.12
        number = 0
        name = acspittdw001
        cluster = ocfs2

    node:
        ip_port = 7777
        ip_address = 172.18.11.13
        number = 1
        name = acspittdw002
        cluster = ocfs2

    cluster:
        node_count = 2
        name = ocfs2

Both nodes can ping each other on the public and private IPs.

The mount command produces the following error on node 2 when the device is already mounted on node 1:

    [root@acspittdw002 ~]# mount -t ocfs2 /dev/md0 /crs1
    mount.ocfs2: Transport endpoint is not connected while mounting /dev/md0 on /crs1

dmesg shows the following messages:

    SELinux: initialized (dev debugfs, type debugfs), uses genfs_contexts
    (5027,2):ocfs2_initialize_super:1354 max_slots for this device: 8
    (5027,2):ocfs2_fill_local_node_info:1031 I am node 1
    (4986,2):o2net_connect_expired:1446 ERROR: no connection established with node 0 after 10 seconds, giving up and returning errors.
    (5027,2):dlm_request_join:771 ERROR: status = -107
    (5027,2):dlm_try_to_join_domain:919 ERROR: status = -107
    (5027,2):dlm_join_domain:1164 ERROR: status = -107
    (5027,2):dlm_register_domain:1354 ERROR: status = -107
    (5027,2):ocfs2_dlm_init:1996 ERROR: status = -107
    (5027,2):ocfs2_mount_volume:1063 ERROR: status = -107
    ocfs2: Unmounting device (9,0) on (node 1)

Any idea why this is happening? I can provide more details if needed. Any help will be greatly appreciated.

Thanks in advance.

- Sachin Vaidya
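To read the configuration and the dmesg output together, a small sketch may help. It is an illustration only, not an OCFS2 tool: `parse_cluster_conf`, `peers_to_dial`, and `describe_status` are hypothetical names, the parser assumes nothing more than the "stanza header followed by key = value lines" layout visible in the post, and `SAMPLE` is copied from the cluster.conf posted above. On Linux, errno 107 is ENOTCONN, which is the "Transport endpoint is not connected" text that mount.ocfs2 prints.

```python
import errno
import os


def parse_cluster_conf(text):
    """Parse stanzas ("node:" / "cluster:" headers followed by
    "key = value" lines) into a list of (stanza_type, settings) tuples."""
    stanzas = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.endswith(":") and "=" not in line:
            # A header such as "node:" starts a new stanza.
            stanzas.append((line[:-1], {}))
        elif "=" in line and stanzas:
            key, value = (part.strip() for part in line.split("=", 1))
            stanzas[-1][1][key] = value
    return stanzas


def peers_to_dial(stanzas, local_number):
    """Return (name, ip_address, port) for every node numbered below local_number.

    These are the peers the local node would have to reach, given that
    the larger node number initiates the connection.
    """
    nodes = [settings for kind, settings in stanzas if kind == "node"]
    return [
        (n["name"], n["ip_address"], int(n["ip_port"]))
        for n in nodes
        if int(n["number"]) < local_number
    ]


def describe_status(status):
    """Translate a negative kernel status (e.g. -107) into an errno name and message."""
    code = -status
    return f"{errno.errorcode.get(code, 'UNKNOWN')}: {os.strerror(code)}"


# Copied from the cluster.conf in the post above.
SAMPLE = """\
node:
    ip_port = 7777
    ip_address = 172.18.11.12
    number = 0
    name = acspittdw001
    cluster = ocfs2

node:
    ip_port = 7777
    ip_address = 172.18.11.13
    number = 1
    name = acspittdw002
    cluster = ocfs2

cluster:
    node_count = 2
    name = ocfs2
"""
```

Calling `peers_to_dial(parse_cluster_conf(SAMPLE), 1)` shows that node 1 has to reach 172.18.11.12 on TCP port 7777, which matches the "no connection established with node 0" line in dmesg. Since ping already works, checking that this specific port is reachable from acspittdw002 (for example, not blocked by a firewall) would be a reasonable next step.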
Joel Becker
2006-Mar-31 03:04 UTC
[Ocfs2-users] Getting error when mounting shared OCFS2 device.
On Wed, Mar 29, 2006 at 06:45:08PM -0600, Vaidya, Sachin wrote:
> Mount.ocfs2: Transport endpoint is not connected while mounting /dev/md0 on /crs1

md0? What type of RAID? Multipath, raid0, raid1, etc.? Are you using persistent-superblock?

Joel

--
Joel Becker
Principal Software Developer
Oracle