similar to: Another node is heartbeating in our slot! errors with LUN removal/addition

Displaying 20 results from an estimated 200 matches similar to: "Another node is heartbeating in our slot! errors with LUN removal/addition"

2010 Oct 20
1
OCFS2 + iscsi: another node is heartbeating in our slot (over scst)
Hi, I'm building a cluster containing two nodes with seperate common storage server. On storage server i have volume with ocfs2 fs which is sharing this volume via iscsi target. When node connected to the target i can local mount volume on node and using it. Unfortunately. on storage server ocfs2 logged to dmesg: Oct 19 22:21:02 storage kernel: [ 1510.424144]
2007 Mar 16
2
re: o2hb_do_disk_heartbeat:963 ERROR: Device "sdb1" another node is heartbeating in our slot!
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Folks, I'm trying to wrap my head around something that happened in our environment. Basically, we noticed the error in /var/log/messages with no other errors. "Mar 16 13:38:02 dbo3 kernel: (3712,3):o2hb_do_disk_heartbeat:963 ERROR: Device "sdb1": another node is heartbeating in our slot!" Usually there are a
2010 Dec 07
1
Two-node cluster often hanging in o2hb/jdb2
Hi, I'm pretty new to ocfs2 and a bit stuck. I have two Debian/Squeeze (testing) machines accessing an ocfs2 filesystem over aoe. The filesystem sits on an lvm2 volume, but I guess that is irrelevant. Even when mostly idle, everything accessing the cluster sometimes hangs for about 20 seconds. This happens rather frequently, say every 5 minutes, but the interval seems irregular while the
2011 Oct 18
12
Unable to stop cluster as heartbeat region still active
Hi, I have a 2 nodes ocfs2 cluster running UEK 2.6.32-100.0.19.el5, ocfs2console-1.6.3-2.el5, ocfs2-tools-1.6.3-2.el5. My problem is that all the time when i try to run /etc/init.d/o2cb stop it fails with this error: Stopping O2CB cluster CLUSTER: Failed Unable to stop cluster as heartbeat region still active There is no active mount point. I tried to manually stop the heartdbeat with
2006 Aug 04
3
OCFS2 and ASM Question
Ok guys & gals here is the scenario: 1.) Host RHEL 4 U3 2.6.9-34.0.2.EL 2.) OCFS2 latest version 3.) Successfully formatted & mounted OCFS2 filesystems on 2 nodes /dev/sdb1 /u02/oradata/usdev/voting /dev/sdc1 /u02/oradata/usdev/data01 /dev/sdd1 /u02/oradata/usdev/data02 /dev/sde1 /u02/oradata/usdev/data03 4.) Downloaded & installed ASMLib 2.0 on both nodes 5.) Ran
2008 Sep 11
4
Some more debug stuff
Added two debugfs entries... one to dump o2hb livenodes and the other to dump osb. $ cat /sys/kernel/debug/ocfs2/BC4F4550BEA74F92BDCC746AAD2EC0BF/fs_state Device => Id: 8,65 Uuid: BC4F4550BEA74F92BDCC746AAD2EC0BF Gen: 0xA02024F2 Label: sunil-xattr Volume => State: 1 Flags: 0x0 Sizes => Block: 4096 Cluster: 4096 Features => Compat: 0x1 Incompat: 0x350 ROcompat: 0x1
2005 Jul 12
1
problem mounting ocfs2: heartbeat
When attempting to mount the OCFS2 file system I'm getting the following error message: ocfs2_hb_ctl: Internal logic failure while starting heartbeat mount.ocfs2: Error when attempting to run /sbin/ocfs2_hb_ctl: "Operation not permitted" I followed the steps given in the users_guide: modprobe ocfs2_dlmfs mount -t configfs none /config mount -t ocfs2_dlmfs none /dlm o2cb_ctl
2011 Mar 03
1
OCFS2 1.4 + DRBD + iSCSI problem with DLM
An HTML attachment was scrubbed... URL: http://oss.oracle.com/pipermail/ocfs2-users/attachments/20110303/0fbefee6/attachment.html
2008 Sep 10
4
mount.ocfs2: Error when attempting to run /sbin/ocfs2_hb_ctl: "Operation not permitted".
Hi, I am trying to configure a two node cluster on SLES10SP2 using user level heartbeat. Here is my configuration. ocfs2-tools-1.4.0-0.3 **user level heartbeat** -> lsmod | grep ocfs ocfs2_user_heartbeat 20992 1 ocfs2_dlmfs 37776 1 ocfs2_dlm 204456 1 ocfs2_dlmfs ocfs2_nodemanager 223384 6 ocfs2_user_heartbeat,ocfs2_dlmfs,ocfs2_dlm configfs 44700 3 ocfs2_user_heartbeat,ocfs2_nodemanager
2009 Jul 15
1
CentOS-5.3 + DRBD-8.2 + OCFS2-1.4
I've run into a problem mounting an OCFS2 filesystem on a DRBD device. I think it's the same one discussed at http://lists.linbit.com/pipermail/drbd-user/2007-April/006681.html When I try to mount the filesystem I get a ocfs2_hb_ctl: I/O error: [root at node-6A ~]# mount -t ocfs2 /dev/drbd2 /cshare ocfs2_hb_ctl: I/O error on channel while starting heartbeat mount.ocfs2: Error when
2010 Oct 08
23
O2CB global heartbeat - hopefully final drop!
All, This is hopefully the final drop of the patches for adding global heartbeat to the o2cb stack. The diff from the previous set is here: http://oss.oracle.com/~smushran/global-hb-diff-2010-10-07 Implemented most of the suggestions provided by Joel and Wengang. The most important one was to activate the feature only at the end, Also, got mostly a clean run with checkpatch.pl. Sunil
2005 Oct 12
2
Unable to access cluster service
hello, I'm running Ubuntu Breezy with the OCFS2 modules in the standard kernel. I installed ocfs2console and ocfs2-tools I've formatted a partition with ocfs2. But I can't add any node or mount the device(with the ocfs2console). because I get a "Unable to access cluster service" I can't find the cause nor the solution to this. root@lenaeja:~# /etc/init.d/o2cb status
2007 Mar 30
1
HowTo recover ocfs2 in a 10g four node cluster
Hi All, I needed to rebuild the operating system on one of the 4 nodes in my cluster but when I try to startup ocfs, the return from the init script is nothing? how do I fix: [root@kmloraper1 /]# /etc/init.d/ocfs2 restart Stopping Oracle Cluster File System (OCFS2) [ OK ] Starting Oracle Cluster File System (OCFS2) ocfs2_hb_ctl: Device name specified was not found while reading
2007 Dec 26
1
design question
Ladies and Gents, Researching the problem with DRBD 0.8 and OCFS2 not being able to mount... It appears to me the same problem has been reported, yet no fix was posted on the news group... (http://oss.oracle.com/pipermail/ocfs2-users/2006-August/000754.html) In any case, my env. looks as follows: drbd-8.0.6-1.el5.centos (primary/primary) kmod-drbd-8.0.6-1.2.6.18_53.1.4.el5
2006 Mar 19
1
ocfs2 problem
Hi, I am having a two node cluster with rhelu2 OS installed and when i am trying to mount with ocfs ,i am getting the error pasted below. "ocfs2_hb_ctl: Bad magic number in superblock while reading uuid mount.ocfs2: Error when attempting to run /sbin/ocfs2_hb_ctl: "Operation not permitted" I am getting this error in only one node and in other node it is ok. Is it because in the
2007 Sep 04
3
Ocfs2 and debian
Hi. I'm pretty new to ocfs2 and clusters. I'm trying to make ocfs2 running over a drbd device. I know it's not the best solution but for now i must deal with this. I set up drbd and work perfectly. I set up ocfs and i'm not able to make it to work. /etc/init.d/o2cb status: Module "configfs": Loaded Filesystem "configfs": Mounted Module
2007 Jun 26
3
1.2.6 compile failures
Hi, I'm trying to compile ocfs2 1.2.6 on a 2.6.21 kernel (with rsbac and pax patches), but I can't get this to work .. In 2.6.20 there was an change in the definition of the INIT_WORK macro (http://lkml.org/lkml/2006/12/5/269) this seems to cause my problems (see below) but even after removing the third parameter of the INIT_WORK calls the compile fails (see second compile failure). Can
2009 Sep 24
1
strange fencing behavior
I have 10 servers in a cluster running Debian Etch with 2.6.26-bpo.2 with a backport of ocfs2-tools-1.4.1-1 I'm using AoE to export the drives from a Debian Lenny server in the cluster. My problem is if I mount the ocfs2 partition on the server that is exporting it via AoE it fences the entire cluster. Looking at the logs exporting the ocfs2 partition doesn't give much information...
2010 Oct 25
7
[PATCH 0/6] Ocfs2-tools: Add a new tool 'o2info'.
Now it's a good time to introduce the new tool 'o2info' since kernel part of OCFS2_IOC_INFO ioctl has been pulld upstream by linus. The following 6 patches have already got sunil's SOBs, and now they're trying to attract more reviewers before it goes to central repo with a modification of getting manual pages being introduced.
2010 Jan 18
1
Getting Closer (was: Fencing options)
One more follow on, The combination of kernel.panic=60 and kernel.printk=7 4 1 7 seems to have netted the culrptit: E01-netconsole.log:Jan 18 09:45:10 E01 (10,0):o2hb_write_timeout:137 ERROR: Heartbeat write timeout to device dm-12 after 60000 milliseconds E01-netconsole.log:Jan 18 09:45:10 E01 (10,0):o2hb_stop_all_regions:1517 ERROR: stopping heartbeat on all active regions.