Displaying 16 results from an estimated 16 matches for "o2hb".
Did you mean:
o2cb
2010 Dec 07
1
Two-node cluster often hanging in o2hb/jdb2
...ut the interval seems irregular while the time that it hangs
is quite similar. This behavior seems pretty much independent from
the (IO) load of the nodes (as long as not really high).
I tried a ps, grepping for D repeated every second on both nodes.
When hanging, both show this:
1649 D< o2hb-02BC250CDB ?
3507 R+ ps -
1649 D< o2hb-02BC250CDB ?
3511 R+ ps -
1649 D< o2hb-02BC250CDB ?
3515 R+ ps -
1649 D< o2hb-02BC250CDB ?
3519 R+ ps -
1649 D< o2hb-02BC250CDB ?
3523 R+ ps -
1649 D<...
2008 Oct 22
2
Another node is heartbeating in our slot! errors with LUN removal/addition
Greetings,
Last night I manually unpresented and deleted a LUN (a SAN snapshot)
that was presented to one node in a four node RAC environment running
OCFS2 v1.4.1-1. The system then rebooted with the following error:
Oct 21 16:45:34 ausracdb03 kernel: (27,1):o2hb_write_timeout:166 ERROR:
Heartbeat write timeout to device dm-24 after 120000 milliseconds
Oct 21 16:45:34 ausracdb03 kernel: (27,1):o2hb_stop_all_regions:1873
ERROR: stopping heartbeat on all active regions.
I'm assuming that dm-24 was the LUN that was deleted. Looking back in
the syslog, I...
2023 Jun 06
0
[bug report] ocfs2/cluster: Pin/unpin o2hb regions
...param.type = fs_value_is_string;
182 }
183
184 ret = vfs_parse_fs_param(fc, ¶m);
185 kfree(param.string);
186 return ret;
187 }
There are a couple OCFS functions which call this sleeping function
while holding a spinlock.
o2hb_heartbeat_group_drop_item() <- disables preempt
o2hb_region_inc_user() <- disables preempt
-> o2hb_region_pin()
-> o2nm_depend_item()
-> configfs_depend_item()
-> configfs_pin_fs()
-> simple_pin_fs()
-> vfs_kern_mount()...
2010 Oct 20
1
OCFS2 + iscsi: another node is heartbeating in our slot (over scst)
...torage
server.
On storage server i have volume with ocfs2 fs which is sharing this
volume via iscsi target.
When node connected to the target i can local mount volume on node and
using it.
Unfortunately. on storage server ocfs2 logged to dmesg:
Oct 19 22:21:02 storage kernel: [ 1510.424144]
(o2hb-2283B3335E,4427,0):o2hb_do_disk_heartbeat:770 ERROR: Device
"sda1": another node is heartbeating in our slot!
Oct 19 22:21:02 storage kernel: [ 1510.428600] o2net: connection to node
node-2 (num 0) at 192.168.1.69:7777 shutdown, state 7
.
.
Oct 19 22:21:32 storage kernel: [ 1540.448016]...
2010 Oct 08
23
O2CB global heartbeat - hopefully final drop!
All,
This is hopefully the final drop of the patches for adding global heartbeat
to the o2cb stack.
The diff from the previous set is here:
http://oss.oracle.com/~smushran/global-hb-diff-2010-10-07
Implemented most of the suggestions provided by Joel and Wengang.
The most important one was to activate the feature only at the end,
Also, got mostly a clean run with checkpatch.pl.
Sunil
2010 Jan 18
1
Getting Closer (was: Fencing options)
One more follow on,
The combination of kernel.panic=60 and kernel.printk=7 4 1 7 seems to
have netted the culrptit:
E01-netconsole.log:Jan 18 09:45:10 E01 (10,0):o2hb_write_timeout:137
ERROR: Heartbeat write timeout to device dm-12 after 60000
milliseconds
E01-netconsole.log:Jan 18 09:45:10 E01
(10,0):o2hb_stop_all_regions:1517 ERROR: stopping heartbeat on all
active regions.
E01-netconsole.log:Jan 18 09:45:10 E01 ocfs2 is very sorry to be
fencing this system by...
2008 Sep 11
4
Some more debug stuff
Added two debugfs entries... one to dump o2hb livenodes and the other
to dump osb.
$ cat /sys/kernel/debug/ocfs2/BC4F4550BEA74F92BDCC746AAD2EC0BF/fs_state
Device => Id: 8,65 Uuid: BC4F4550BEA74F92BDCC746AAD2EC0BF Gen: 0xA02024F2 Label: sunil-xattr
Volume => State: 1 Flags: 0x0
Sizes => Block: 4096 Cluster: 4096
Features =...
2011 Mar 03
1
OCFS2 1.4 + DRBD + iSCSI problem with DLM
An HTML attachment was scrubbed...
URL: http://oss.oracle.com/pipermail/ocfs2-users/attachments/20110303/0fbefee6/attachment.html
2010 Oct 22
0
[GIT PULL] ocfs2 changes for 2.6.37
...ing global heartbeat
ocfs2/dlm: Add message DLM_QUERY_NODEINFO
ocfs2/cluster: Print messages when adding/removing heartbeat regions
ocfs2/cluster: Print messages when adding/removing nodes
ocfs2/cluster: Check slots for unconfigured live nodes
ocfs2/cluster: Reorganize o2hb debugfs init
ocfs2/cluster: Maintain live node bitmap per heartbeat region
ocfs2/cluster: Track number of global heartbeat regions
ocfs2/cluster: Track bitmap of live heartbeat regions
ocfs2/cluster: Maintain bitmap of quorum regions
ocfs2/cluster: Maintain bitmap of f...
2010 Mar 11
0
Ocfs2-users Digest, Vol 75, Issue 10
...erti <gabriele.alberti at pg.infn.it>
> Cc: ocfs2-users at oss.oracle.com
> Message-ID: <4B97F97D.5040904 at oracle.com>
> Content-Type: text/plain; charset=ISO-8859-1;
> format=flowed
>
> Were the first set of messages on all nodes? On that node
> atleast
> the o2hb node down event fired. It should have fired on all
> nodes.
> This is the dlm eviction message.
>
> If they all fired, then look for a node to have a message
> that
> reads "Node x is the Recovery Master for the Dead Node y".
>
> That shows a node was elected to...
2014 May 06
0
poor write performance or locking issues with ocfs2
...0-1 to 40, 500, 2000! CPU is fine, no problems. RAM is free, no problems.
"ps -e -o pid,stat,comm,wchan=WIDE-WCHAN-COLUMN | grep D" shows me some apache processes with a state of "D", but with no "WIDE-WCHAN-COLUMN" filled. Here's an example output:
--
3176 D< o2hb-6F81EC9057 -
3392 D jbd2/dm-1-41 -
3393 D ocfs2cmt -
17221 D apache2 -
18424 D kworker/8:3 -
18453 D apache2 -
...
---
Some output of /proc/pid/stack:
--
[<ffffffff81051d5f>] process_timeout+0x0/0x5
[<ffffffff810528be>] msleep_interruptibl...
2011 Dec 20
8
ocfs2 - Kernel panic on many write/read from both
Sorry i don`t copy everything:
TEST-MAIL1# echo "ls //orphan_dir:0000"|debugfs.ocfs2 /dev/dm-0|wc
debugfs.ocfs2 1.6.4
5239722 26198604 246266859
TEST-MAIL1# echo "ls //orphan_dir:0001"|debugfs.ocfs2 /dev/dm-0|wc
debugfs.ocfs2 1.6.4
6074335 30371669 285493670
TEST-MAIL2 ~ # echo "ls //orphan_dir:0000"|debugfs.ocfs2 /dev/dm-0|wc
debugfs.ocfs2 1.6.4
5239722 26198604
2006 Dec 29
3
[git patches] ocfs2 fixes
...(&OCFS2_I(inode)->ip_lock);
ret = -EIO;
goto bail;
diff --git a/fs/ocfs2/cluster/heartbeat.c b/fs/ocfs2/cluster/heartbeat.c
index a25ef5a..277ca67 100644
--- a/fs/ocfs2/cluster/heartbeat.c
+++ b/fs/ocfs2/cluster/heartbeat.c
@@ -1447,6 +1447,15 @@ out:
return ret;
}
+static ssize_t o2hb_region_pid_read(struct o2hb_region *reg,
+ char *page)
+{
+ if (!reg->hr_task)
+ return 0;
+
+ return sprintf(page, "%u\n", reg->hr_task->pid);
+}
+
struct o2hb_region_attribute {
struct configfs_attribute attr;
ssize_t (*show)(struct o2...
2009 Apr 17
26
OCFS2 1.4: Patches backported from mainline
Please review the list of patches being applied to the ocfs2 1.4 tree.
All patches list the mainline commit hash.
Thanks
Sunil
2009 Mar 17
33
[git patches] Ocfs2 updates for 2.6.30
Hi,
The following patches comprise the bulk of Ocfs2 updates for the
2.6.30 merge window. Aside from larger, more involved fixes, we're adding
the following features, which I will describe in the order their patches are
mailed.
Sunil's exported some more state to our debugfs files, and
consolidated some other aspects of our debugfs infrastructure. This will
further aid us in debugging
2008 Apr 02
10
[PATCH 0/62] Ocfs2 updates for 2.6.26-rc1
...bal DLM_ constants in generic code.
ocfs2: Use -errno instead of dlm_status for ocfs2_dlm_lock/unlock() API.
ocfs2: Create the lock status block union.
ocfs2: Introduce the new ocfs2_cluster_connect/disconnect() API.
ocfs2: Abstract out node number queries.
ocfs2: Move o2hb functionality into the stack glue.
ocfs2: Remove CANCELGRANT from the view of dlmglue.
ocfs2: Abstract out a debugging function for underlying dlms.
ocfs2: Clean up stackglue initialization
ocfs2: Split o2cb code from generic stack functions.
ocfs2: Create ocfs2_stack_...