search for: o2hb

Displaying 16 results from an estimated 16 matches for "o2hb".

Did you mean: o2cb
2010 Dec 07
1
Two-node cluster often hanging in o2hb/jdb2
...ut the interval seems irregular while the time that it hangs is quite similar. This behavior seems pretty much independent from the (IO) load of the nodes (as long as not really high). I tried a ps, grepping for D repeated every second on both nodes. When hanging, both show this: 1649 D< o2hb-02BC250CDB ? 3507 R+ ps - 1649 D< o2hb-02BC250CDB ? 3511 R+ ps - 1649 D< o2hb-02BC250CDB ? 3515 R+ ps - 1649 D< o2hb-02BC250CDB ? 3519 R+ ps - 1649 D< o2hb-02BC250CDB ? 3523 R+ ps - 1649 D<...
2008 Oct 22
2
Another node is heartbeating in our slot! errors with LUN removal/addition
Greetings, Last night I manually unpresented and deleted a LUN (a SAN snapshot) that was presented to one node in a four node RAC environment running OCFS2 v1.4.1-1. The system then rebooted with the following error: Oct 21 16:45:34 ausracdb03 kernel: (27,1):o2hb_write_timeout:166 ERROR: Heartbeat write timeout to device dm-24 after 120000 milliseconds Oct 21 16:45:34 ausracdb03 kernel: (27,1):o2hb_stop_all_regions:1873 ERROR: stopping heartbeat on all active regions. I'm assuming that dm-24 was the LUN that was deleted. Looking back in the syslog, I...
2023 Jun 06
0
[bug report] ocfs2/cluster: Pin/unpin o2hb regions
...param.type = fs_value_is_string; 182 } 183 184 ret = vfs_parse_fs_param(fc, &param); 185 kfree(param.string); 186 return ret; 187 } There are a couple OCFS functions which call this sleeping function while holding a spinlock. o2hb_heartbeat_group_drop_item() <- disables preempt o2hb_region_inc_user() <- disables preempt -> o2hb_region_pin() -> o2nm_depend_item() -> configfs_depend_item() -> configfs_pin_fs() -> simple_pin_fs() -> vfs_kern_mount()...
2010 Oct 20
1
OCFS2 + iscsi: another node is heartbeating in our slot (over scst)
...torage server. On storage server i have volume with ocfs2 fs which is sharing this volume via iscsi target. When node connected to the target i can local mount volume on node and using it. Unfortunately. on storage server ocfs2 logged to dmesg: Oct 19 22:21:02 storage kernel: [ 1510.424144] (o2hb-2283B3335E,4427,0):o2hb_do_disk_heartbeat:770 ERROR: Device "sda1": another node is heartbeating in our slot! Oct 19 22:21:02 storage kernel: [ 1510.428600] o2net: connection to node node-2 (num 0) at 192.168.1.69:7777 shutdown, state 7 . . Oct 19 22:21:32 storage kernel: [ 1540.448016]...
2010 Oct 08
23
O2CB global heartbeat - hopefully final drop!
All, This is hopefully the final drop of the patches for adding global heartbeat to the o2cb stack. The diff from the previous set is here: http://oss.oracle.com/~smushran/global-hb-diff-2010-10-07 Implemented most of the suggestions provided by Joel and Wengang. The most important one was to activate the feature only at the end, Also, got mostly a clean run with checkpatch.pl. Sunil
2010 Jan 18
1
Getting Closer (was: Fencing options)
One more follow on, The combination of kernel.panic=60 and kernel.printk=7 4 1 7 seems to have netted the culrptit: E01-netconsole.log:Jan 18 09:45:10 E01 (10,0):o2hb_write_timeout:137 ERROR: Heartbeat write timeout to device dm-12 after 60000 milliseconds E01-netconsole.log:Jan 18 09:45:10 E01 (10,0):o2hb_stop_all_regions:1517 ERROR: stopping heartbeat on all active regions. E01-netconsole.log:Jan 18 09:45:10 E01 ocfs2 is very sorry to be fencing this system by...
2008 Sep 11
4
Some more debug stuff
Added two debugfs entries... one to dump o2hb livenodes and the other to dump osb. $ cat /sys/kernel/debug/ocfs2/BC4F4550BEA74F92BDCC746AAD2EC0BF/fs_state Device => Id: 8,65 Uuid: BC4F4550BEA74F92BDCC746AAD2EC0BF Gen: 0xA02024F2 Label: sunil-xattr Volume => State: 1 Flags: 0x0 Sizes => Block: 4096 Cluster: 4096 Features =...
2011 Mar 03
1
OCFS2 1.4 + DRBD + iSCSI problem with DLM
An HTML attachment was scrubbed... URL: http://oss.oracle.com/pipermail/ocfs2-users/attachments/20110303/0fbefee6/attachment.html
2010 Oct 22
0
[GIT PULL] ocfs2 changes for 2.6.37
...ing global heartbeat ocfs2/dlm: Add message DLM_QUERY_NODEINFO ocfs2/cluster: Print messages when adding/removing heartbeat regions ocfs2/cluster: Print messages when adding/removing nodes ocfs2/cluster: Check slots for unconfigured live nodes ocfs2/cluster: Reorganize o2hb debugfs init ocfs2/cluster: Maintain live node bitmap per heartbeat region ocfs2/cluster: Track number of global heartbeat regions ocfs2/cluster: Track bitmap of live heartbeat regions ocfs2/cluster: Maintain bitmap of quorum regions ocfs2/cluster: Maintain bitmap of f...
2010 Mar 11
0
Ocfs2-users Digest, Vol 75, Issue 10
...erti <gabriele.alberti at pg.infn.it> > Cc: ocfs2-users at oss.oracle.com > Message-ID: <4B97F97D.5040904 at oracle.com> > Content-Type: text/plain; charset=ISO-8859-1; > format=flowed > > Were the first set of messages on all nodes? On that node > atleast > the o2hb node down event fired. It should have fired on all > nodes. > This is the dlm eviction message. > > If they all fired, then look for a node to have a message > that > reads "Node x is the Recovery Master for the Dead Node y". > > That shows a node was elected to...
2014 May 06
0
poor write performance or locking issues with ocfs2
...0-1 to 40, 500, 2000! CPU is fine, no problems. RAM is free, no problems. "ps -e -o pid,stat,comm,wchan=WIDE-WCHAN-COLUMN | grep D" shows me some apache processes with a state of "D", but with no "WIDE-WCHAN-COLUMN" filled. Here's an example output: -- 3176 D< o2hb-6F81EC9057 - 3392 D jbd2/dm-1-41 - 3393 D ocfs2cmt - 17221 D apache2 - 18424 D kworker/8:3 - 18453 D apache2 - ... --- Some output of /proc/pid/stack: -- [<ffffffff81051d5f>] process_timeout+0x0/0x5 [<ffffffff810528be>] msleep_interruptibl...
2011 Dec 20
8
ocfs2 - Kernel panic on many write/read from both
Sorry i don`t copy everything: TEST-MAIL1# echo "ls //orphan_dir:0000"|debugfs.ocfs2 /dev/dm-0|wc debugfs.ocfs2 1.6.4 5239722 26198604 246266859 TEST-MAIL1# echo "ls //orphan_dir:0001"|debugfs.ocfs2 /dev/dm-0|wc debugfs.ocfs2 1.6.4 6074335 30371669 285493670 TEST-MAIL2 ~ # echo "ls //orphan_dir:0000"|debugfs.ocfs2 /dev/dm-0|wc debugfs.ocfs2 1.6.4 5239722 26198604
2006 Dec 29
3
[git patches] ocfs2 fixes
...(&OCFS2_I(inode)->ip_lock); ret = -EIO; goto bail; diff --git a/fs/ocfs2/cluster/heartbeat.c b/fs/ocfs2/cluster/heartbeat.c index a25ef5a..277ca67 100644 --- a/fs/ocfs2/cluster/heartbeat.c +++ b/fs/ocfs2/cluster/heartbeat.c @@ -1447,6 +1447,15 @@ out: return ret; } +static ssize_t o2hb_region_pid_read(struct o2hb_region *reg, + char *page) +{ + if (!reg->hr_task) + return 0; + + return sprintf(page, "%u\n", reg->hr_task->pid); +} + struct o2hb_region_attribute { struct configfs_attribute attr; ssize_t (*show)(struct o2...
2009 Apr 17
26
OCFS2 1.4: Patches backported from mainline
Please review the list of patches being applied to the ocfs2 1.4 tree. All patches list the mainline commit hash. Thanks Sunil
2009 Mar 17
33
[git patches] Ocfs2 updates for 2.6.30
Hi, The following patches comprise the bulk of Ocfs2 updates for the 2.6.30 merge window. Aside from larger, more involved fixes, we're adding the following features, which I will describe in the order their patches are mailed. Sunil's exported some more state to our debugfs files, and consolidated some other aspects of our debugfs infrastructure. This will further aid us in debugging
2008 Apr 02
10
[PATCH 0/62] Ocfs2 updates for 2.6.26-rc1
...bal DLM_ constants in generic code. ocfs2: Use -errno instead of dlm_status for ocfs2_dlm_lock/unlock() API. ocfs2: Create the lock status block union. ocfs2: Introduce the new ocfs2_cluster_connect/disconnect() API. ocfs2: Abstract out node number queries. ocfs2: Move o2hb functionality into the stack glue. ocfs2: Remove CANCELGRANT from the view of dlmglue. ocfs2: Abstract out a debugging function for underlying dlms. ocfs2: Clean up stackglue initialization ocfs2: Split o2cb code from generic stack functions. ocfs2: Create ocfs2_stack_...