davide rossetti
2007-Sep-19 11:24 UTC
[Ocfs2-users] ocfs2_broadcast_vote ERROR: status = -75 on Fedora 7 kernel-2.6.22.5-76.fc7
dear all, I'm a bit depressed as of now... I upgraded my OCFS2 cluster to the latest F7 kernel 2.6.22.5-76.fc7 and I can't mount my filesystem on two PC at the same time, one is 32bit and the other is 64.... any idea ?? this is what I get on the 64bit host trying mounting my fs, AFTER having mounted it on the other 32bit host. [root@theboss ~]# mount /storage/disk1 mount.ocfs2: Value too large for defined data type while mounting /dev/sdc1 on /storage/disk1. Check 'dmesg' for more information on this error. kernel log is: Sep 19 20:11:08 theboss kernel: OCFS2 Node Manager 1.3.3 Sep 19 20:11:08 theboss kernel: OCFS2 DLM 1.3.3 Sep 19 20:11:08 theboss kernel: OCFS2 DLMFS 1.3.3 Sep 19 20:11:08 theboss kernel: OCFS2 User DLM kernel interface loaded Sep 19 20:12:51 theboss kernel: o2net: connected to node rack1 (num 1) at 10.0.2.21:7777 Sep 19 20:12:55 theboss kernel: OCFS2 1.3.3 Sep 19 20:12:55 theboss kernel: ocfs2_dlm: Nodes in domain ("41AE1AA4C5534E50A93784D2AD94A94D"): 1 3 Sep 19 20:12:55 theboss kernel: kjournald starting. Commit interval 5 seconds Sep 19 20:12:55 theboss kernel: (29020,2):ocfs2_broadcast_vote:434 ERROR: status = -75 Sep 19 20:12:55 theboss kernel: (29020,2):ocfs2_do_request_vote:504 ERROR: status = -75 Sep 19 20:12:55 theboss kernel: (29020,2):ocfs2_mount_volume:1117 ERROR: status = -75 Sep 19 20:12:55 theboss kernel: (29020,3):ocfs2_broadcast_vote:434 ERROR: status = -75 Sep 19 20:12:55 theboss kernel: (29020,3):ocfs2_do_request_vote:504 ERROR: status = -75 Sep 19 20:12:55 theboss kernel: (29020,3):ocfs2_dismount_volume:1179 ERROR: status = -75 Sep 19 20:12:59 theboss kernel: ocfs2: Unmounting device (8,33) on (node 3) Sep 19 20:13:01 theboss kernel: o2net: no longer connected to node rack1 (num 1) at 10.0.2.21:7777 Sep 19 20:15:25 theboss kernel: o2net: connected to node rack1 (num 1) at 10.0.2.21:7777 Sep 19 20:15:29 theboss kernel: ocfs2_dlm: Nodes in domain ("41AE1AA4C5534E50A93784D2AD94A94D"): 1 3 Sep 19 20:15:29 theboss kernel: kjournald starting. Commit interval 5 seconds Sep 19 20:15:29 theboss kernel: (29057,2):ocfs2_broadcast_vote:434 ERROR: status = -75 Sep 19 20:15:29 theboss kernel: (29057,2):ocfs2_do_request_vote:504 ERROR: status = -75 Sep 19 20:15:29 theboss kernel: (29057,2):ocfs2_mount_volume:1117 ERROR: status = -75 Sep 19 20:15:29 theboss kernel: (29057,2):ocfs2_broadcast_vote:434 ERROR: status = -75 Sep 19 20:15:29 theboss kernel: (29057,2):ocfs2_do_request_vote:504 ERROR: status = -75 Sep 19 20:15:29 theboss kernel: (29057,2):ocfs2_dismount_volume:1179 ERROR: status = -75 Sep 19 20:15:33 theboss kernel: ocfs2: Unmounting device (8,33) on (node 3) Sep 19 20:15:35 theboss kernel: o2net: no longer connected to node rack1 (num 1) at 10.0.2.21:7777 furthermore, if I swap the mounting order (64bit first, 32bit last), I get another error on the 32bit host. any help appreciated regards -- davide.rossetti@gmail.com ICQ:290677265 SKYPE:d.rossetti
davide rossetti
2007-Sep-19 11:41 UTC
[Ocfs2-users] Re: ocfs2_broadcast_vote ERROR: status = -75 on Fedora 7 kernel-2.6.22.5-76.fc7
On 9/19/07, davide rossetti <davide.rossetti@gmail.com> wrote:> dear all, > I'm a bit depressed as of now... I upgraded my OCFS2 cluster to the > latest F7 kernel 2.6.22.5-76.fc7 and I can't mount my filesystem on > two PC at the same time, one is 32bit and the other is 64.... any idea > Sep 19 20:15:29 theboss kernel: (29057,2):ocfs2_broadcast_vote:434 > ERROR: status = -75 > Sep 19 20:15:29 theboss kernel: (29057,2):ocfs2_do_request_vote:504 > ERROR: status = -75 > Sep 19 20:15:29 theboss kernel: (29057,2):ocfs2_mount_volume:1117 > ERROR: status = -75 > Sep 19 20:15:29 theboss kernel: (29057,2):ocfs2_broadcast_vote:434 > ERROR: status = -75 > Sep 19 20:15:29 theboss kernel: (29057,2):ocfs2_do_request_vote:504 > ERROR: status = -75 > Sep 19 20:15:29 theboss kernel: (29057,2):ocfs2_dismount_volume:1179 > ERROR: status = -75 > Sep 19 20:15:33 theboss kernel: ocfs2: Unmounting device (8,33) on (node 3) > Sep 19 20:15:35 theboss kernel: o2net: no longer connected to node > rack1 (num 1) at 10.0.2.21:7777investigating a little bit more, seems like (but not sure) EOVERFLOW can be thrown only in ocfs2/cluster/tcp.c if (sc->sc_page_off == sizeof(struct o2net_msg)) { hdr = page_address(sc->sc_page); if (be16_to_cpu(hdr->data_len) > O2NET_MAX_PAYLOAD_BYTES) ret = -EOVERFLOW; } I guess there could be some 32/64 cleanness problem somewhere... I'd like to debug it a bit more but I'm not able to activate enough debugging even after enabling it: [root@theboss ~]# /sbin/debugfs.ocfs2 -l /dev/sdc1 KTHREAD off NOTICE allow ERROR allow EXPORT off QUORUM off CONN off DCACHE off VOTE allow INODE off NAMEI off UPTODATE off BH_IO off DLM_GLUE off EXTENT_MAP off FILE_IO off SUPER allow DISK_ALLOC off JOURNAL off AIO off DLM_RECOVERY off DLM_MASTER off DLM_THREAD off DLM_DOMAIN off DLM off DLMFS off HB_BIO off HEARTBEAT off SOCKET off MSG allow TCP allow EXIT deny ENTRY deny .. there must be some 'turn-on debugging log' master switch somewhere... -- davide.rossetti@gmail.com ICQ:290677265 SKYPE:d.rossetti