Hello, I have a DELL MD3000i, and a couple of servers that I want to connect to the array. I have setup the main server with CentOS 5. - 2.6.18-53.el5 x86_64 - ocfs2-tools-1.2.7-1.el5 - ocfs2console-1.2.7-1.el5 - ocfs2-2.6.18-53.el5-1.2.8-2.el5 # Kernel Module And two OpenSuSe 10.3 Servers - 2.6.22.5-31-default x86_64 - ocfs2-tools-1.2.6-18 - ocfs2console-1.2.6-18 The module has been already shipped with the kernel (thanks to novell) I had no problems mounting the FS on the CentOS 5 server. I can read/write in the FS without problems. ===========[root at arenamexico ~]# mounted.ocfs2 -d Device FS UUID Label /dev/sdb1 ocfs2 cffd81fa-7395-433e-912a-d9c9d7235df3 Coliseo [root at arenamexico ~]# mounted.ocfs2 -f Device FS Nodes /dev/sdb1 ocfs2 arenamexico [root at arenamexico ~]# mount|grep ocfs2 ocfs2_dlmfs on /dlm type ocfs2_dlmfs (rw) /dev/sdb1 on /mnt/projects type ocfs2 (rw,_netdev,noatime,heartbeat=local) =========== But when I try to start the o2cb service on the Opensuse servers, the servers just hang. ===========drwagner:~ # /etc/init.d/o2cb status Module "configfs": Loaded Filesystem "configfs": Mounted Module "ocfs2_nodemanager": Loaded Module "ocfs2_dlm": Loaded Module "ocfs2_dlmfs": Loaded Filesystem "ocfs2_dlmfs": Mounted drwagner:~ # /etc/init.d/o2cb online ocfs2 Loading module "configfs": OK Mounting configfs filesystem at /sys/kernel/config: OK Loading module "ocfs2_nodemanager": OK Loading module "ocfs2_dlm": OK Loading module "ocfs2_dlmfs": OK Mounting ocfs2_dlmfs filesystem at /dlm: OK Starting O2CB cluster ocfs2: [And it hangs] ============ Does anyone has a clue of what is happening? Thanks.
Check /etc/ocfs2/cluster.conf on the opensuse cluster. Starting cluster includes reading cluster.conf and populating /sys/kernel/config with the info. Make that file available as-is... as in, not a cut-paste. Miguel Santos wrote:> Hello, > > I have a DELL MD3000i, and a couple of servers that I want to connect to > the array. > > I have setup the main server with CentOS 5. > - 2.6.18-53.el5 x86_64 > - ocfs2-tools-1.2.7-1.el5 > - ocfs2console-1.2.7-1.el5 > - ocfs2-2.6.18-53.el5-1.2.8-2.el5 # Kernel Module > > And two OpenSuSe 10.3 Servers > - 2.6.22.5-31-default x86_64 > - ocfs2-tools-1.2.6-18 > - ocfs2console-1.2.6-18 > The module has been already shipped with the kernel (thanks to novell) > > I had no problems mounting the FS on the CentOS 5 server. > I can read/write in the FS without problems. > > ===========> [root at arenamexico ~]# mounted.ocfs2 -d > Device FS UUID Label > /dev/sdb1 ocfs2 cffd81fa-7395-433e-912a-d9c9d7235df3 Coliseo > [root at arenamexico ~]# mounted.ocfs2 -f > Device FS Nodes > /dev/sdb1 ocfs2 arenamexico > [root at arenamexico ~]# mount|grep ocfs2 > ocfs2_dlmfs on /dlm type ocfs2_dlmfs (rw) > /dev/sdb1 on /mnt/projects type ocfs2 (rw,_netdev,noatime,heartbeat=local) > ===========> > But when I try to start the o2cb service on the Opensuse servers, the > servers just hang. > > ===========> drwagner:~ # /etc/init.d/o2cb status > Module "configfs": Loaded > Filesystem "configfs": Mounted > Module "ocfs2_nodemanager": Loaded > Module "ocfs2_dlm": Loaded > Module "ocfs2_dlmfs": Loaded > Filesystem "ocfs2_dlmfs": Mounted > > drwagner:~ # /etc/init.d/o2cb online ocfs2 > Loading module "configfs": OK > Mounting configfs filesystem at /sys/kernel/config: OK > Loading module "ocfs2_nodemanager": OK > Loading module "ocfs2_dlm": OK > Loading module "ocfs2_dlmfs": OK > Mounting ocfs2_dlmfs filesystem at /dlm: OK > Starting O2CB cluster ocfs2: > > [And it hangs] > ============> > Does anyone has a clue of what is happening? > > Thanks. > > > _______________________________________________ > Ocfs2-users mailing list > Ocfs2-users at oss.oracle.com > http://oss.oracle.com/mailman/listinfo/ocfs2-users >
Miguel Santos N.
2008-Apr-01 22:00 UTC
[Ocfs2-users] SuSe Hangs when /etc/init.d/o2cb online
Thanks.. I am attaching the file /etc/ocfs/cluster.conf, it is the same file on all the servers. Sunil Mushran escribi?:> Check /etc/ocfs2/cluster.conf on the opensuse cluster. Starting cluster > includes reading cluster.conf and populating /sys/kernel/config with > the info. Make that file available as-is... as in, not a cut-paste. > > Miguel Santos wrote: >> Hello, >> >> I have a DELL MD3000i, and a couple of servers that I want to connect to >> the array. >> >> I have setup the main server with CentOS 5. >> - 2.6.18-53.el5 x86_64 >> - ocfs2-tools-1.2.7-1.el5 >> - ocfs2console-1.2.7-1.el5 >> - ocfs2-2.6.18-53.el5-1.2.8-2.el5 # Kernel Module >> >> And two OpenSuSe 10.3 Servers >> - 2.6.22.5-31-default x86_64 >> - ocfs2-tools-1.2.6-18 >> - ocfs2console-1.2.6-18 >> The module has been already shipped with the kernel (thanks to novell) >> >> I had no problems mounting the FS on the CentOS 5 server. >> I can read/write in the FS without problems. >> >> ===========>> [root at arenamexico ~]# mounted.ocfs2 -d >> Device FS UUID Label >> /dev/sdb1 ocfs2 cffd81fa-7395-433e-912a-d9c9d7235df3 >> Coliseo >> [root at arenamexico ~]# mounted.ocfs2 -f >> Device FS Nodes >> /dev/sdb1 ocfs2 arenamexico >> [root at arenamexico ~]# mount|grep ocfs2 >> ocfs2_dlmfs on /dlm type ocfs2_dlmfs (rw) >> /dev/sdb1 on /mnt/projects type ocfs2 >> (rw,_netdev,noatime,heartbeat=local) >> ===========>> >> But when I try to start the o2cb service on the Opensuse servers, the >> servers just hang. >> >> ===========>> drwagner:~ # /etc/init.d/o2cb status >> Module "configfs": Loaded >> Filesystem "configfs": Mounted >> Module "ocfs2_nodemanager": Loaded >> Module "ocfs2_dlm": Loaded >> Module "ocfs2_dlmfs": Loaded >> Filesystem "ocfs2_dlmfs": Mounted >> >> drwagner:~ # /etc/init.d/o2cb online ocfs2 >> Loading module "configfs": OK >> Mounting configfs filesystem at /sys/kernel/config: OK >> Loading module "ocfs2_nodemanager": OK >> Loading module "ocfs2_dlm": OK >> Loading module "ocfs2_dlmfs": OK >> Mounting ocfs2_dlmfs filesystem at /dlm: OK >> Starting O2CB cluster ocfs2: >> >> [And it hangs] >> ============>> >> Does anyone has a clue of what is happening? >> >> Thanks. >> >> >> _______________________________________________ >> Ocfs2-users mailing list >> Ocfs2-users at oss.oracle.com >> http://oss.oracle.com/mailman/listinfo/ocfs2-users >> >-------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: cluster.conf Url: http://oss.oracle.com/pipermail/ocfs2-users/attachments/20080401/b7fef31e/attachment-0001.pl
So that looks ok. Next step would be to do some strac-ing. Do: /etc/init.d/o2cb load then do: strace -ff -o /tmp/out o2cb_ctl -H -n ocfs2 -t cluster -a online=yes The last command is what "onlines" the cluster. /tmp/out should have some information. Miguel Santos N. wrote:> Thanks.. > > I am attaching the file /etc/ocfs/cluster.conf, it is the same file on > all the servers. > > > > Sunil Mushran escribi?: >> Check /etc/ocfs2/cluster.conf on the opensuse cluster. Starting cluster >> includes reading cluster.conf and populating /sys/kernel/config with >> the info. Make that file available as-is... as in, not a cut-paste. >> >> Miguel Santos wrote: >>> Hello, >>> >>> I have a DELL MD3000i, and a couple of servers that I want to >>> connect to >>> the array. >>> >>> I have setup the main server with CentOS 5. >>> - 2.6.18-53.el5 x86_64 >>> - ocfs2-tools-1.2.7-1.el5 >>> - ocfs2console-1.2.7-1.el5 >>> - ocfs2-2.6.18-53.el5-1.2.8-2.el5 # Kernel Module >>> >>> And two OpenSuSe 10.3 Servers >>> - 2.6.22.5-31-default x86_64 >>> - ocfs2-tools-1.2.6-18 >>> - ocfs2console-1.2.6-18 >>> The module has been already shipped with the kernel (thanks to novell) >>> >>> I had no problems mounting the FS on the CentOS 5 server. >>> I can read/write in the FS without problems. >>> >>> ===========>>> [root at arenamexico ~]# mounted.ocfs2 -d >>> Device FS UUID >>> Label >>> /dev/sdb1 ocfs2 cffd81fa-7395-433e-912a-d9c9d7235df3 >>> Coliseo >>> [root at arenamexico ~]# mounted.ocfs2 -f >>> Device FS Nodes >>> /dev/sdb1 ocfs2 arenamexico >>> [root at arenamexico ~]# mount|grep ocfs2 >>> ocfs2_dlmfs on /dlm type ocfs2_dlmfs (rw) >>> /dev/sdb1 on /mnt/projects type ocfs2 >>> (rw,_netdev,noatime,heartbeat=local) >>> ===========>>> >>> But when I try to start the o2cb service on the Opensuse servers, the >>> servers just hang. >>> >>> ===========>>> drwagner:~ # /etc/init.d/o2cb status >>> Module "configfs": Loaded >>> Filesystem "configfs": Mounted >>> Module "ocfs2_nodemanager": Loaded >>> Module "ocfs2_dlm": Loaded >>> Module "ocfs2_dlmfs": Loaded >>> Filesystem "ocfs2_dlmfs": Mounted >>> >>> drwagner:~ # /etc/init.d/o2cb online ocfs2 >>> Loading module "configfs": OK >>> Mounting configfs filesystem at /sys/kernel/config: OK >>> Loading module "ocfs2_nodemanager": OK >>> Loading module "ocfs2_dlm": OK >>> Loading module "ocfs2_dlmfs": OK >>> Mounting ocfs2_dlmfs filesystem at /dlm: OK >>> Starting O2CB cluster ocfs2: >>> >>> [And it hangs] >>> ============>>> >>> Does anyone has a clue of what is happening? >>> >>> Thanks. >>> >>> >>> _______________________________________________ >>> Ocfs2-users mailing list >>> Ocfs2-users at oss.oracle.com >>> http://oss.oracle.com/mailman/listinfo/ocfs2-users >>> >> > > ------------------------------------------------------------------------ > > node: > ip_port = 7777 > ip_address = 192.168.130.1 > number = 0 > name = arenamexico > cluster = ocfs2 > > node: > ip_port = 7777 > ip_address = 192.168.130.2 > number = 1 > name = suse1 > cluster = ocfs2 > > node: > ip_port = 7777 > ip_address = 192.168.130.3 > number = 2 > name = drwagner > cluster = ocfs2 > > cluster: > node_count = 3 > name = ocfs2 > >
> Hello, > > I have a DELL MD3000i, and a couple of servers that I want to connect to > the array. > > I have setup the main server with CentOS 5. > - 2.6.18-53.el5 x86_64 > - ocfs2-tools-1.2.7-1.el5 > - ocfs2console-1.2.7-1.el5 > - ocfs2-2.6.18-53.el5-1.2.8-2.el5 # Kernel Module > > And two OpenSuSe 10.3 Servers > - 2.6.22.5-31-default x86_64 > - ocfs2-tools-1.2.6-18 > - ocfs2console-1.2.6-18 > The module has been already shipped with the kernel (thanks to novell)Hmm, as far as I remember I also had some problems with older OpenSUSE 10.3 kernels and OCFS2 (resulting in hangs). Can you try with the latest kernel from OpenSUSE 10.3? Also I'm actually not sure whether 2.6.18 from RH and 2.6.22 from OpenSUSE will play well together - there were some incompatible network protocol changes but I'm not completely sure when exactly did they take place and what RH took into their kernel. Honza