sundar mahadevan
2009-Jun-09  16:26 UTC
[Ocfs-users] question about oracle shared home install
Hi All, Scenario: I'm trying to install 9i rac on a 2 node cluster on OCFS2 OS: Oracle enterprise linux To my understanding, OCFS2 supports shared home installs which to my knowledge is not only can i have datafile and control files but also clustermanager files and binaries (pretty much everything: no files or executables need to kept local to any nodes). I have one single shared file for $ORACLE_HOME/oracm/admin/cmcfg.ora for both the nodes. cat cmcfg.ora ClusterName=Oracle Cluster Manager, version 9i MissCount=620 PrivateNodeNames=sunny1prv sunny2prv PublicNodeNames=sunny1pub sunny2pub ServicePort=9998 CmDiskFile=/u01/oradata/orcl/cmquorumfile HostName=sunny1pub KernelModuleName=hangcheck-timer I started cluster manager on node1 with the value of HostName=sunny1pub Now to start the cluster manager on node2, the value of HostName should be sunny2pub. What do i do now? Any help is greatly appreciated. Thanks in advance.
Sunil Mushran
2009-Jun-09  18:59 UTC
[Ocfs-users] question about oracle shared home install
Yes, on shared oracle_home, but NO on crs_home. The crs_home needs to be local. However, one can create the votingdisk on ocfs2. This mailing list is for ocfs2 only. For other products, ping Oracle support directly or via the forums on otn. Sunil sundar mahadevan wrote:> Hi All, > > Scenario: I'm trying to install 9i rac on a 2 node cluster on OCFS2 > OS: Oracle enterprise linux > > To my understanding, OCFS2 supports shared home installs which to my > knowledge is not only can i have datafile and control files but also > clustermanager files and binaries (pretty much everything: no files or > executables need to kept local to any nodes). I have one single shared > file for $ORACLE_HOME/oracm/admin/cmcfg.ora for both the nodes. > > cat cmcfg.ora > ClusterName=Oracle Cluster Manager, version 9i > MissCount=620 > PrivateNodeNames=sunny1prv sunny2prv > PublicNodeNames=sunny1pub sunny2pub > ServicePort=9998 > CmDiskFile=/u01/oradata/orcl/cmquorumfile > HostName=sunny1pub > KernelModuleName=hangcheck-timer > > I started cluster manager on node1 with the value of > HostName=sunny1pub Now to start the cluster manager on node2, the > value of HostName should be sunny2pub. What do i do now? Any help is > greatly appreciated. Thanks in advance. > > _______________________________________________ > Ocfs-users mailing list > Ocfs-users at oss.oracle.com > http://oss.oracle.com/mailman/listinfo/ocfs-users >
Hi Sunil We are using OCFS2 in Suse 10 with 2 nodes RAC config. I always see some error messages in "messages" file at os level as below : Jun 19 15:32:43 SPRI-DATABASESERVER1 kernel: (0,0):o2net_idle_timer:1422 here are some times that might help debug the situation: (tmr 1245400353.59415 now 1245400363.57026 dr 1245400353.59405 adv 1245400353.59420:1245400353.59420 func (bcc65fc6:504) 1245400350.591109:1245400350.591114) Jun 19 15:32:43 SPRI-DATABASESERVER1 kernel: o2net: no longer connected to node SPRI-DATABASESERVER2 (num 1) at 192.168.1.2:7777 Jun 19 15:32:45 SPRI-DATABASESERVER1 kernel: (8911,0):dlm_send_proxy_ast_msg:457 ERROR: status = -107 Jun 19 15:32:45 SPRI-DATABASESERVER1 kernel: (8911,0):dlm_flush_asts:584 ERROR: status = -107 Jun 19 15:32:53 SPRI-DATABASESERVER1 kernel: (5939,0):o2net_connect_expired:1583 ERROR: no connection established with node 1 after 10.0 seconds, giving up and returning errors. Jun 19 15:33:03 SPRI-DATABASESERVER1 kernel: (5939,0):o2net_connect_expired:1583 ERROR: no connection established with node 1 after 10.0 seconds, giving up and returning errors. Jun 19 15:33:13 SPRI-DATABASESERVER1 kernel: (5939,0):o2net_connect_expired:1583 ERROR: no connection established with node 1 after 10.0 seconds, giving up and returning errors. Do you have any idea what is going on in my system? Please advice Thanks Jeram
Hard to say. The errors suggest a connection snap. It could be because the other node died, or the cable was snapped, or you are running into a bug that was fixed long ago. In either case, you should file an issue with Novell. They will go over your system and suggest upgrades if necessary. Jeram wrote:> Hi Sunil > > We are using OCFS2 in Suse 10 with 2 nodes RAC config. I always see some > error messages in "messages" file at os level as below : > > Jun 19 15:32:43 SPRI-DATABASESERVER1 kernel: (0,0):o2net_idle_timer:1422 > here are some times that might help debug the situation: (tmr > 1245400353.59415 now 1245400363.57026 dr 1245400353.59405 adv > 1245400353.59420:1245400353.59420 func (bcc65fc6:504) > 1245400350.591109:1245400350.591114) > Jun 19 15:32:43 SPRI-DATABASESERVER1 kernel: o2net: no longer connected > to node SPRI-DATABASESERVER2 (num 1) at 192.168.1.2:7777 > Jun 19 15:32:45 SPRI-DATABASESERVER1 kernel: > (8911,0):dlm_send_proxy_ast_msg:457 ERROR: status = -107 > Jun 19 15:32:45 SPRI-DATABASESERVER1 kernel: (8911,0):dlm_flush_asts:584 > ERROR: status = -107 > Jun 19 15:32:53 SPRI-DATABASESERVER1 kernel: > (5939,0):o2net_connect_expired:1583 ERROR: no connection established > with node 1 after 10.0 seconds, giving up and returning errors. > Jun 19 15:33:03 SPRI-DATABASESERVER1 kernel: > (5939,0):o2net_connect_expired:1583 ERROR: no connection established > with node 1 after 10.0 seconds, giving up and returning errors. > Jun 19 15:33:13 SPRI-DATABASESERVER1 kernel: > (5939,0):o2net_connect_expired:1583 ERROR: no connection established > with node 1 after 10.0 seconds, giving up and returning errors. > > Do you have any idea what is going on in my system? > > Please advice > > Thanks > Jeram >
Hi Sunil Thanks for you prompt reply. Can you give more guideline for the bugs mentioned? may be i can try that option. Thanks Jeram ________________________________ From: Sunil Mushran [mailto:sunil.mushran at oracle.com] Sent: Fri 6/19/2009 11:21 PM To: Jeram Cc: ocfs-users at oss.oracle.com Subject: Re: OCFS2 Error Hard to say. The errors suggest a connection snap. It could be because the other node died, or the cable was snapped, or you are running into a bug that was fixed long ago. In either case, you should file an issue with Novell. They will go over your system and suggest upgrades if necessary. Jeram wrote:> Hi Sunil > > We are using OCFS2 in Suse 10 with 2 nodes RAC config. I always see some > error messages in "messages" file at os level as below : > > Jun 19 15:32:43 SPRI-DATABASESERVER1 kernel: (0,0):o2net_idle_timer:1422 > here are some times that might help debug the situation: (tmr > 1245400353.59415 now 1245400363.57026 dr 1245400353.59405 adv > 1245400353.59420:1245400353.59420 func (bcc65fc6:504) > 1245400350.591109:1245400350.591114) > Jun 19 15:32:43 SPRI-DATABASESERVER1 kernel: o2net: no longer connected > to node SPRI-DATABASESERVER2 (num 1) at 192.168.1.2:7777 > Jun 19 15:32:45 SPRI-DATABASESERVER1 kernel: > (8911,0):dlm_send_proxy_ast_msg:457 ERROR: status = -107 > Jun 19 15:32:45 SPRI-DATABASESERVER1 kernel: (8911,0):dlm_flush_asts:584 > ERROR: status = -107 > Jun 19 15:32:53 SPRI-DATABASESERVER1 kernel: > (5939,0):o2net_connect_expired:1583 ERROR: no connection established > with node 1 after 10.0 seconds, giving up and returning errors. > Jun 19 15:33:03 SPRI-DATABASESERVER1 kernel: > (5939,0):o2net_connect_expired:1583 ERROR: no connection established > with node 1 after 10.0 seconds, giving up and returning errors. > Jun 19 15:33:13 SPRI-DATABASESERVER1 kernel: > (5939,0):o2net_connect_expired:1583 ERROR: no connection established > with node 1 after 10.0 seconds, giving up and returning errors. > > Do you have any idea what is going on in my system? > > Please advice > > Thanks > Jeram >-------------- next part -------------- An HTML attachment was scrubbed... URL: http://oss.oracle.com/pipermail/ocfs-users/attachments/20090621/1fe1c786/attachment.html
Check the news section in the ocfs2 page. In it we list the changelog and the bugs fixed in that release. On Jun 20, 2009, at 8:24 PM, Jeram <jeram at JISEDU.OR.ID> wrote:> Hi Sunil > Thanks for you prompt reply. > > Can you give more guideline for the bugs mentioned? may be i can try > that option. > > Thanks > Jeram > > From: Sunil Mushran [mailto:sunil.mushran at oracle.com] > Sent: Fri 6/19/2009 11:21 PM > To: Jeram > Cc: ocfs-users at oss.oracle.com > Subject: Re: OCFS2 Error > > Hard to say. The errors suggest a connection snap. It could be > because the other node died, or the cable was snapped, or you > are running into a bug that was fixed long ago. > > In either case, you should file an issue with Novell. They will go > over your system and suggest upgrades if necessary. > > Jeram wrote: > > Hi Sunil > > > > We are using OCFS2 in Suse 10 with 2 nodes RAC config. I always > see some > > error messages in "messages" file at os level as below : > > > > Jun 19 15:32:43 SPRI-DATABASESERVER1 kernel: > (0,0):o2net_idle_timer:1422 > > here are some times that might help debug the situation: (tmr > > 1245400353.59415 now 1245400363.57026 dr 1245400353.59405 adv > > 1245400353.59420:1245400353.59420 func (bcc65fc6:504) > > 1245400350.591109:1245400350.591114) > > Jun 19 15:32:43 SPRI-DATABASESERVER1 kernel: o2net: no longer > connected > > to node SPRI-DATABASESERVER2 (num 1) at 192.168.1.2:7777 > > Jun 19 15:32:45 SPRI-DATABASESERVER1 kernel: > > (8911,0):dlm_send_proxy_ast_msg:457 ERROR: status = -107 > > Jun 19 15:32:45 SPRI-DATABASESERVER1 kernel: > (8911,0):dlm_flush_asts:584 > > ERROR: status = -107 > > Jun 19 15:32:53 SPRI-DATABASESERVER1 kernel: > > (5939,0):o2net_connect_expired:1583 ERROR: no connection established > > with node 1 after 10.0 seconds, giving up and returning errors. > > Jun 19 15:33:03 SPRI-DATABASESERVER1 kernel: > > (5939,0):o2net_connect_expired:1583 ERROR: no connection established > > with node 1 after 10.0 seconds, giving up and returning errors. > > Jun 19 15:33:13 SPRI-DATABASESERVER1 kernel: > > (5939,0):o2net_connect_expired:1583 ERROR: no connection established > > with node 1 after 10.0 seconds, giving up and returning errors. > > > > Do you have any idea what is going on in my system? > > > > Please advice > > > > Thanks > > Jeram > > >-------------- next part -------------- An HTML attachment was scrubbed... URL: http://oss.oracle.com/pipermail/ocfs-users/attachments/20090620/8b97a818/attachment.html
Hi Sunil Thanks a lot for the info. Rgds/Jeram ________________________________ From: Sunil Mushran [mailto:sunil.mushran at oracle.com] Sent: Sun 6/21/2009 12:58 PM To: Jeram Cc: <ocfs-users at oss.oracle.com> Subject: Re: OCFS2 Error Check the news section in the ocfs2 page. In it we list the changelog and the bugs fixed in that release. On Jun 20, 2009, at 8:24 PM, Jeram <jeram at JISEDU.OR.ID> wrote: Hi Sunil Thanks for you prompt reply. Can you give more guideline for the bugs mentioned? may be i can try that option. Thanks Jeram ________________________________ From: Sunil Mushran [mailto:sunil.mushran at oracle.com] Sent: Fri 6/19/2009 11:21 PM To: Jeram Cc: <mailto:ocfs-users at oss.oracle.com> ocfs-users at oss.oracle.com Subject: Re: OCFS2 Error Hard to say. The errors suggest a connection snap. It could be because the other node died, or the cable was snapped, or you are running into a bug that was fixed long ago. In either case, you should file an issue with Novell. They will go over your system and suggest upgrades if necessary. Jeram wrote: > Hi Sunil > > We are using OCFS2 in Suse 10 with 2 nodes RAC config. I always see some > error messages in "messages" file at os level as below : > > Jun 19 15:32:43 SPRI-DATABASESERVER1 kernel: (0,0):o2net_idle_timer:1422 > here are some times that might help debug the situation: (tmr > 1245400353.59415 now 1245400363.57026 dr 1245400353.59405 adv > 1245400353.59420:1245400353.59420 func (bcc65fc6:504) > 1245400350.591109:1245400350.591114) > Jun 19 15:32:43 SPRI-DATABASESERVER1 kernel: o2net: no longer connected > to node SPRI-DATABASESERVER2 (num 1) at 192.168.1.2:7777 > Jun 19 15:32:45 SPRI-DATABASESERVER1 kernel: > (8911,0):dlm_send_proxy_ast_msg:457 ERROR: status = -107 > Jun 19 15:32:45 SPRI-DATABASESERVER1 kernel: (8911,0):dlm_flush_asts:584 > ERROR: status = -107 > Jun 19 15:32:53 SPRI-DATABASESERVER1 kernel: > (5939,0):o2net_connect_expired:1583 ERROR: no connection established > with node 1 after 10.0 seconds, giving up and returning errors. > Jun 19 15:33:03 SPRI-DATABASESERVER1 kernel: > (5939,0):o2net_connect_expired:1583 ERROR: no connection established > with node 1 after 10.0 seconds, giving up and returning errors. > Jun 19 15:33:13 SPRI-DATABASESERVER1 kernel: > (5939,0):o2net_connect_expired:1583 ERROR: no connection established > with node 1 after 10.0 seconds, giving up and returning errors. > > Do you have any idea what is going on in my system? > > Please advice > > Thanks > Jeram > -------------- next part -------------- An HTML attachment was scrubbed... URL: http://oss.oracle.com/pipermail/ocfs-users/attachments/20090621/52495b88/attachment.html