Hi,
  I sent an email to Mark Fasheh of Oracle Corp. & posted this issue on OTN
under the Linux thread this morning. I hope that someone among you might have
experienced this and can help. On that basis, I am sending this to you too. I am
stuck & would really appreciate it if you could shed some light on this.
  Thanks.
  Anjan
 
***********************************************************************************************************
  I have a 2-node CentOS 4.5 x86_64 system (kernel 2.6.9-55.EL). On this I
installed Oracle OCFS2 1.2.7-1 (matching the kernel exactly). After this I
installed Oracle CRS 10.2.0.1, and that installation went fine. Then I tried to
install Oracle RDBMS 10.2.0.1, and all the problems started from there. The
/var/log/messages file filled up with messages (quoting only a few to avoid
confusion):
ocfs2_read_locked_inode: .. : ERROR: Invalid dinode #0 signature
ocfs2_lookup: .. : ERROR: Unable to create inode ....
Then OUI gave several error messages, e.g.
.... Invalid stored block length on file ...../em/em.war followed by I/O error
in file
Errors in invoking to files ins_rdbms.mk and ins_ldap.mk
Then /var/log/messages gave:
OCFS2: ERROR (device ....): ocfs2_extend_file: Dinode # ...... has bad signature
O' # I ....
The installation then failed, CRS died, and the machines rebooted.
  I ran fsck.ocfs2 -n /dev/...., and it came back clean.
I have tested this several times & the same thing always happens.
If I use RAW partitions, everything works fine. So the problem may be in the
combination of OCFS2 & the OS/Oracle, but I am not sure how to work around this.
I have to have OCFS2 -- can't use RAW for various reasons.
Can somebody please help me to resolve this?
Thanks. 
 
***********************************************************************************************************
       
Anjan,
   You don't need to share the database binaries, only the CRS and the datafiles.
You can share them to save disk space, but it is not mandatory. The CRS and
datafiles are much less stressful to the filesystem structures, as there is a
small number of large files, although they usually carry a heavy I/O load and
stress the disk subsystem and the locking algorithms.
    So you can have two separate ext3 filesystems mounted at the same path on
each server, and one or more shared OCFS2 filesystems for the CRS and the
database datafiles. The Oracle installer takes care of copying the binaries
between the servers during the installation.
    It might be useful to try a lower version like 1.2.6, as you are using the
latest version available. I am using 1.2.4-2 here with RH 4.0 and kernel
2.6.9-42 and it seems rather stable; I only needed to increase the timeouts.
(But I don't have the Oracle home shared.)
    Also, you might have a hardware problem somewhere on the SAN. And I still
have to check those mount options you sent...
    One detail: I don't know if the CentOS distro includes the OCFS2 module. Are
you using the modules downloaded from the oss.oracle.com site for the equivalent
RH 4.0 kernel, or modules built by CentOS? If you are using CentOS modules, you
might get better results by changing to the Oracle-built modules for the
equivalent RH 4.0 kernel.
Regards,
Luis
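
One way to act on the module question above is to compare the installed OCFS2 kernel-module package against the running kernel release. The following is only a sketch: the function name is hypothetical, and it assumes the Oracle-style package naming ocfs2-<kernel release>-<module version>, which may not hold for modules built by other parties.

```shell
#!/bin/sh
# Hypothetical helper (not part of any OCFS2 tooling).
# Succeeds when a package name has the form ocfs2-<kernel release>-<anything>.
# Assumption: the module package embeds the exact kernel release string.
ocfs2_pkg_matches_kernel() {
    pkg="$1"      # e.g. a line from `rpm -qa`
    kernel="$2"   # e.g. the output of `uname -r`
    case "$pkg" in
        ocfs2-"$kernel"-*) return 0 ;;   # kernel string is matched literally
        *) return 1 ;;
    esac
}

# Possible use on a node (left commented out so the sketch has no side effects):
#   for p in $(rpm -qa | grep '^ocfs2-'); do
#       ocfs2_pkg_matches_kernel "$p" "$(uname -r)" && echo "$p matches running kernel"
#   done
```

Because the kernel string must be followed directly by a hyphen, a package built for an "smp" kernel variant would not be reported as matching a non-smp kernel release, and the reverse also holds.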
Anjan Chakraborty <anjan.chakraborty@yahoo.com> wrote:
  Luis,
  I intend to use CRS/RAC, which needs a cluster file system. How does EXT3
fit into that picture?
  Thanks.
  Anjan
Thanks a lot for the response. Here is what I am doing:
    1. Creating the filesystems:
    mkfs.ocfs2 -b 4K -C 32K -N 4 -L ocrvotcrs /dev/sdb3   -- for CRS
    mkfs.ocfs2 -b 4K -C 32K -N 4 -L orasys /dev/sdb4      -- for RDBMS

    2. Then mounting using /etc/fstab:
    /dev/sdb3     /home/oracle/ocrvotcrs    ocfs2    _netdev,datavolume,nointr 0 0
    /dev/sdb4     /home/oracle/orasys    ocfs2    _netdev 0 0
    If you find anything wrong here, can you please tell me what to do?
    It's a non-production system, so I can experiment with whatever you
suggest and won't hold you responsible for that.
    Thanks.
    Anjan
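
When comparing mount setups like the one above across nodes, it may help to pull just the ocfs2 entries and their options out of each node's fstab. This is a minimal sketch; the function name is hypothetical, and it only lists entries without judging which options are appropriate.

```shell
#!/bin/sh
# Hypothetical helper: print device, mount point and options for every
# ocfs2 entry in an fstab-style file (defaults to /etc/fstab).
list_ocfs2_mounts() {
    fstab="${1:-/etc/fstab}"
    # Skip comment lines; field 3 is the filesystem type, field 4 the options.
    awk '$1 !~ /^#/ && $3 == "ocfs2" { print $1, $2, $4 }' "$fstab"
}

# Possible use (commented out so the sketch has no side effects):
#   list_ocfs2_mounts /etc/fstab
```

Running it on each node and comparing the output by eye is one simple way to spot a volume that is mounted with different options on different servers.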
 
 
 
Luis Freitas <lfreitas34@yahoo.com> wrote:
  Anjan,
  Are you installing the binaries on OCFS2 too? How are you mounting the
filesystem?
  You might want to try using ext3 for the binaries and OCFS2 only for datafiles
and archives, until you get this fixed.
Regards,
Luis
Ocfs2-users mailing  list
Ocfs2-users@oss.oracle.com
http://oss.oracle.com/mailman/listinfo/ocfs2-users
    
On Mon, Nov 26, 2007 at 07:37:06AM -0800, Anjan Chakraborty wrote:
> Hi,
> I sent an email to Mark Fasheh of Oracle Corp. & posted this issue at OTN under
> Linux thread this morning. I hope that someone among you might have experienced
> this and can help. On that basis, I am sending this to you too. I am stuck &
> will really appreciate if you can shed some light on this.

Probably a lot of folks in the US were on vacation this last week. You should
get better traction now since most of us are back :)

> Thanks.
> Anjan
> ***********************************************************************************************************
> I have a 2 node CentOS 4.5 x86_64 system (kernel 2.6.9-55.EL). On this I
> installed Oracle OCFS2 1.2.7-1 (with exact kernel matching). After this I
> installed Oracle CRS 10.2.0.1 and that installation went fine. Then I tried to
> install Oracle RDBMS 10.2.0.1 and all the problems started from there. The
> /var/log/messages file got filled up with messages (giving some to avoid
> confusion):
> ocfs2_read_locked_inode: .. : ERROR: Invalid dinode #0 signature
> ocfs2_lookup: .. : ERROR: Unable to create inode ....

Are there any other types of messages on either node? The "Invalid dinode"
message is very generic unfortunately, so typically we're looking for something
before that to indicate a root cause.

> Then OUI gave several error messages, e.g.
> .... Invalid stored block length on file ...../em/em.war followed by I/O error
> in file
> Errors in invoking to files ins_rdbms.mk and ins_ldap.mk
>
> Then /var/log/messages gave:
> OCFS2: ERROR (device ....): ocfs2_extend_file: Dinode # ...... has bad
> signature O' # I ....
> And the installation failed & CRS died. And the machines reboot.
> I ran fsck.ocfs2 -n /dev/...., it came clean.
> I have tested this several times & always same thing happening.
> If I use RAW partitions, everything works fine. So, the problem may be in the
> OCFS2 & OS/Oracle -- but, not sure how to bypass this.
> I have to have OCFS2 -- can't use RAW for various reasons.
> Can somebody please help me to resolve this?

Can you describe your shared disk setup? Also, send me your cluster.conf files
from all nodes.

Considering it's a fresh file system and you've only just started putting files
on it, my initial reaction is to check the shared disk. It could be that blocks
are somehow being cached so the file system is getting stale or invalid meta
data.

I think Luis suggested trying an older version of Ocfs2. Feel free to do that,
it could only add a potentially useful data point. You really don't have to jump
far back though - just try 1.2.6 for starters.

Thanks,
	--Mark

--
Mark Fasheh
Senior Software Developer, Oracle
mark.fasheh@oracle.com
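
To look for messages that precede the "Invalid dinode" errors, as asked above, one option is to filter each node's log for cluster-related lines while keeping their line numbers. This is a rough sketch only: the function name is hypothetical, and the keyword list (ocfs2, o2hb, o2net, o2cb, dlm) is an assumption about which prefixes are relevant, not a complete list.

```shell
#!/bin/sh
# Hypothetical helper: print log lines that mention OCFS2-related components,
# prefixed with their line number in the file (defaults to /var/log/messages).
ocfs2_log_lines() {
    logfile="${1:-/var/log/messages}"
    # -n keeps line numbers, -i ignores case, -E enables alternation.
    grep -n -i -E 'ocfs2|o2hb|o2net|o2cb|dlm' "$logfile"
}

# Possible use on each node (commented out so the sketch has no side effects):
#   ocfs2_log_lines /var/log/messages | less
```

The line numbers make it easier to go back to the full log and read what was logged just before the first error on each node.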