srinivas.eeda at oracle.com
2011-Jul-28 10:09 UTC
[Ocfs2-users] ocfs2, redhat 2.6.18-238.x kernel and async i/o hang
On 07/28/2011 09:20 AM, Matthew Huff wrote:
> I migrated an Oracle 11gR2 database from one HP RHEL5 box to another. The only difference I can tell is that the older box was running the 2.6.18-194 kernel, and the newer box is running a 2.6.18-238 kernel (I've tried 2.6.18-238.el5, 238.5.1.el5 and 238.19.1.el5). On the new box, any Oracle instance hangs on startup while mounting the files if async I/O is turned on. Direct I/O works fine. The files are all on ocfs2 partitions.
>
> Anyone seen this issue? The databases run fine if the spfile has:

We came to know about this recently and are looking into it. Until it is resolved, please use the older working kernel.

> *.disk_asynch_io=FALSE
> *.filesystemio_options='directio'
>
> But hang 100% of the time (even after a few hours of waiting) if the parameters are set to:
> *.disk_asynch_io=TRUE
> *.filesystemio_options='setall'
>
> Other than having the libaio rpm loaded, I don't think there are any configuration or knobs I need to turn.
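Since the advice above is to stay on the older kernel until a fix is available, a quick check of which kernel series a box is running may help. The following is only a minimal sketch: the function names `kernel_series` and `in_reported_hang_series` are hypothetical, and the assumption that the whole 2.6.18-238 series is affected rests solely on the releases reported in this thread (238.el5, 238.5.1.el5, 238.19.1.el5), not on a confirmed list.

```shell
#!/bin/sh
# Hypothetical helper: extract the build series from a RHEL5 kernel release
# string, e.g. "2.6.18-238.19.1.el5" -> "238".
kernel_series() {
    build="${1#*-}"        # drop everything through the first "-": "238.19.1.el5"
    echo "${build%%.*}"    # keep everything before the first ".": "238"
}

# Succeeds (exit status 0) when the release is a 2.6.18-238.x kernel, the
# series reported in this thread to hang with async I/O on ocfs2.
# Assumption: every 238.x build behaves the same; the thread only tested three.
in_reported_hang_series() {
    case "$1" in
        2.6.18-*) [ "$(kernel_series "$1")" = "238" ] ;;
        *) return 1 ;;
    esac
}

# Check the running kernel.
if in_reported_hang_series "$(uname -r)"; then
    echo "running kernel $(uname -r) is in the 2.6.18-238 series"
fi
```

On the older box from the thread (2.6.18-194), this would print nothing.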
Matthew Huff
2011-Jul-28 16:20 UTC
[Ocfs2-users] ocfs2, redhat 2.6.18-238.x kernel and async i/o hang
I migrated an Oracle 11gR2 database from one HP RHEL5 box to another. The only difference I can tell is that the older box was running the 2.6.18-194 kernel, and the newer box is running a 2.6.18-238 kernel (I've tried 2.6.18-238.el5, 238.5.1.el5 and 238.19.1.el5). On the new box, any Oracle instance hangs on startup while mounting the files if async I/O is turned on. Direct I/O works fine. The files are all on ocfs2 partitions.

Anyone seen this issue? The databases run fine if the spfile has:

*.disk_asynch_io=FALSE
*.filesystemio_options='directio'

But hang 100% of the time (even after a few hours of waiting) if the parameters are set to:

*.disk_asynch_io=TRUE
*.filesystemio_options='setall'

Other than having the libaio rpm loaded, I don't think there are any configuration or knobs I need to turn.

Here are some details:

[oracle@hubble lib64]$ ldd $ORACLE_HOME/bin/oracle
linux-vdso.so.1 => (0x00007fff9b7fc000)
libodm11.so => /opt/oracle/product/11.2.0/dbhome_1/lib/libodm11.so (0x00002b487959a000)
libcell11.so => /opt/oracle/product/11.2.0/dbhome_1/lib/libcell11.so (0x00002b487969c000)
libskgxp11.so => /opt/oracle/product/11.2.0/dbhome_1/lib/libskgxp11.so (0x00002b4879800000)
librt.so.1 => /lib64/librt.so.1 (0x00000032fa600000)
libnnz11.so => /opt/oracle/product/11.2.0/dbhome_1/lib/libnnz11.so (0x00002b48799ea000)
libclsra11.so => /opt/oracle/product/11.2.0/dbhome_1/lib/libclsra11.so (0x00002b4879db7000)
libdbcfg11.so => /opt/oracle/product/11.2.0/dbhome_1/lib/libdbcfg11.so (0x00002b4879ecf000)
libhasgen11.so => /opt/oracle/product/11.2.0/dbhome_1/lib/libhasgen11.so (0x00002b4879ff2000)
libskgxn2.so => /opt/oracle/product/11.2.0/dbhome_1/lib/libskgxn2.so (0x00002b487a7c6000)
libocr11.so => /opt/oracle/product/11.2.0/dbhome_1/lib/libocr11.so (0x00002b487a8c8000)
libocrb11.so => /opt/oracle/product/11.2.0/dbhome_1/lib/libocrb11.so (0x00002b487aa6e000)
libocrutl11.so => /opt/oracle/product/11.2.0/dbhome_1/lib/libocrutl11.so (0x00002b487ac0e000)
libaio.so.1 => /usr/lib64/libaio.so.1 (0x00002b487ad19000)
libdl.so.2 => /lib64/libdl.so.2 (0x00000032f9a00000)
libm.so.6 => /lib64/libm.so.6 (0x00000032f9600000)
libpthread.so.0 => /lib64/libpthread.so.0 (0x00000032f9e00000)
libnsl.so.1 => /lib64/libnsl.so.1 (0x00000032fc600000)
libc.so.6 => /lib64/libc.so.6 (0x00000032f9200000)
/lib64/ld-linux-x86-64.so.2 (0x00000032f8e00000)

[oracle@hubble lib64]$ rpm -qa | grep aio
libaio-0.3.106-5
libaio-devel-0.3.106-5
libsane-hpaio-1.6.7-6.el5_6.1
libaio-0.3.106-5
libaio-devel-0.3.106-5

[oracle@hubble dbs]$ cd /usr/lib64
[oracle@hubble lib64]$ ls -lF *aio*
-rw-r--r-- 1 root root 15870 Dec 21 2009 libaio.a
lrwxrwxrwx 1 root root 15 Jul 23 15:21 libaio.so -> libaio.so.1.0.1*
lrwxrwxrwx 1 root root 15 Jul 22 21:59 libaio.so.1 -> libaio.so.1.0.1*
-rwxr-xr-x 1 root root 3768 Dec 21 2009 libaio.so.1.0.0*
-rwxr-xr-x 1 root root 3768 Dec 21 2009 libaio.so.1.0.1*
-rw-r--r-- 1 root root 92866 Apr 8 09:14 librtkaio.a

[root@hubble ~]# grep ocfs2 /etc/fstab
LABEL=oracle_logfiles /u01 ocfs2 _netdev,datavolume,nointr
LABEL=oracle_datafiles /u02 ocfs2 _netdev,datavolume,nointr

[root@hubble ~]# grep ocfs2 /etc/mtab
ocfs2_dlmfs /dlm ocfs2_dlmfs rw 0 0
/dev/cciss/c0d1 /u01 ocfs2 rw,_netdev,datavolume,nointr,heartbeat=local 0 0
/dev/cciss/c0d2 /u02 ocfs2 rw,_netdev,datavolume,nointr,heartbeat=local 0 0

[oracle@hubble lib64]$ rpm -qa | grep ocfs
ocfs2console-1.4.4-1.el5
ocfs2-2.6.18-238.19.1.el5-1.4.7-1.el5
ocfs2-tools-1.4.4-1.el5

[root@hubble ~]# uname -a
Linux hubble.ox.com 2.6.18-238.19.1.el5 #1 SMP Sun Jul 10 08:43:41 EDT 2011 x86_64 x86_64 x86_64 GNU/Linux

----
Matthew Huff           | 1 Manhattanville Rd
Director of Operations | Purchase, NY 10577
OTA Management LLC     | Phone: 914-460-4039
aim: matthewbhuff      | Fax:   914-460-4139
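The `rpm` and `uname` output above shows that the ocfs2 kernel-module package name embeds the kernel release it was built for (`ocfs2-<kernel release>-<module version>`). A mismatch between the two is one thing worth ruling out after a kernel change. The following is a minimal sketch of that comparison; the function name `ocfs2_pkg_matches_kernel` is hypothetical, and the check only confirms that the names line up. It does not diagnose the async I/O hang itself.

```shell
#!/bin/sh
# Hypothetical helper: succeed when an ocfs2 kernel-module package name
# embeds the given kernel release, i.e. starts with "ocfs2-<release>-".
#   $1 = package name as printed by `rpm -qa`
#   $2 = kernel release as printed by `uname -r`
ocfs2_pkg_matches_kernel() {
    case "$1" in
        "ocfs2-$2-"*) return 0 ;;
        *) return 1 ;;
    esac
}

# Values copied from the output posted above.
if ocfs2_pkg_matches_kernel "ocfs2-2.6.18-238.19.1.el5-1.4.7-1.el5" "2.6.18-238.19.1.el5"; then
    echo "module package matches kernel"
fi
```

On a live system the two arguments would come from `rpm -qa | grep ocfs` and `uname -r`. For the values posted here the names match, so a stale module package does not appear to be the cause.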