Ocfs2 users - Apr 2008 - Weird lock

Hi,

We are having a problem with apache+perl processes hanging.

 1626 ?        Ss     0:00 sendmail: rejecting connections on daemon MTA: load average: 152
 1634 ?        Ss     0:00 sendmail: Queue runner at 01:00:00 for /var/spool/clientmqueue
 1741 ?        Ss     0:00 /usr/sbin/httpd
 1744 ?        S      0:00  \_ /usr/local/sbin/cronolog /site/logssite/access_log.%Y%m%d
21377 ?        S      0:00  \_ /usr/sbin/httpd
23942 ?        D      0:00  |   \_ /usr/bin/perl -w /storage/webhosting/site/public_html/alojados/site/MT/come2x.cgi
21518 ?        S      0:00  \_ /usr/sbin/httpd
23987 ?        D      0:00  |   \_ /usr/bin/perl -w /storage/webhosting/site/public_html/alojados/site/MT/come2x.cgi
21552 ?        S      0:00  \_ /usr/sbin/httpd
23873 ?        D      0:00  |   \_ /usr/bin/perl -w /storage/webhosting/site/public_html/alojados/site/MT/come2x.cgi
21563 ?        S      0:00  \_ /usr/sbin/httpd
23948 ?        D      0:00  |   \_ /usr/bin/perl -w /storage/webhosting/site/public_html/alojados/site/MT/come2x.cgi
21590 ?        S      0:00  \_ /usr/sbin/httpd
23866 ?        R     39:21  |   \_ /usr/bin/perl -w /storage/webhosting/site/public_html/alojados/site/MT/come2x.cgi
21596 ?        S      0:00  \_ /usr/sbin/httpd
23929 ?        D      0:00  |   \_ /usr/bin/perl -w /storage/webhosting/site/public_html/alojados/site/MT/come2x.cgi

Process 23866 keeps running while all the other perl processes freeze in
state D. Running strace on them also blocks. I am attaching the
locking_state data. Dmesg:

-----
OCFS2 Node Manager 1.2.5 Tue Apr 10 12:29:33 EDT 2007 (build 9e5f332181e8ebfad464946bcc4888af)
OCFS2 DLM 1.2.5 Tue Apr 10 12:29:33 EDT 2007 (build e2556a71429f31033b275dff4b5594aa)
OCFS2 DLMFS 1.2.5 Tue Apr 10 12:29:33 EDT 2007 (build e2556a71429f31033b275dff4b5594aa)
OCFS2 User DLM kernel interface loaded
o2net: accepted connection from node ws3 (num 19) at 172.16.42.3:7777
o2net: connected to node ws1 (num 0) at 172.16.42.1:7777
o2net: connected to node ws2 (num 1) at 172.16.42.2:7777
OCFS2 1.2.5 Tue Apr 10 12:29:28 EDT 2007 (build 0f745576f5282c9408787369d99ba880)
ocfs2_dlm: Nodes in domain ("C1B50B9082BC4B74A13FF6F34D35B68B"): 0 1 12 19
kjournald starting.  Commit interval 5 seconds
ocfs2: Mounting device (3,3) on (node 12, slot 2)
-----

These lockouts keep happening from time to time (about 3 times a week).
Today it has already happened twice.
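
To spot the next lockout quickly, the frozen processes in a listing like the
one above could be picked out with a short script. This is only a minimal
sketch: the function name `uninterruptible_pids` and the regular expression
are hypothetical, not part of any OCFS2 tooling, and it only looks at the
STAT column of `ps ax`-style text that is passed to it.

```python
import re

# Leading columns of `ps ax`-style output: PID, TTY, STAT, TIME.
# (Hypothetical pattern, written for the listing format shown in this mail.)
PS_LINE = re.compile(r"^\s*(\d+)\s+(\S+)\s+([A-Za-z<+]+)\s+(\d+:\d+)")


def uninterruptible_pids(ps_output):
    """Return the PIDs whose STAT column starts with 'D' (uninterruptible sleep)."""
    pids = []
    for line in ps_output.splitlines():
        match = PS_LINE.match(line)
        if match and match.group(3).startswith("D"):
            pids.append(int(match.group(1)))
    return pids


# Four lines taken from the listing above.
sample = """\
21590 ?        S      0:00  \\_ /usr/sbin/httpd
23866 ?        R     39:21  |   \\_ /usr/bin/perl -w /storage/webhosting/site/public_html/alojados/site/MT/come2x.cgi
21596 ?        S      0:00  \\_ /usr/sbin/httpd
23929 ?        D      0:00  |   \\_ /usr/bin/perl -w /storage/webhosting/site/public_html/alojados/site/MT/come2x.cgi
"""

print(uninterruptible_pids(sample))  # prints [23929]
```

The running process (23866, state R) is deliberately not reported; only the
blocked ones are.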

Thanks for any info
Nuno Fernandes
-------------- next part --------------
A non-text attachment was scrubbed...
Name: locking_state.bz2
Type: application/x-bzip2
Size: 162483 bytes
Desc: not available
Url : oss.oracle.com/pipermail/ocfs2-users/attachments/20080404/0e2f6483/attachment-0001.bz2
