Thompson,John
2006-Apr-10  20:34 UTC
[Ocfs2-users] ocfs2 "waiting for abort to complete" message during scp copy
We've just set up ocfs2 v1.2 on a 6-way Red Hat Linux (Linux rac99003
2.6.9-34.ELsmp #1 SMP Tue Mar 7 15:16:40 CST 2006 x86_64 x86_64 x86_64
GNU/Linux) cluster.  When we try to scp a 5 GB file from a server
outside the cluster to a node in the cluster, the copy always hangs the
cluster being copied to.  We've set elevator=deadline in
/boot/grub/grub.conf, with no luck.  Any ideas?  Here's the
/var/log/messages output from when the hang occurs:
 
Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:384 Nodes in
my domain ("AD8CB11991A54F7B87050F9336E43B77"):
Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:388  node 0
Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:388  node 1
Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:388  node 2
Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:388  node 3
Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:388  node 4
Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:388  node 5
Apr 10 15:20:44 rac99003 sshd(pam_unix)[7778]: session opened for user
oracle by (uid=0)
Apr 10 15:21:06 rac99003 su(pam_unix)[6075]: session opened for user
oracle by (uid=0)
Apr 10 15:21:06 rac99003 logger: Running CRSD with TZ = 
Apr 10 15:21:08 rac99003 su(pam_unix)[7991]: session opened for user
oracle by (uid=0)
Apr 10 15:26:26 rac99003 kernel: lpfc 0000:05:03.0: 1:0748 abort handler
timed out waiting for abort to complete. Data: x0 x0 x3 x2c1cd
Apr 10 15:26:26 rac99003 kernel: lpfc 0000:05:01.0: 0:0748 abort handler
timed out waiting for abort to complete. Data: x0 x0 x3 x2c1ca
Apr 10 15:27:17 rac99003 sshd(pam_unix)[17175]: authentication failure;
logname= uid=0 euid=0 tty=ssh ruser= rhost=172.19.198.57  user=root
Apr 10 15:27:23 rac99003 sshd(pam_unix)[17452]: session opened for user
root by root(uid=0)
Apr 10 15:27:28 rac99003 kernel: lpfc 0000:05:03.0: 1:0748 abort handler
timed out waiting for abort to complete. Data: x0 x0 x3 x2c1ce
Apr 10 15:27:28 rac99003 kernel: lpfc 0000:05:01.0: 0:0748 abort handler
timed out waiting for abort to complete. Data: x0 x0 x3 x2c1cf
 
John H. Thompson
Infrastructure and Database Services
H-E-B
646 S. Main Ave. 
San Antonio, TX 78204
Office: 210-938-8528
 