Thompson,John
2006-Apr-10 20:34 UTC
[Ocfs2-users] ocfs2 "waiting for abort to complete" message during scp copy
We've just set up ocfs2 v1.2 on a 6-way Red Hat Linux cluster (Linux rac99003 2.6.9-34.ELsmp #1 SMP Tue Mar 7 15:16:40 CST 2006 x86_64 x86_64 x86_64 GNU/Linux). When we try to copy a 5 GB file from a server outside the cluster to a node on the cluster, the copy always hangs the cluster being copied to. We've set elevator=deadline in /boot/grub/grub.conf, with no luck. Any ideas?

Here's the /var/log/messages output when the hang occurs:

Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:384 Nodes in my domain ("AD8CB11991A54F7B87050F9336E43B77"):
Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:388 node 0
Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:388 node 1
Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:388 node 2
Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:388 node 3
Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:388 node 4
Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:388 node 5
Apr 10 15:20:44 rac99003 sshd(pam_unix)[7778]: session opened for user oracle by (uid=0)
Apr 10 15:21:06 rac99003 su(pam_unix)[6075]: session opened for user oracle by (uid=0)
Apr 10 15:21:06 rac99003 logger: Running CRSD with TZ =
Apr 10 15:21:08 rac99003 su(pam_unix)[7991]: session opened for user oracle by (uid=0)
Apr 10 15:26:26 rac99003 kernel: lpfc 0000:05:03.0: 1:0748 abort handler timed out waiting for abort to complete. Data: x0 x0 x3 x2c1cd
Apr 10 15:26:26 rac99003 kernel: lpfc 0000:05:01.0: 0:0748 abort handler timed out waiting for abort to complete. Data: x0 x0 x3 x2c1ca
Apr 10 15:27:17 rac99003 sshd(pam_unix)[17175]: authentication failure; logname= uid=0 euid=0 tty=ssh ruser= rhost=172.19.198.57 user=root
Apr 10 15:27:23 rac99003 sshd(pam_unix)[17452]: session opened for user root by root(uid=0)
Apr 10 15:27:28 rac99003 kernel: lpfc 0000:05:03.0: 1:0748 abort handler timed out waiting for abort to complete. Data: x0 x0 x3 x2c1ce
Apr 10 15:27:28 rac99003 kernel: lpfc 0000:05:01.0: 0:0748 abort handler timed out waiting for abort to complete. Data: x0 x0 x3 x2c1cf

John H. Thompson
Infrastructure and Database Services
H-E-B
646 S. Main Ave.
San Antonio, TX 78204
Office: 210-938-8528

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://oss.oracle.com/pipermail/ocfs2-users/attachments/20060410/f68d2b52/attachment.html