van Ammers, Marnix
2009-Apr-11 23:37 UTC
Rsync 3.0.5 sometimes hangs copying local disk to other local disk (Solaris 9)
I hope I'm sending this report to the right place. I recently upgraded the rsync binaries in a bunch of our Solaris 9 systems from rsync 2.6.9 to rsync 3.0.5 . I have a script that runs in each machine that syncs the file systems from an active drive to a backup drive. Wihtin 3 days, this script had hung in two of the ~20 hosts that I had updated with the new rsync. Those two hosts happened to be two of the busiest ones, so very likely files were being removed and added as the rsync was taking place. I have done an ps -fp on the rsync processes involved, plus an lsof, a truss trace, and a netstat -na . The results are in files, each around 30K bytes in size. In case it helps, I'll include all but the long netstat outputs from one host here. If someone asks for the full results for one or both of these hosts, I can send them to him/her. I have never had hung processes with the older rsync-2.6.9 . Marnix A. van Ammers The rsync command was: /usr/pge2/bin/rsync -vaxH --delete --exclude-from=/tmp/fs_rsync.25232 / /mnt 2009-04-11 14:28:46 //mta02/home/mav6 @# ps -fp 25242,25243,25239,25232 UID PID PPID C STIME TTY TIME CMD root 25232 25229 0 Apr 10 ? 0:02 /usr/pge2/bin/perl /usr/pge2/bin/fs_rsync --verbosity=2 --allowvanish --configf root 25243 25242 0 Apr 10 ? 0:17 /usr/pge2/bin/rsync -vaxH --delete --exclude-from=/tmp/fs_rsync.25232 / /mnt root 25239 25232 0 Apr 10 ? 0:12 /usr/pge2/bin/rsync -vaxH --delete --exclude-from=/tmp/fs_rsync.25232 / /mnt root 25242 25239 0 Apr 10 ? 0:12 /usr/pge2/bin/rsync -vaxH --delete --exclude-from=/tmp/fs_rsync.25232 / /mnt 2009-04-11 14:28:56 //mta02/home/mav6 @# lsof -p 25239,25242,25243 COMMAND PID USER FD TYPE DEVICE SIZE/OFF NODE NAME rsync-3.0 25239 root cwd VDIR 32,0 1024 2 / rsync-3.0 25239 root txt VREG 32,5 2906260 2350 /usr2/pge2/bin/rsync-3.0.5 rsync-3.0 25239 root txt VREG 32,0 867796 5746 /usr/lib/libc.so.1 rsync-3.0 25239 root txt VREG 32,0 35064 2903 /usr/lib/nss_files.so.1 rsync-3.0 25239 root txt VREG 32,0 744320 6074 /usr/lib/libnsl.so.1 rsync-3.0 25239 root txt VREG 32,0 21676 2864 /usr/lib/libmp.so.2 rsync-3.0 25239 root txt VREG 32,0 58504 2888 /usr/lib/libsocket.so.1 rsync-3.0 25239 root txt VREG 32,0 316504 2810 /usr/lib/libresolv.so.2 rsync-3.0 25239 root txt VREG 32,0 21268 2884 /usr/lib/libsec.so.1 rsync-3.0 25239 root txt VREG 32,0 4504 3208 /usr/platform/sun4u-us3/lib/libc_psr.so.1 rsync-3.0 25239 root txt VREG 32,0 231076 46004 /usr/lib/ld.so.1 rsync-3.0 25239 root txt VREG 32,0 4400 10401 /usr/lib/libdl.so.1 rsync-3.0 25239 root 0r VCHR 13,2 0t0 3261 /devices/pseudo/mm@0:null rsync-3.0 25239 root 1u FIFO 0x3004f0aec88 0t102 74926527 (fifofs) ->0x3004f0aeb80 rsync-3.0 25239 root 2u FIFO 0x3004f0aec88 0t102 74926527 (fifofs) ->0x3004f0aeb80 rsync-3.0 25239 root 4u unix 105,118 0xea13c57 3270 /devices/pseudo/tl@0:ticots->(socketpair: 0x230f23f00000000) (0x3100852b048) rsync-3.0 25239 root 7u unix 105,121 0t1285476 3270 /devices/pseudo/tl@0:ticots->(socketpair: 0xf241) (0x30002fab018) rsync-3.0 25242 root cwd VDIR 32,8 1024 2 /mnt rsync-3.0 25242 root txt VREG 32,5 2906260 2350 /usr2/pge2/bin/rsync-3.0.5 rsync-3.0 25242 root txt VREG 32,0 867796 5746 /usr/lib/libc.so.1 rsync-3.0 25242 root txt VREG 32,0 35064 2903 /usr/lib/nss_files.so.1 rsync-3.0 25242 root txt VREG 32,0 744320 6074 /usr/lib/libnsl.so.1 rsync-3.0 25242 root txt VREG 32,0 21676 2864 /usr/lib/libmp.so.2 rsync-3.0 25242 root txt VREG 32,0 58504 2888 /usr/lib/libsocket.so.1 rsync-3.0 25242 root txt VREG 32,0 316504 2810 /usr/lib/libresolv.so.2 rsync-3.0 25242 root txt VREG 32,0 21268 2884 /usr/lib/libsec.so.1 rsync-3.0 25242 root txt VREG 32,0 4504 3208 /usr/platform/sun4u-us3/lib/libc_psr.so.1 rsync-3.0 25242 root txt VREG 32,0 231076 46004 /usr/lib/ld.so.1 rsync-3.0 25242 root txt VREG 32,0 4400 10401 /usr/lib/libdl.so.1 rsync-3.0 25242 root 1u unix 105,120 0t1285476 3270 /devices/pseudo/tl@0:ticots->(socketpair: 0x230f24100000000) (0x300b8bb6ca0) rsync-3.0 25242 root 2u FIFO 0x3004f0aec88 0t102 74926527 (fifofs) ->0x3004f0aeb80 rsync-3.0 25242 root 5u unix 105,125 0t1447584 3270 /devices/pseudo/tl@0:ticots->(socketpair: 0xf243) (0x300b8bb73c0) rsync-3.0 25243 root cwd VDIR 32,8 1024 2 /mnt rsync-3.0 25243 root txt VREG 32,5 2906260 2350 /usr2/pge2/bin/rsync-3.0.5 rsync-3.0 25243 root txt VREG 32,0 867796 5746 /usr/lib/libc.so.1 rsync-3.0 25243 root txt VREG 32,0 35064 2903 /usr/lib/nss_files.so.1 rsync-3.0 25243 root txt VREG 32,0 744320 6074 /usr/lib/libnsl.so.1 rsync-3.0 25243 root txt VREG 32,0 21676 2864 /usr/lib/libmp.so.2 rsync-3.0 25243 root txt VREG 32,0 58504 2888 /usr/lib/libsocket.so.1 rsync-3.0 25243 root txt VREG 32,0 316504 2810 /usr/lib/libresolv.so.2 rsync-3.0 25243 root txt VREG 32,0 21268 2884 /usr/lib/libsec.so.1 rsync-3.0 25243 root txt VREG 32,0 4504 3208 /usr/platform/sun4u-us3/lib/libc_psr.so.1 rsync-3.0 25243 root txt VREG 32,0 231076 46004 /usr/lib/ld.so.1 rsync-3.0 25243 root txt VREG 32,0 4400 10401 /usr/lib/libdl.so.1 rsync-3.0 25243 root 0u unix 105,119 0xea13c57 3270 /devices/pseudo/tl@0:ticots->(socketpair: 0xf23f) (0x30002fabc90) rsync-3.0 25243 root 2u FIFO 0x3004f0aec88 0t102 74926527 (fifofs) ->0x3004f0aeb80 rsync-3.0 25243 root 4u unix 105,124 0t1447584 3270 /devices/pseudo/tl@0:ticots->(socketpair: 0x230f24300000000) (0x310134af3d0) 2009-04-11 14:33:17 //mta02/home/mav6 @# truss -p 25239 poll(0xFFBFD4B8, 1, 60000) (sleeping...) ^C2009-04-11 14:33:38 //mta02/home/mav6 @# truss -p 25242 poll(0xFFBFBCD0, 1, 60000) (sleeping...) ^C2009-04-11 14:33:59 //mta02/home/mav6 @# truss -p 25243 poll(0xFFBFB6E0, 1, 60000) (sleeping...) ^C -------------- next part -------------- HTML attachment scrubbed and removed