Hey list,
I am having problems as of late with my rsync backup. On the client
side I am using the following:
OPTS="-avvvrz
--compress-level=9
--itemize-changes
--delete
--delete-excluded
--human-readable
--files-from=$FILES
--include-from=$INCLUDES
--exclude-from=$EXCLUDES
--partial
--progress
--owner
--perms
--progress
--timeout=0
--times
--stats"
sudo rsync -e "ssh -i ${IDENTITY_FILE} -v -p ${REMOTE_PORT}" $OPTS \
    / $REMOTE_USER@$REMOTE_HOST:$REMOTE_PATH
Note that I have temporarily disabled timeouts and added extra
verbosity. The transfer to the remote host via SSH works fine until it
reaches a 30+ GB file (a VM image). It gets a little over 90 percent of
the way through that file, hangs, and then times out. On the client side
I see the following:
...
rsync: connection unexpectedly closed (3542035 bytes received so far) [sender]
rsync error: unexplained error (code 255) at io.c(226) [sender=3.1.1]
[sender] _exit_cleanup(code=12, file=io.c, line=226): about to call exit(255)
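For reference, the two codes in the output above (code=12 and 255) can be looked up with a small helper. This is only a sketch: the function name rsync_exit_meaning is made up for illustration, the descriptions are paraphrased from the EXIT VALUES section of rsync(1), and only a subset of codes is listed. As far as I can tell, 255 is not in that list; ssh(1) documents 255 as the status ssh itself returns when an error occurs.

```shell
# Hypothetical helper: map an rsync exit status to a short description.
# Descriptions are paraphrased from rsync(1); only a subset is covered.
rsync_exit_meaning() {
    case "$1" in
        0)   echo "success" ;;
        10)  echo "error in socket I/O" ;;
        11)  echo "error in file I/O" ;;
        12)  echo "error in rsync protocol data stream" ;;
        20)  echo "received SIGUSR1 or SIGINT" ;;
        23)  echo "partial transfer due to error" ;;
        24)  echo "partial transfer due to vanished source files" ;;
        30)  echo "timeout in data send/receive" ;;
        35)  echo "timeout waiting for daemon connection" ;;
        # Not listed in rsync(1); ssh(1) says ssh exits with 255 on error.
        255) echo "not an rsync code; ssh exits with 255 when it hits an error" ;;
        *)   echo "unknown (see the EXIT VALUES section of rsync(1))" ;;
    esac
}

# The two codes that appear in the client output
rsync_exit_meaning 12
rsync_exit_meaning 255
```

If that reading is right, the client output says the protocol stream broke (12) because the ssh transport underneath went away (255), which would point at the connection rather than at the file itself.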
On the server side, if I attach to the rsync process with strace, I see
the following:
$ strace -f -p 3095
...
3095 select(4, [3], [1], [1], {60, 0}) = 1 (out [1], left {59, 999971})
3095 write(1, "\3\0\0\7\1\0\0", 7) = 7
3095 gettimeofday({1476036967, 673095}, NULL) = 0
3095 select(4, [3], [1], [1], {60, 0}) = 1 (out [1], left {59, 999970})
3095 write(1, "H\0\0\trecv_files(home/kip/.Virtual"..., 76) = 76
3095 gettimeofday({1476036967, 680312}, NULL) = 0
3095 select(4, [3], [], NULL, {60, 0}) = 0 (Timeout)
3095 select(4, [3], [], NULL, {60, 0}) = 0 (Timeout)
3095 select(4, [3], [], NULL, {60, 0}) = 0 (Timeout)
3095 select(4, [3], [], NULL, {60, 0}) = 0 (Timeout)
3095 select(4, [3], [], NULL, {60, 0}) = 1 (in [3], left {40, 364402})
3095 read(3, "B\0\0\trecv_files(home/kip/.Virtual"..., 8184) = 160
3095 select(4, [3], [1], [1], {60, 0}) = 1 (out [1], left {59, 999973})
3095 write(1, "B\0\0\trecv_files(home/kip/.Virtual"..., 70) = 70
3095 gettimeofday({1476037227, 506412}, NULL) = 0
3095 select(4, [3], [1], [1], {60, 0}) = 1 (out [1], left {59, 999971})
3095 write(1, "V\0\0\trecv mapped home/kip/.Virtua"..., 90) = 90
3095 gettimeofday({1476037227, 512591}, NULL) = 0
3095 select(4, [3], [], NULL, {60, 0}) = 0 (Timeout)
... repeats a couple hundred times or so ...
3095 select(4, [3], [], NULL, {60, 0}) = 0 (Timeout)
3095 select(4, [3], [], NULL, {60, 0}
Note that it looks like the select() call is timing out while waiting
for input on file descriptor 3 (the first argument, 4, is just the
highest watched descriptor plus one; stdin, stdout, and stderr are 0-2
respectively). This could have nothing to do with rsync at all and could
be a file system issue, but I figured I'd ask.
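One way to find out what a descriptor in the strace output actually points to is to look it up under /proc. The following is a minimal sketch, assuming a Linux server with /proc mounted; describe_fd is a hypothetical helper name, not part of rsync or strace.

```shell
# Hypothetical helper: print what a file descriptor of a running process
# points to. Linux only; relies on the /proc/<pid>/fd symlinks.
describe_fd() {
    pid=$1
    fd=$2
    link="/proc/$pid/fd/$fd"
    if [ -e "$link" ] || [ -L "$link" ]; then
        # readlink prints e.g. "pipe:[12345]", "socket:[67890]" or a path
        readlink "$link"
    else
        # No such process or descriptor, or /proc is not available
        echo "unknown"
    fi
}

# Example against the current shell's stdin. For the process traced above
# one would run, as root or as the owning user: describe_fd 3095 3
describe_fd "$$" 0
```

A result of the form pipe:[...] or socket:[...] would mean the process is waiting on another process or on the network rather than on a file on disk.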
The server receiving the upload, where I ran strace, has rsync version:
$ rsync --version
rsync version 3.0.9 protocol version 30
The client reported:
$ rsync --version
rsync version 3.1.1 protocol version 31
Any help appreciated.
Regards,
--
Kip Warner -- Senior Software Engineer
OpenPGP encrypted/signed mail preferred
http://www.thevertigo.com