Evan Harris
2007-Jan-08 07:52 UTC
Extremely poor rsync performance on very large files (near 100GB and larger)
I've been playing with rsync and very large files approaching and surpassing 100GB, and have found that rsync has extremely poor performance on these very large files, and that the performance appears to degrade the larger the file gets. The problem only appears to happen when the file is being "updated", that is, when it already exists on the receiving side.

For instance, between two machines with 3GHz processors, 2GB or more of RAM, and gigabit ethernet, I'm getting about 1.5MB/sec transfer rates from the following command on an 80GB file:

    rsync -avxP -e ssh --inplace test1:/mnt/database/ ./

However, I get around 30MB/sec from the same command using the whole-file (-W) switch. If the file doesn't exist on the receiving side, I also get the much more reasonable 30MB/sec rate whether using -W or not. I haven't tested this without the --inplace switch (as I don't have enough free space on the test filesystems to make a full copy of the large files being rsync'd), but I would presume it behaves similarly.

Another thing of note is that the sending process running on the test1 machine during these poor-performance tests appears to be CPU bound (consuming 99% of CPU time), while during a whole-file rsync of the same file, the sending rsync process is only utilizing about 10-15% of CPU, and the ssh process it is spawned through is consuming around 30-35%. I'm assuming the latter is the encryption overhead of the much faster transfer rate.

I haven't dug into it very deeply, but maybe it has something to do with the block size chosen by rsync? Perhaps rsync is picking much too small a block size and is continually traversing the list of block hashes? I doubt it is the hashing of the data blocks themselves, as that would load both sides of the connection fairly equally, and that is not the case. Is there another reason that might cause the sending side to be so strangely CPU intensive?

This was tested with rsync 2.6.8 on both sides. I haven't yet tested with 2.6.9, but after looking at the NEWS file for the 2.6.9 release, it doesn't look like anything has changed that is likely to affect this issue.

Thanks.

Evan
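For what it's worth, here is a minimal sketch of the search loop being speculated about above (illustrative names only, not rsync's actual source). The sender slides a rolling checksum along the local file one byte at a time and looks each value up in a table of the receiver's block checksums; if many blocks collide in the same slot, every byte offset pays for a walk down a collision chain, which would load only the sending side, consistent with the 99% CPU observation:

    #include <stdint.h>
    #include <stddef.h>

    /* Illustrative block-checksum entry (hypothetical layout). */
    struct block_sum {
        uint32_t weak;            /* rolling (weak) checksum of one block */
        struct block_sum *next;   /* collision chain within a hash slot  */
    };

    /* One lookup per byte offset in the local file.  If the table has
     * far fewer slots than there are blocks, these chains grow long
     * and the sender goes CPU bound while the receiver sits idle.     */
    static struct block_sum *find_weak_match(struct block_sum **table,
                                             uint32_t nslots,
                                             uint32_t rolling)
    {
        struct block_sum *s;
        for (s = table[rolling % nslots]; s != NULL; s = s->next)
            if (s->weak == rolling)
                return s;     /* candidate; verify with strong checksum */
        return NULL;
    }

Note that the expensive strong checksum is only computed when a weak match is found, which fits the observation above that block hashing alone can't explain the lopsided CPU load.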
Wayne Davison
2007-Jan-08 18:16 UTC
Extremely poor rsync performance on very large files (near 100GB and larger)
On Mon, Jan 08, 2007 at 01:37:45AM -0600, Evan Harris wrote:

> I've been playing with rsync and very large files approaching and
> surpassing 100GB, and have found that rsync has extremely poor
> performance on these very large files, and the performance appears to
> degrade the larger the file gets.

Yes, this is caused by the current hashing algorithm that the sender uses to find matches for moved data. The current hash table has a fixed size of 65536 slots, and can get overloaded for really large files.

There is a diff in the patches dir that makes rsync work better with large files: dynamic_hash.diff. This makes the size of the hash table depend on how many blocks there are in the transfer. It does speed up the transfer of large files significantly, but since it introduces a mod (%) operation on a per-byte basis, it slows down the transfer of normal-sized files significantly.

I'm going to be checking into using a hash algorithm with a table that is always a power of 2 in size as an alternative implementation of this dynamic hash algorithm. That will hopefully not bloat the CPU time for normal-sized files. Alternately, the hashing algorithm could be made to vary depending on the file's size. I'm hoping to have this improved in the upcoming 3.0.0 release.

And one final thought that occurred to me: it would also be possible for the sender to segment a really large file into several chunks, handling each one without overlap, all without the generator or the receiver knowing that it was happening. The upside is that huge files could be handled this way, but the downside is that the incremental-sync algorithm would not find matches spanning the chunks. It would be interesting to test this and see if the rsync algorithm would be better served by using a larger number of smaller chunks while segmenting the file, rather than a smaller number of much larger chunks while considering the file as a whole.

..wayne..
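To make the trade-off concrete, here is a minimal sketch of the two indexing strategies described above (hypothetical names, not the actual dynamic_hash.diff code). A table sized exactly to the block count needs a true modulo on every per-byte lookup, which costs a hardware divide; rounding the table up to a power of 2 lets the same reduction be done with a single cheap bit-mask:

    #include <stdint.h>

    /* Dynamic table sized to the block count: the index needs a real
     * modulo, one hardware divide per per-byte lookup. */
    static inline uint32_t index_mod(uint32_t rolling, uint32_t nslots)
    {
        return rolling % nslots;       /* nslots ~ number of blocks */
    }

    /* Same idea with the table rounded up to a power of 2: the modulo
     * reduces to one AND instruction per lookup. */
    static inline uint32_t index_pow2(uint32_t rolling, uint32_t mask)
    {
        return rolling & mask;         /* mask = table_size - 1 */
    }

    /* Round a block count up to the next power of 2 to build the mask. */
    static uint32_t next_pow2(uint32_t n)
    {
        uint32_t p = 1;
        while (p < n)
            p <<= 1;
        return p;
    }

Since the lookup runs once per byte of the source file, shaving a divide down to an AND is exactly the kind of change that keeps the dynamic table from penalizing normal-sized files.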