samba-bugs at samba.org
2014-Apr-29 13:12 UTC
[Bug 10575] New: Long Delay for Large Folders Even with Incremental File-List
https://bugzilla.samba.org/show_bug.cgi?id=10575 Summary: Long Delay for Large Folders Even with Incremental File-List Product: rsync Version: 3.1.0 Platform: All OS/Version: All Status: NEW Severity: normal Priority: P5 Component: core AssignedTo: wayned at samba.org ReportedBy: samba at haravikk.com QAContact: rsync-qa at samba.org Okay, so I have a folder I need to backup that unfortunately contains a massive 32,000+ files (it's part of a several hundred gigabytes large disk-image). However, using rsync to synchronise this folder with a copy on another machine on my local network takes an incredibly long time, largely thanks to a huge delay before any files start being sent. It seems that even with the incremental file-list enabled that rsync is still waiting to scan the entire folder before any files start being transferred, even though files high up in the list should be ready to send right away. Perhaps I'm misunderstanding how the incremental file-list generation works, but surely the number of files in a folder shouldn't matter for incremental sending, as it can just send the file-paths in several batches so that the transfer can start right away? If this is not how it works then please let me know, and I can file an enhancement instead. Currently I'm having to use a timestamp based find to pipe files into rsync, but it's not really how I'd like to do things, as it won't detect deletions; fortunately those only happen if I manually request the disk image be compacted, so I can avoid the issue by simply never compacting the image (I don't need to anyway), but it seems like strange behaviour if incremental sending can't handle folders with tons of files properly. -- Configure bugmail: https://bugzilla.samba.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are the QA contact for the bug.
samba-bugs at samba.org
2014-Apr-29 13:18 UTC
[Bug 10575] Long Delay for Large Folders Even with Incremental File-List
https://bugzilla.samba.org/show_bug.cgi?id=10575 --- Comment #1 from Kevin Korb <rsync at sanitarium.net> 2014-04-29 13:18:35 UTC --- What is your command line? -- Configure bugmail: https://bugzilla.samba.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are the QA contact for the bug.
samba-bugs at samba.org
2014-Apr-29 17:19 UTC
[Bug 10575] Long Delay for Large Folders Even with Incremental File-List
https://bugzilla.samba.org/show_bug.cgi?id=10575 Wayne Davison <wayned at samba.org> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |RESOLVED Resolution| |WONTFIX --- Comment #2 from Wayne Davison <wayned at samba.org> 2014-04-29 17:19:54 UTC --- That's the way rsync works. It has to have the full directory listing so that it can sort it and work on it in order (and potentially run any --delete-during processing first), so it has to wait for the full list. Sadly, this isn't going to change. Sorry. -- Configure bugmail: https://bugzilla.samba.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are the QA contact for the bug.
samba-bugs at samba.org
2014-May-01 19:25 UTC
[Bug 10575] Long Delay for Large Folders Even with Incremental File-List
https://bugzilla.samba.org/show_bug.cgi?id=10575 --- Comment #3 from Haravikk <samba at haravikk.com> 2014-05-01 19:25:02 UTC --- Actually it seems the specific issue I was seeing was related to --fuzzy and the fact that the destination machine is a NAS (so much slower than my main machine); even though there was no actual need for fuzzy matching to take place, it presumably forces rsync to wait for the full folder list on the receiving side before it can be sure there aren't any matches for new files. With --fuzzy removed there is no noticeable delay before changes start getting sent; still takes a while, but that's just the 32,000+ items and the copying, rather than the additional delay I was seeing. Anyway, I suppose I'll have to resolve the issue by doing my normal --fuzzy command with an exclude rule for the disk image, and then run a separate sync without --fuzzy for just the disk image. I've posted issue #10581 with some proposals for tuning fuzzy performance to avoid this for a general purpose command. -- Configure bugmail: https://bugzilla.samba.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are the QA contact for the bug.
Possibly Parallel Threads
- [Bug 10581] New: --fuzzy-delay and --fuzzy-limit for fuzzy match tuning
- [Bug 10379] New: rsync metadata files
- [Bug 10601] New: Retry Delay for Vanished Files/Directories
- [Bug 9864] New: Allow permanent compression of destination files
- [Bug 10380] New: Non-Nested Folder Optimisation