similar to: [Bug 13082] New: [REQ] Hardware / SSE based MD5 operations

Displaying 20 results from an estimated 4000 matches similar to: "[Bug 13082] New: [REQ] Hardware / SSE based MD5 operations"

2020 May 18
2
[PATCH] SSE2/SSSE3 optimized version of get_checksum1() for x86-64
Well, don't get too excited, get_checksum1() (the function optimized here) is not the great performance limiter in this case, it's get_checksum2() and sum_update(), which will be using MD5. You can force using MD4, but on the slower CPU's I've tested in practice that is slower rather than faster, contrary to what would be expected. While this patch will improve things a little, to
2020 May 18
1
[PATCH] SSE2/SSSE3 optimized version of get_checksum1() for x86-64
I think you're missing a point here. Two different checksum algorithms are used in concert, the Adler-based one and the MD5 one. I SSE-optimized the Adler-based one. The Adler-based hash is used to _find_ blocks that might have shifted, while the MD5 hash is a strong cryptographic hash used to _verify_ blocks and files. You wouldn't want to replace the MD5 hash with the Adler-based hash,
2020 May 18
6
[PATCH] SSE2/SSSE3 optimized version of get_checksum1() for x86-64
This drop-in patch increases the performance of the get_checksum1() function on x86-64. On the target slow CPU performance of the function increased by nearly 50% in the x86-64 default SSE2 mode, and by nearly 100% if the compiler was told to enable SSSE3 support. The increase was over 200% on the fastest CPU tested in SSSE3 mode. Transfer time improvement with large files existing on both ends
2020 May 18
2
[PATCH] SSE2/SSSE3 optimized version of get_checksum1() for x86-64
I don't disagree that MD5 could (or even should) be replaced so it is no longer the bottleneck in several real-world cases (including mine). However this patch is not for MD5 performance, rather for the rolling checksum rsync uses to match blocks on existing files on both ends to reduce transfer size. On Mon, May 18, 2020 at 5:44 PM Filipe Maia via rsync <rsync at lists.samba.org>
2020 May 24
3
[PATCH] file_checksum() optimization
When a whole-file checksum is performed, hashing was done in 64 byte blocks, causing overhead and limiting performance. Testing showed the performance improvement to go up quickly going from 64 to 512 bytes, with diminishing returns above, 4096 was where it seemed to plateau for me. Re-used CHUNK_SIZE (32 kB) as it already exists and should be fine to use here anyway. Noticed this because
2017 Oct 04
10
[Bug 13071] New: [PATCH] Allow --partial-dir with --inplace
https://bugzilla.samba.org/show_bug.cgi?id=13071 Bug ID: 13071 Summary: [PATCH] Allow --partial-dir with --inplace Product: rsync Version: 3.1.3 Hardware: All OS: All Status: NEW Severity: normal Priority: P5 Component: core Assignee: wayned at samba.org Reporter:
2020 May 19
5
[PATCHv2] SSE2/SSSE3 optimized version of get_checksum1() for x86-64
I've read up some more on the subject, and it seems the proper way to do this with GCC is g++ and target attributes. I've refactored the patch that way, and it indeed uses SSSE3 automatically on supporting CPUs, regardless of the build host, so this should be ideal both for home builders and distros. Getting the code to build right in c++ mode (checksum_sse2.cpp only) was a bit of an
2018 May 18
9
[Bug 13445] New: Fuzzy searching in link-dest tries to open regular file as directory
https://bugzilla.samba.org/show_bug.cgi?id=13445 Bug ID: 13445 Summary: Fuzzy searching in link-dest tries to open regular file as directory Product: rsync Version: 3.1.3 Hardware: All OS: All Status: NEW Severity: normal Priority: P5 Component: core
2014 Jul 24
8
[Bug 10738] New: report --stats output when termination signal arrives
https://bugzilla.samba.org/show_bug.cgi?id=10738 Summary: report --stats output when termination signal arrives Product: rsync Version: 3.0.9 Platform: x64 OS/Version: Linux Status: NEW Severity: enhancement Priority: P5 Component: core AssignedTo: wayned at samba.org ReportedBy:
2020 May 23
4
[PATCH] Optimized assembler version of md5_process() for x86-64
On 2020-05-22 22:54:18 [-0700], Wayne Davison via rsync wrote: > Thanks for the optimizing patches, Jorrit! I've merged your latest changes > into the git master branch. Wouldn't it be better to add support for a crypto library (like openssl) which would provide optimized algorithms for more than just one platform without the need to maintain it separately? > ..wayne..
2020 May 18
3
[PATCH] SSE2/SSSE3 optimized version of get_checksum1() for x86-64
What do you base this on? Per https://gcc.gnu.org/onlinedocs/gcc/x86-Options.html : "For the x86-32 compiler, you must use -march=cpu-type, -msse or -msse2 switches to enable SSE extensions and make this option effective. For the x86-64 compiler, these extensions are enabled by default." That reads to me like we're fine for SSE2. As stated in my comments, SSSE3 support must be
2017 Jan 16
4
[Bug 12522] New: [PATCH] Send last error messages to sender
https://bugzilla.samba.org/show_bug.cgi?id=12522 Bug ID: 12522 Summary: [PATCH] Send last error messages to sender Product: rsync Version: 3.1.2 Hardware: All OS: All Status: NEW Severity: normal Priority: P5 Component: core Assignee: wayned at samba.org Reporter:
2017 Jun 04
5
[Bug 12817] New: [PATCH] Allow daemon itself to chroot
https://bugzilla.samba.org/show_bug.cgi?id=12817 Bug ID: 12817 Summary: [PATCH] Allow daemon itself to chroot Product: rsync Version: 3.1.2 Hardware: All OS: All Status: NEW Severity: normal Priority: P5 Component: core Assignee: wayned at samba.org Reporter:
2018 Mar 14
3
rsync of a reflink from OCFS2
no backup - no mercy ----- On Mar 14, 2018, at 2:19 PM, Ben RUBSON ben.rubson at gmail.com wrote: > On 14 Mar 2018, Lentes, Bernd via rsync wrote: > >> I would now expect a rsync from the snap would transfer just some megay >> bytes to the file from the day before. >> But it doesn't: >> >> ha-idg-1:/cluster/guests/servers_alive # time rsync -av --stats
2017 Jan 05
3
[Bug 12498] New: --fuzzy --fuzzy hugely impacts performance even if its' not needed
https://bugzilla.samba.org/show_bug.cgi?id=12498 Bug ID: 12498 Summary: --fuzzy --fuzzy hugely impacts performance even if its' not needed Product: rsync Version: 3.1.2 Hardware: All OS: All Status: NEW Severity: normal Priority: P5 Component: core
2017 Jan 17
3
[Bug 12525] New: [PATCH] Avoid uploading whole file when transfer fails
https://bugzilla.samba.org/show_bug.cgi?id=12525 Bug ID: 12525 Summary: [PATCH] Avoid uploading whole file when transfer fails Product: rsync Version: 3.1.2 Hardware: All OS: All Status: NEW Severity: normal Priority: P5 Component: core Assignee: wayned at samba.org
2017 Oct 12
2
[Bug 13083] New: [PATCH] Use both partial file + basis file
https://bugzilla.samba.org/show_bug.cgi?id=13083 Bug ID: 13083 Summary: [PATCH] Use both partial file + basis file Product: rsync Version: 3.1.3 Hardware: All OS: All Status: NEW Severity: normal Priority: P5 Component: core Assignee: wayned at samba.org Reporter:
2017 Jun 13
2
[Bug 12838] New: [PATCH] Log sent/received bytes even in case of error
https://bugzilla.samba.org/show_bug.cgi?id=12838 Bug ID: 12838 Summary: [PATCH] Log sent/received bytes even in case of error Product: rsync Version: 3.1.2 Hardware: All OS: All Status: NEW Severity: normal Priority: P5 Component: core Assignee: wayned at samba.org
2013 Dec 19
5
[Bug 10338] New: Start deletion from the top of the hierarchy
https://bugzilla.samba.org/show_bug.cgi?id=10338 Summary: Start deletion from the top of the hierarchy Product: rsync Version: 3.1.0 Platform: All OS/Version: All Status: NEW Severity: enhancement Priority: P5 Component: core AssignedTo: wayned at samba.org ReportedBy: ben.rubson at
2018 Mar 14
3
rsync of a reflink from OCFS2
--no-whole-file would only make it even worse. It would have to read the remote file over the network in order to do the diff then it would write the whole file over the network anyway (--inplace would help a little). Local copies force --whole-file for a good reason. On 03/14/2018 10:05 AM, Ben RUBSON via rsync wrote: > On 14 Mar 2018, Lentes, Bernd via rsync wrote: > >> ----- On