Displaying 6 results from an estimated 6 matches for "checksum_sse2".
2020 May 18
6
[PATCH] SSE2/SSSE3 optimized version of get_checksum1() for x86-64
...From f5d0b32df869a23a74b8b8295e4983b0943866df Mon Sep 17 00:00:00 2001
From: Jorrit Jongma <git at jongma.org>
Date: Mon, 18 May 2020 00:21:39 +0200
Subject: [PATCH 1/1] SSE2/SSSE3 optimized version of get_checksum1() for
x86-64
---
Makefile.in     |   2 +-
checksum.c      |   2 +
checksum_sse2.c | 243 ++++++++++++++++++++++++++++++++++++++++++++++++
3 files changed, 246 insertions(+), 1 deletion(-)
create mode 100644 checksum_sse2.c
diff --git a/Makefile.in b/Makefile.in
index 59649562..e4202336 100644
--- a/Makefile.in
+++ b/Makefile.in
@@ -40,7 +40,7 @@ OBJS1=flist.o rsync.o generat...
2020 May 18
0
[PATCH] SSE2/SSSE3 optimized version of get_checksum1() for x86-64
...943866df Mon Sep 17 00:00:00 2001
> From: Jorrit Jongma <git at jongma.org>
> Date: Mon, 18 May 2020 00:21:39 +0200
> Subject: [PATCH 1/1] SSE2/SSSE3 optimized version of get_checksum1() for
> x86-64
>
> ---
> Makefile.in     |   2 +-
> checksum.c      |   2 +
> checksum_sse2.c | 243 ++++++++++++++++++++++++++++++++++++++++++++++++
> 3 files changed, 246 insertions(+), 1 deletion(-)
> create mode 100644 checksum_sse2.c
>
> diff --git a/Makefile.in b/Makefile.in
> index 59649562..e4202336 100644
> --- a/Makefile.in
> +++ b/Makefile.in
> @@ -40,7...
2020 May 20
0
[PATCHv2] SSE2/SSSE3 optimized version of get_checksum1() for x86-64
...s with GCC is g++ and target attributes. I've refactored the
> patch that way, and it indeed uses SSSE3 automatically on supporting
> CPUs, regardless of the build host, so this should be ideal both for
> home builders and distros.
>
> Getting the code to build right in c++ mode (checksum_sse2.cpp only)
> was a bit of an adventure, requiring modifications to mkproto.awk,
> configure.ac, and Makefile.in.
>
> I've done my best to prevent any c++ compilation happening in case the
> optimizations are not enabled (g++ isn't used, build target isn't
> x86-64, or -...
2020 May 19
5
[PATCHv2] SSE2/SSSE3 optimized version of get_checksum1() for x86-64
...the proper way to
do this with GCC is g++ and target attributes. I've refactored the
patch that way, and it indeed uses SSSE3 automatically on supporting
CPUs, regardless of the build host, so this should be ideal both for
home builders and distros.
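
As an illustration of the "g++ and target attributes" approach described above, here is a minimal sketch of GCC function multiversioning. It is not the code from the patch: byte_sum() is a hypothetical stand-in for the real checksum routine, and the sketch assumes g++ on an x86-64 Linux system (other targets fall back to a single scalar function). Two definitions share one name and signature, and the loader picks the SSSE3 one on CPUs that support it, independent of the build host.

```cpp
#include <cstddef>
#include <cstdint>

#if defined(__x86_64__) && defined(__GNUC__)
#include <immintrin.h>

// Baseline variant: selected at load time when the CPU lacks SSSE3.
__attribute__((target("default")))
std::uint32_t byte_sum(const unsigned char* buf, std::size_t len) {
    std::uint32_t sum = 0;
    for (std::size_t i = 0; i < len; ++i) sum += buf[i];
    return sum;
}

// SSSE3 variant: same name and signature, selected automatically
// on CPUs that report SSSE3 support.
__attribute__((target("ssse3")))
std::uint32_t byte_sum(const unsigned char* buf, std::size_t len) {
    const __m128i ones8 = _mm_set1_epi8(1);
    const __m128i ones16 = _mm_set1_epi16(1);
    __m128i acc = _mm_setzero_si128();
    std::size_t i = 0;
    for (; i + 16 <= len; i += 16) {
        __m128i v = _mm_loadu_si128(reinterpret_cast<const __m128i*>(buf + i));
        // SSSE3: unsigned bytes times 1, adjacent pairs added into 8 x int16.
        __m128i pairs = _mm_maddubs_epi16(v, ones8);
        // SSE2: int16 times 1, adjacent pairs added into 4 x int32.
        acc = _mm_add_epi32(acc, _mm_madd_epi16(pairs, ones16));
    }
    // Horizontal sum of the four 32-bit lanes.
    alignas(16) std::uint32_t lanes[4];
    _mm_store_si128(reinterpret_cast<__m128i*>(lanes), acc);
    std::uint32_t sum = lanes[0] + lanes[1] + lanes[2] + lanes[3];
    // Scalar tail for the remaining 0..15 bytes.
    for (; i < len; ++i) sum += buf[i];
    return sum;
}
#else
// Hypothetical fallback for other compilers and targets: scalar only.
std::uint32_t byte_sum(const unsigned char* buf, std::size_t len) {
    std::uint32_t sum = 0;
    for (std::size_t i = 0; i < len; ++i) sum += buf[i];
    return sum;
}
#endif
```

Callers simply call byte_sum(); no CPU check or special compiler flag is needed at the call site, which is presumably why this style suits both home builders and distro packages.
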
Getting the code to build right in c++ mode (checksum_sse2.cpp only)
was a bit of an adventure, requiring modifications to mkproto.awk,
configure.ac, and Makefile.in.
I've done my best to prevent any c++ compilation happening in case the
optimizations are not enabled (g++ isn't used, build target isn't
x86-64, or --disable-sse2 was passed to c...
2020 May 18
2
[PATCH] SSE2/SSSE3 optimized version of get_checksum1() for x86-64
...>> From: Jorrit Jongma <git at jongma.org>
>> Date: Mon, 18 May 2020 00:21:39 +0200
>> Subject: [PATCH 1/1] SSE2/SSSE3 optimized version of get_checksum1() for
>> x86-64
>>
>> ---
>> Makefile.in     |   2 +-
>> checksum.c      |   2 +
>> checksum_sse2.c | 243 ++++++++++++++++++++++++++++++++++++++++++++++++
>> 3 files changed, 246 insertions(+), 1 deletion(-)
>> create mode 100644 checksum_sse2.c
>>
>> diff --git a/Makefile.in b/Makefile.in
>> index 59649562..e4202336 100644
>> --- a/Makefile.in
>> +++...
2020 May 18
3
[PATCH] SSE2/SSSE3 optimized version of get_checksum1() for x86-64
What do you base this on?
Per https://gcc.gnu.org/onlinedocs/gcc/x86-Options.html :
"For the x86-32 compiler, you must use -march=cpu-type, -msse or
-msse2 switches to enable SSE extensions and make this option
effective. For the x86-64 compiler, these extensions are enabled by
default."
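
The quoted documentation can be checked at compile time. The following is a small sketch, not part of the patch, that relies on the predefined macros __x86_64__, __SSE2__ and __SSSE3__ that GCC sets according to the target and the enabled instruction sets; the function names are hypothetical.

```cpp
// True when this translation unit is being compiled for x86-64.
constexpr bool targets_x86_64() {
#if defined(__x86_64__)
    return true;
#else
    return false;
#endif
}

// True when the compiler is allowed to emit SSE2 instructions here.
constexpr bool built_with_sse2() {
#if defined(__SSE2__)
    return true;
#else
    return false;
#endif
}

// True only when SSSE3 was enabled explicitly, e.g. via -mssse3 or -march.
constexpr bool built_with_ssse3() {
#if defined(__SSSE3__)
    return true;
#else
    return false;
#endif
}

// On x86-64 with default flags, SSE2 is expected to be available.
static_assert(!targets_x86_64() || built_with_sse2() || true,
              "informational only; see the runtime check in the usage note");
```

With default flags on an x86-64 build, built_with_sse2() is expected to return true, while built_with_ssse3() returns true only if the build explicitly enabled SSSE3.
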
That reads to me like we're fine for SSE2. As stated in my comments,
SSSE3 support must be