search for: immintrin

Displaying 20 results from an estimated 31 matches for "immintrin".

Did you mean: xmmintrin
2013 Jul 29
3
[PATCH 1/2] xv: fix last pixel for big-endian machines in YV12 -> NV12 conversion
Signed-off-by: Ilia Mirkin <imirkin at alum.mit.edu> --- src/nouveau_xv.c | 5 ++++- 1 file changed, 4 insertions(+), 1 deletion(-) diff --git a/src/nouveau_xv.c b/src/nouveau_xv.c index 8eafcf0..567e30c 100644 --- a/src/nouveau_xv.c +++ b/src/nouveau_xv.c @@ -552,8 +552,11 @@ NVCopyNV12ColorPlanes(unsigned char *src1, unsigned char *src2, if (e) { unsigned short *vud = (unsigned
2019 Feb 05
2
debugging installation problem
...have a build of a recent (Monday) llvm/clang which I have installed in the expected way in my environment but I am getting failures like this; In file included from <some directory>/lib/clang/stable/include/x86intrin.h:29: In file included from <some directory>/lib/clang/stable/include/immintrin.h:118: <some directory>/lib/clang/stable/include/avx512cdintrin.h:37:20: error: use of undeclared identifier '__builtin_ia32_vpconflictdi_512' return (__m512i) __builtin_ia32_vpconflictdi_512 ((__v8di) __A); ^ <some directory>/lib/clang/stable/include/avx512...
2019 Feb 05
2
debugging installation problem
...have installed in the expected way in my environment but I am getting failures like this; > > > > In file included from <some directory>/lib/clang/stable/include/x86intrin.h:29: > > In file included from <some directory>/lib/clang/stable/include/immintrin.h:118: > > <some directory>/lib/clang/stable/include/avx512cdintrin.h:37:20: error: use of undeclared identifier '__builtin_ia32_vpconflictdi_512' > > return (__m512i) __builtin_ia32_vpconflictdi_512 ((__v8di) __A); > > ^...
2013 Jul 31
0
[PATCH 2/2] xv: speed up YV12 -> NV12 conversion using SSE2 if available
On 2013-07-31 19:18 +0200, Ilia Mirkin wrote: > On Wed, Jul 31, 2013 at 1:16 PM, Sven Joachim <svenjoac at gmx.de> wrote: >> >> Unfortunately, immintrin.h is not available on most architectures, >> leading to build failures as can be seen on >> https://buildd.debian.org/status/package.php?p=xserver-xorg-video-nouveau. > > Sorry :( I thought that immintrin.h would be available everywhere and > just end up empty since none of the...
2014 Sep 30
2
[LLVMdev] size_t?
...ded from C:\Program Files (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\intrin.h:34: 1> In file included from C:\Program Files (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\x86intrin.h:29: 1> In file included from C:\Program Files (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\immintrin.h:28: 1>C:\Program Files (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\mmintrin.h(52,40): error : cannot initialize a parameter of type '__attribute__((__vector_size__(2 * sizeof(int)))) int' (vector of 2 'int' values) with an rvalue of type '__v2si' (aka 'int...
2017 Apr 08
2
How to insert vector type input parameter for function in C/C++ API?
...ction by C/C++ API. As you see below, the AVX2 data type is __m256i which is vector type. How can I indicate vector type (function return type, input parameters) for IRBuiler by C/C++ APIs? I don't see any example online and please let me know if anybody has examples. SOURCE: #include "immintrin.h" __m256i sum(__m256i a, __m256i b) { return a+b; } TARGET: michael at michael-Precision-Tower-3420:~/Year_2017/work_DEMO$ cat avx2_add2.ll ; ModuleID = 'avx2_add2.c' target datalayout = "e-m:w-i64:64-f80:128-n8:16:32:64-S128" target triple = "x86_64-unknown-window...
2017 Apr 08
3
How to insert vector type input parameter for function in C/C++ API?
...parameters) for >>>> IRBuiler by C/C++ APIs? >>>> >>>> I don't see any example online and please let me know if anybody has >>>> examples. >>>> >>>> >>>> >>>> SOURCE: >>>> #include "immintrin.h" >>>> __m256i sum(__m256i a, __m256i b) { >>>> return a+b; >>>> } >>>> >>>> TARGET: >>>> michael at michael-Precision-Tower-3420:~/Year_2017/work_DEMO$ cat >>>> avx2_add2.ll >>>> ; ModuleID =...
2016 May 31
2
[PATCH 1/2] Modify autoconf tests for intrinsics to stop clang from optimizing them away.
..._X86_MAY_HAVE_SSE4_1" = x"1" && test x"$OPUS_X86_PRESUME_SSE4_1" != x"1"], @@ -576,10 +585,13 @@ AS_IF([test x"$enable_intrinsics" = x"yes"],[ [OPUS_X86_MAY_HAVE_AVX], [OPUS_X86_PRESUME_AVX], [[#include <immintrin.h> + #include <time.h> ]], [[ - static __m256 mtest; - mtest = _mm256_setzero_ps(); + __m256 mtest; + mtest = _mm256_set1_ps((float)time(NULL)); + mtest = _mm256_addsub_ps(mtest, mtest); + r...
2017 Apr 08
2
How to insert vector type input parameter for function in C/C++ API?
...pe. How >> can I indicate vector type (function return type, input parameters) for >> IRBuiler by C/C++ APIs? >> >> I don't see any example online and please let me know if anybody has >> examples. >> >> >> >> SOURCE: >> #include "immintrin.h" >> __m256i sum(__m256i a, __m256i b) { >> return a+b; >> } >> >> TARGET: >> michael at michael-Precision-Tower-3420:~/Year_2017/work_DEMO$ cat >> avx2_add2.ll >> ; ModuleID = 'avx2_add2.c' >> target datalayout = "e-m:w-i...
2014 Sep 30
2
[LLVMdev] size_t?
...LVM\msbuild-bin\..\lib\clang\3.6.0\include\intrin.h:34: > 1> In file included from C:\Program Files > (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\x86intrin.h:29: > 1> In file included from C:\Program Files > (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\immintrin.h:28: > 1>C:\Program Files > (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\mmintrin.h(52,40): > error : cannot initialize a parameter of type > '__attribute__((__vector_size__(2 * sizeof(int)))) int' (vector of > 2 'int' values) with an rval...
2014 Oct 01
2
[LLVMdev] size_t?
...LVM\msbuild-bin\..\lib\clang\3.6.0\include\intrin.h:34: >> 1> In file included from C:\Program Files >> (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\x86intrin.h:29: >> 1> In file included from C:\Program Files >> (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\immintrin.h:28: >> 1>C:\Program Files >> (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\mmintrin.h(52,40): error >> : cannot initialize a parameter of type '__attribute__((__vector_size__(2 * >> sizeof(int)))) int' (vector of 2 'int' values) with an rvalue of typ...
2013 Jul 29
0
[PATCH 2/2] xv: speed up YV12 -> NV12 conversion using SSE2 if available
...ouveau_xv.c | 33 ++++++++++++++++++++++++++------- 1 file changed, 26 insertions(+), 7 deletions(-) diff --git a/src/nouveau_xv.c b/src/nouveau_xv.c index 567e30c..5569b7c 100644 --- a/src/nouveau_xv.c +++ b/src/nouveau_xv.c @@ -25,6 +25,8 @@ #include "config.h" #endif +#include <immintrin.h> + #include "xf86xv.h" #include <X11/extensions/Xv.h> #include "exa.h" @@ -532,30 +534,47 @@ NVCopyNV12ColorPlanes(unsigned char *src1, unsigned char *src2, w >>= 1; h >>= 1; +#ifdef __SSE2__ + l = w >> 3; + e = w & 7; +#else l = w &gt...
2013 Nov 07
0
[ANNOUNCE] xf86-video-nouveau 1.0.10
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Ben Skeggs (2): recognise GK208 as a Kepler board bump to 1.0.10 for release Dave Airlie (3): nouveau: fix build on ppc by wrapping immintrin include. nouveau: add gpu identifier to connector names for secondary gpus nouveau: fix build against older servers. git tag: xf86-video-nouveau-1.0.10 http://xorg.freedesktop.org/archive/individual/driver/xf86-video-nouveau-1.0.10.tar.bz2 MD5: 44fd65897630fdd19c086133cc1bd679 xf86-...
2013 Nov 07
0
[ANNOUNCE] xf86-video-nouveau 1.0.10
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Ben Skeggs (2): recognise GK208 as a Kepler board bump to 1.0.10 for release Dave Airlie (3): nouveau: fix build on ppc by wrapping immintrin include. nouveau: add gpu identifier to connector names for secondary gpus nouveau: fix build against older servers. git tag: xf86-video-nouveau-1.0.10 http://xorg.freedesktop.org/archive/individual/driver/xf86-video-nouveau-1.0.10.tar.bz2 MD5: 44fd65897630fdd19c086133cc1bd679 xf86-...
2014 Oct 01
2
[LLVMdev] size_t?
.....\lib\clang\3.6.0\include\intrin.h:34: >>> 1> In file included from C:\Program Files >>> (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\x86intrin.h:29: >>> 1> In file included from C:\Program Files >>> (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\immintrin.h:28: >>> 1>C:\Program Files >>> (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\mmintrin.h(52,40): error >>> : cannot initialize a parameter of type '__attribute__((__vector_size__(2 * >>> sizeof(int)))) int' (vector of 2 'int' values) with...
2016 Jul 06
2
GCC prerequisites for building LLVM head?
I've no idea, but without it, the 'std::to_string' definitions are not enabled! -----Original Message----- From: Renato Golin [mailto:renato.golin at linaro.org] Sent: 06 July 2016 19:35 To: Martin.ORiordan at movidius.com Cc: LLVM Developers <llvm-dev at lists.llvm.org> Subject: Re: [llvm-dev] GCC prerequisites for building LLVM head? On 6 July 2016 at 19:02, Martin J.
2017 Jun 14
2
Default FPENV state
Hi, We are interesting in expanding some vector operations directly in the IR form as constants https://reviews.llvm.org/D33406, for example: _mm256_cmp_ps("any input", "any input", _CMP_TRUE_UQ) should produce -1, -1, -1, ... vector, but for some values for example "1.00 -nan" if FPU exceptions were enabled this operation triggers the exception. Here is the question:
2020 May 18
6
[PATCH] SSE2/SSSE3 optimized version of get_checksum1() for x86-64
...n the + * pure C version anyway. + * + * GCC automatically enables SSE2 support on x86-64 builds. The SSSE3 code + * path must be enabled manually: ./configure CFLAGS="-mssse3 -O2" + */ + +#ifdef __x86_64__ +#ifdef __SSE2__ + +#include "rsync.h" + +#ifdef __SSSE3__ +#include <immintrin.h> +#else +#include <tmmintrin.h> +#endif + +/* Compatibility functions to let our SSSE3 algorithm run on SSE2 */ + +static inline __m128i sse_load_si128(void const* buf) { +#ifdef __SSSE3__ + return _mm_lddqu_si128(buf); // same as loadu on all but the oldest SSSE3 CPUs +#else + re...
2012 Jul 11
0
[LLVMdev] Compiling llvm and Clang on Linux
It's undocumented FAQ, if you are using RHEL5 (or clone). - install gcc44-c++ - Build with CC=gcc44 CXX=g++44 - You may need "CC=clang -std=gnu89" to use clang with its glibc. Have fun! ps. AFAIK, clang can be built more easily on centos6. ...Takumi 2012/7/11 Sitvanit Ruah <RUAH at il.ibm.com>: > > Hello all, > I am new to this mailing list so I hope this is
2020 May 18
0
[PATCH] SSE2/SSSE3 optimized version of get_checksum1() for x86-64
...atically enables SSE2 support on x86-64 builds. The SSSE3 code > + * path must be enabled manually: ./configure CFLAGS="-mssse3 -O2" > + */ > + > +#ifdef __x86_64__ > +#ifdef __SSE2__ > + > +#include "rsync.h" > + > +#ifdef __SSSE3__ > +#include <immintrin.h> > +#else > +#include <tmmintrin.h> > +#endif > + > +/* Compatibility functions to let our SSSE3 algorithm run on SSE2 */ > + > +static inline __m128i sse_load_si128(void const* buf) { > +#ifdef __SSSE3__ > + return _mm_lddqu_si128(buf); // same as loadu on...