Displaying 20 results from an estimated 31 matches for "immintrin".
Did you mean:
xmmintrin
2013 Jul 29
3
[PATCH 1/2] xv: fix last pixel for big-endian machines in YV12 -> NV12 conversion
Signed-off-by: Ilia Mirkin <imirkin at alum.mit.edu>
---
src/nouveau_xv.c | 5 ++++-
1 file changed, 4 insertions(+), 1 deletion(-)
diff --git a/src/nouveau_xv.c b/src/nouveau_xv.c
index 8eafcf0..567e30c 100644
--- a/src/nouveau_xv.c
+++ b/src/nouveau_xv.c
@@ -552,8 +552,11 @@ NVCopyNV12ColorPlanes(unsigned char *src1, unsigned char *src2,
if (e) {
unsigned short *vud = (unsigned
2019 Feb 05
2
debugging installation problem
...have a build of a recent (Monday) llvm/clang which I have installed in the expected way in my environment but I am getting failures like this;
In file included from <some directory>/lib/clang/stable/include/x86intrin.h:29:
In file included from <some directory>/lib/clang/stable/include/immintrin.h:118:
<some directory>/lib/clang/stable/include/avx512cdintrin.h:37:20: error: use of undeclared identifier '__builtin_ia32_vpconflictdi_512'
return (__m512i) __builtin_ia32_vpconflictdi_512 ((__v8di) __A);
^
<some directory>/lib/clang/stable/include/avx512...
2019 Feb 05
2
debugging installation problem
...have installed in the expected way in my environment but I am getting failures like this;
>
>
>
> In file included from <some directory>/lib/clang/stable/include/x86intrin.h:29:
>
> In file included from <some directory>/lib/clang/stable/include/immintrin.h:118:
>
> <some directory>/lib/clang/stable/include/avx512cdintrin.h:37:20: error: use of undeclared identifier '__builtin_ia32_vpconflictdi_512'
>
> return (__m512i) __builtin_ia32_vpconflictdi_512 ((__v8di) __A);
>
> ^...
2013 Jul 31
0
[PATCH 2/2] xv: speed up YV12 -> NV12 conversion using SSE2 if available
On 2013-07-31 19:18 +0200, Ilia Mirkin wrote:
> On Wed, Jul 31, 2013 at 1:16 PM, Sven Joachim <svenjoac at gmx.de> wrote:
>>
>> Unfortunately, immintrin.h is not available on most architectures,
>> leading to build failures as can be seen on
>> https://buildd.debian.org/status/package.php?p=xserver-xorg-video-nouveau.
>
> Sorry :( I thought that immintrin.h would be available everywhere and
> just end up empty since none of the...
2014 Sep 30
2
[LLVMdev] size_t?
...ded from C:\Program Files
(x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\intrin.h:34:
1> In file included from C:\Program Files
(x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\x86intrin.h:29:
1> In file included from C:\Program Files
(x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\immintrin.h:28:
1>C:\Program Files
(x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\mmintrin.h(52,40):
error : cannot initialize a parameter of type
'__attribute__((__vector_size__(2 * sizeof(int)))) int' (vector of 2
'int' values) with an rvalue of type '__v2si' (aka 'int...
2017 Apr 08
2
How to insert vector type input parameter for function in C/C++ API?
...ction by C/C++ API.
As you see below, the AVX2 data type is __m256i which is vector type. How
can I indicate vector type (function return type, input parameters) for
IRBuiler by C/C++ APIs?
I don't see any example online and please let me know if anybody has
examples.
SOURCE:
#include "immintrin.h"
__m256i sum(__m256i a, __m256i b) {
return a+b;
}
TARGET:
michael at michael-Precision-Tower-3420:~/Year_2017/work_DEMO$ cat avx2_add2.ll
; ModuleID = 'avx2_add2.c'
target datalayout = "e-m:w-i64:64-f80:128-n8:16:32:64-S128"
target triple = "x86_64-unknown-window...
2017 Apr 08
3
How to insert vector type input parameter for function in C/C++ API?
...parameters) for
>>>> IRBuiler by C/C++ APIs?
>>>>
>>>> I don't see any example online and please let me know if anybody has
>>>> examples.
>>>>
>>>>
>>>>
>>>> SOURCE:
>>>> #include "immintrin.h"
>>>> __m256i sum(__m256i a, __m256i b) {
>>>> return a+b;
>>>> }
>>>>
>>>> TARGET:
>>>> michael at michael-Precision-Tower-3420:~/Year_2017/work_DEMO$ cat
>>>> avx2_add2.ll
>>>> ; ModuleID =...
2016 May 31
2
[PATCH 1/2] Modify autoconf tests for intrinsics to stop clang from optimizing them away.
..._X86_MAY_HAVE_SSE4_1" = x"1" && test x"$OPUS_X86_PRESUME_SSE4_1" != x"1"],
@@ -576,10 +585,13 @@ AS_IF([test x"$enable_intrinsics" = x"yes"],[
[OPUS_X86_MAY_HAVE_AVX],
[OPUS_X86_PRESUME_AVX],
[[#include <immintrin.h>
+ #include <time.h>
]],
[[
- static __m256 mtest;
- mtest = _mm256_setzero_ps();
+ __m256 mtest;
+ mtest = _mm256_set1_ps((float)time(NULL));
+ mtest = _mm256_addsub_ps(mtest, mtest);
+ r...
2017 Apr 08
2
How to insert vector type input parameter for function in C/C++ API?
...pe. How
>> can I indicate vector type (function return type, input parameters) for
>> IRBuiler by C/C++ APIs?
>>
>> I don't see any example online and please let me know if anybody has
>> examples.
>>
>>
>>
>> SOURCE:
>> #include "immintrin.h"
>> __m256i sum(__m256i a, __m256i b) {
>> return a+b;
>> }
>>
>> TARGET:
>> michael at michael-Precision-Tower-3420:~/Year_2017/work_DEMO$ cat
>> avx2_add2.ll
>> ; ModuleID = 'avx2_add2.c'
>> target datalayout = "e-m:w-i...
2014 Sep 30
2
[LLVMdev] size_t?
...LVM\msbuild-bin\..\lib\clang\3.6.0\include\intrin.h:34:
> 1> In file included from C:\Program Files
> (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\x86intrin.h:29:
> 1> In file included from C:\Program Files
> (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\immintrin.h:28:
> 1>C:\Program Files
> (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\mmintrin.h(52,40):
> error : cannot initialize a parameter of type
> '__attribute__((__vector_size__(2 * sizeof(int)))) int' (vector of
> 2 'int' values) with an rval...
2014 Oct 01
2
[LLVMdev] size_t?
...LVM\msbuild-bin\..\lib\clang\3.6.0\include\intrin.h:34:
>> 1> In file included from C:\Program Files
>> (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\x86intrin.h:29:
>> 1> In file included from C:\Program Files
>> (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\immintrin.h:28:
>> 1>C:\Program Files
>> (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\mmintrin.h(52,40): error
>> : cannot initialize a parameter of type '__attribute__((__vector_size__(2 *
>> sizeof(int)))) int' (vector of 2 'int' values) with an rvalue of typ...
2013 Jul 29
0
[PATCH 2/2] xv: speed up YV12 -> NV12 conversion using SSE2 if available
...ouveau_xv.c | 33 ++++++++++++++++++++++++++-------
1 file changed, 26 insertions(+), 7 deletions(-)
diff --git a/src/nouveau_xv.c b/src/nouveau_xv.c
index 567e30c..5569b7c 100644
--- a/src/nouveau_xv.c
+++ b/src/nouveau_xv.c
@@ -25,6 +25,8 @@
#include "config.h"
#endif
+#include <immintrin.h>
+
#include "xf86xv.h"
#include <X11/extensions/Xv.h>
#include "exa.h"
@@ -532,30 +534,47 @@ NVCopyNV12ColorPlanes(unsigned char *src1, unsigned char *src2,
w >>= 1;
h >>= 1;
+#ifdef __SSE2__
+ l = w >> 3;
+ e = w & 7;
+#else
l = w >...
2013 Nov 07
0
[ANNOUNCE] xf86-video-nouveau 1.0.10
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Ben Skeggs (2):
recognise GK208 as a Kepler board
bump to 1.0.10 for release
Dave Airlie (3):
nouveau: fix build on ppc by wrapping immintrin include.
nouveau: add gpu identifier to connector names for secondary gpus
nouveau: fix build against older servers.
git tag: xf86-video-nouveau-1.0.10
http://xorg.freedesktop.org/archive/individual/driver/xf86-video-nouveau-1.0.10.tar.bz2
MD5: 44fd65897630fdd19c086133cc1bd679 xf86-...
2013 Nov 07
0
[ANNOUNCE] xf86-video-nouveau 1.0.10
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Ben Skeggs (2):
recognise GK208 as a Kepler board
bump to 1.0.10 for release
Dave Airlie (3):
nouveau: fix build on ppc by wrapping immintrin include.
nouveau: add gpu identifier to connector names for secondary gpus
nouveau: fix build against older servers.
git tag: xf86-video-nouveau-1.0.10
http://xorg.freedesktop.org/archive/individual/driver/xf86-video-nouveau-1.0.10.tar.bz2
MD5: 44fd65897630fdd19c086133cc1bd679 xf86-...
2014 Oct 01
2
[LLVMdev] size_t?
.....\lib\clang\3.6.0\include\intrin.h:34:
>>> 1> In file included from C:\Program Files
>>> (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\x86intrin.h:29:
>>> 1> In file included from C:\Program Files
>>> (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\immintrin.h:28:
>>> 1>C:\Program Files
>>> (x86)\LLVM\msbuild-bin\..\lib\clang\3.6.0\include\mmintrin.h(52,40): error
>>> : cannot initialize a parameter of type '__attribute__((__vector_size__(2 *
>>> sizeof(int)))) int' (vector of 2 'int' values) with...
2016 Jul 06
2
GCC prerequisites for building LLVM head?
I've no idea, but without it, the 'std::to_string' definitions are not enabled!
-----Original Message-----
From: Renato Golin [mailto:renato.golin at linaro.org]
Sent: 06 July 2016 19:35
To: Martin.ORiordan at movidius.com
Cc: LLVM Developers <llvm-dev at lists.llvm.org>
Subject: Re: [llvm-dev] GCC prerequisites for building LLVM head?
On 6 July 2016 at 19:02, Martin J.
2017 Jun 14
2
Default FPENV state
Hi,
We are interesting in expanding some vector operations directly in the
IR form as constants https://reviews.llvm.org/D33406,
for example: _mm256_cmp_ps("any input", "any input", _CMP_TRUE_UQ)
should produce -1, -1, -1, ... vector, but for some values for example
"1.00 -nan" if FPU exceptions were enabled this operation triggers the
exception. Here is the question:
2020 May 18
6
[PATCH] SSE2/SSSE3 optimized version of get_checksum1() for x86-64
...n the
+ * pure C version anyway.
+ *
+ * GCC automatically enables SSE2 support on x86-64 builds. The SSSE3 code
+ * path must be enabled manually: ./configure CFLAGS="-mssse3 -O2"
+ */
+
+#ifdef __x86_64__
+#ifdef __SSE2__
+
+#include "rsync.h"
+
+#ifdef __SSSE3__
+#include <immintrin.h>
+#else
+#include <tmmintrin.h>
+#endif
+
+/* Compatibility functions to let our SSSE3 algorithm run on SSE2 */
+
+static inline __m128i sse_load_si128(void const* buf) {
+#ifdef __SSSE3__
+ return _mm_lddqu_si128(buf); // same as loadu on all but the
oldest SSSE3 CPUs
+#else
+ re...
2012 Jul 11
0
[LLVMdev] Compiling llvm and Clang on Linux
It's undocumented FAQ, if you are using RHEL5 (or clone).
- install gcc44-c++
- Build with CC=gcc44 CXX=g++44
- You may need "CC=clang -std=gnu89" to use clang with its glibc.
Have fun!
ps. AFAIK, clang can be built more easily on centos6.
...Takumi
2012/7/11 Sitvanit Ruah <RUAH at il.ibm.com>:
>
> Hello all,
> I am new to this mailing list so I hope this is
2020 May 18
0
[PATCH] SSE2/SSSE3 optimized version of get_checksum1() for x86-64
...atically enables SSE2 support on x86-64 builds. The SSSE3 code
> + * path must be enabled manually: ./configure CFLAGS="-mssse3 -O2"
> + */
> +
> +#ifdef __x86_64__
> +#ifdef __SSE2__
> +
> +#include "rsync.h"
> +
> +#ifdef __SSSE3__
> +#include <immintrin.h>
> +#else
> +#include <tmmintrin.h>
> +#endif
> +
> +/* Compatibility functions to let our SSSE3 algorithm run on SSE2 */
> +
> +static inline __m128i sse_load_si128(void const* buf) {
> +#ifdef __SSSE3__
> + return _mm_lddqu_si128(buf); // same as loadu on...