search for: _m128

Displaying 7 results from an estimated 7 matches for "_m128".

Did you mean: __m128
2006 Jan 06
2
Re: sigsegv in _mm_load_ups (linux/gcc 3.x)
...on win32), and the problem > was that the stack was misaligned when entering the function, so the temp > registers weren't at 16-byte boundries. That's a possibility. It's easy to check by printing the address of the variables. I know that gcc 3.3 had some alignment issues with _m128 that were supposed to be fixed in version 3.4 and above. Jean-Marc
2014 Dec 15
2
[LLVMdev] ABI incompatability when passing vector parameters on 32-bit x86
Hi all, Recently, Reid Kleckner found an ABI incompatibility between clang and GCC in the way vector parameters are passed on 32-bit x86. (This is documented in PR21510.) Specifically, GCC uses XMM0-XMM2 to pass the first 3 __m128 parameters, and the rest are passed on the stack. Clang passes an additional parameter by register, using XMM0-XMM3. The same applies to __m256 with YMM0-2 vs. YMM0-3. In theory, it would apply to __m512 as well, but currently clang doesn't support passing __m512 in x86 mode at all. ICC has the...
2004 Aug 06
2
Notes on 1.1.4 Windows. Testing of SSE Intrinics Code and others
Jean-Marc, Good catch on the debug mode. After compiling the same code in release mode it does appear to be using all the registers correctly. Give us a few days to integrate our run-time flags into 1.1.4 and I will let you know how are testing turns out. Aron Rosenberg SightSpeed At 08:54 PM 1/21/2004, you wrote: > > 1. Compile Error with regular mode (FIXED_POINT undefined)
2004 Aug 06
0
Notes on 1.1.4 Windows. Testing of SSE Intrinics Code and others
...rrectly. Give > us a few days to integrate our run-time flags into 1.1.4 and I will let you > know how are testing turns out. Just a note. With the last optimization I did in cb_search.c, it's no longer easy to have a run-time SSE flag. The reason is that there is now data exchanged in _m128 format between some functions (even the layout of the values changes if you turn SSE on). Jean-Marc -- Jean-Marc Valin, M.Sc.A., ing. jr. LABORIUS (http://www.gel.usherb.ca/laborius) Université de Sherbrooke, Québec, Canada -------------- next part -------------- A non-text attachment...
2006 Jan 05
2
Re: sigsegv in _mm_load_ups (linux/gcc 3.x)
That's definitely strange and I've never encountered that. Normally, the only way for _mm_load_ups to generate a segfault is for the input to be invalid memory, in which case the C version should crash too. I suspect the compiler (or something else) may be hiding the real problem. Can you get a debugger and see exactly what assembly statement is causing the crash and what the operands are?
2006 Jan 06
0
Re: sigsegv in _mm_load_ups (linux/gcc 3.x)
...roblem >> was that the stack was misaligned when entering the function, so the temp >> registers weren't at 16-byte boundries. > > That's a possibility. It's easy to check by printing the address of the > variables. I know that gcc 3.3 had some alignment issues with _m128 that > were supposed to be fixed in version 3.4 and above. I just checked it in the debugger, and this was with gcc 3.4.4 (mingw)... And the addresses were not properly aligned :( From a bit of googling, this seems to be a thread problem, as the gcc just maintains 16-byte alignment of the st...
2004 Aug 06
2
Notes on 1.1.4 Windows. Testing of SSE Intrinics Code and others
...ew days to integrate our run-time flags into 1.1.4 and I will let > you > > know how are testing turns out. > >Just a note. With the last optimization I did in cb_search.c, it's no >longer easy to have a run-time SSE flag. The reason is that there is now >data exchanged in _m128 format between some functions (even the layout >of the values changes if you turn SSE on). > > Jean-Marc > >-- >Jean-Marc Valin, M.Sc.A., ing. jr. >LABORIUS (http://www.gel.usherb.ca/laborius) >Université de Sherbrooke, Québec, Canada <p>--- >8 ---- List ar...