2013 Nov 15
[LLVMdev] Limit loop vectorizer to SSE
I'm embarrassed to say my bug ended up being a user error. I was passing in
pointers that were 16-byte aligned instead of 32. Explains why they worked
fine for SSE but not AVX :) Sorry for the noise!
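
For anyone hitting the same thing: aligned 128-bit SSE loads and stores need 16-byte-aligned addresses, while aligned 256-bit AVX loads and stores need 32-byte-aligned ones. The thread doesn't show how the buffers were allocated, so the following is only a minimal sketch of one way to check and obtain the alignment, assuming a C11 toolchain with aligned_alloc. The helper names is_aligned and alloc_floats_aligned are made up for illustration and are not part of LLVM.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical helper: returns nonzero if p is aligned to `alignment` bytes.
 * `alignment` must be a nonzero power of two (16 for SSE, 32 for AVX). */
static int is_aligned(const void *p, size_t alignment) {
    assert(alignment != 0 && (alignment & (alignment - 1)) == 0);
    return ((uintptr_t)p & (alignment - 1)) == 0;
}

/* Hypothetical helper: allocates room for `count` floats at the requested
 * alignment. The byte size is rounded up to a multiple of `alignment`,
 * which C11 aligned_alloc expects. Release the buffer with free(). */
static float *alloc_floats_aligned(size_t count, size_t alignment) {
    size_t bytes = count * sizeof(float);
    size_t rounded = (bytes + alignment - 1) / alignment * alignment;
    return aligned_alloc(alignment, rounded);
}
```

With these, alloc_floats_aligned(n, 32) gives a pointer that is fine for both SSE and AVX, whereas that pointer plus 4 floats (16 bytes) would still pass a 16-byte check but fail a 32-byte one, which matches the "works for SSE but not AVX" symptom.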
--
View this message in context: http://llvm.1065342.n5.nabble.com/Limit-loop-vectorizer-to-SSE-tp63175p63419.html
Sent from the LLVM - Dev mailing list archive at Nabble.com.
2013 Nov 15
[LLVMdev] Limit loop vectorizer to SSE
Good catch! That was the problem in my case too. I totally
overlooked the alignment requirement for AVX.
Frank
On 15/11/13 14:43, Josh Klontz wrote:
> I'm embarrassed to say my bug ended up being a user error. I was passing in
> pointers that were 16-byte aligned instead of 32. Explains why they worked
> fine for SSE but not AVX :) Sorry for the noise!