search for: e9f3454f

Displaying 2 results from an estimated 2 matches for "e9f3454f".

2013 Nov 15
3
[LLVMdev] Limit loop vectorizer to SSE
I'm embarrassed to say my bug ended up being a user error. I was passing in pointers that were 16-byte aligned instead of 32. Explains why they worked fine for SSE but not AVX :) Sorry for the noise! -- View this message in context: http://llvm.1065342.n5.nabble.com/Limit-loop-vectorizer-to-SSE-tp63175p63419.html Sent from the LLVM - Dev mailing list archive at Nabble.com.
2013 Nov 15
0
[LLVMdev] Limit loop vectorizer to SSE
Good catch! That was the problem in my case too. I totally overlooked the alignment requirement for AVX. Frank On 15/11/13 14:43, Josh Klontz wrote: > I'm embarrassed to say my bug ended up being a user error. I was passing in > pointers that were 16-byte aligned instead of 32. Explains why they worked > fine for SSE but not AVX :) Sorry for the noise! > > > > -- >