I'm embarrassed to say my bug ended up being a user error. I was passing in pointers that were 16-byte aligned instead of 32. Explains why they worked fine for SSE but not AVX :) Sorry for the noise!

--
View this message in context: http://llvm.1065342.n5.nabble.com/Limit-loop-vectorizer-to-SSE-tp63175p63419.html
Sent from the LLVM - Dev mailing list archive at Nabble.com.
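For readers who hit the same symptom, here is a minimal sketch of a check that would flag this kind of pointer before it reaches vectorized code. It assumes the generated code uses aligned 32-byte loads and stores for AVX, which the thread does not show. The names `is_aligned`, `Buffer`, `sse_only_pointer` and `require_avx_alignment` are hypothetical and not part of LLVM.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

static_assert(sizeof(float) == 4, "the offsets below assume 4-byte floats");

// Hypothetical helper: true if p sits on a multiple of `alignment` bytes.
// `alignment` must be a power of two.
inline bool is_aligned(const void* p, std::size_t alignment) {
    return (reinterpret_cast<std::uintptr_t>(p) & (alignment - 1)) == 0;
}

// A buffer the compiler places on a 32-byte boundary.
struct alignas(32) Buffer {
    float data[16];
};

// Returns a pointer that is 16-byte aligned but not 32-byte aligned:
// 4 floats (16 bytes) past a 32-byte boundary. This is the kind of
// pointer that is fine for aligned SSE accesses but not for AVX ones.
inline const float* sse_only_pointer(const Buffer& b) {
    return b.data + 4;
}

// Debug-build guard to place in front of a call into AVX code that
// expects 32-byte aligned data (an assumption about the callee).
inline void require_avx_alignment(const float* p) {
    assert(is_aligned(p, 32) && "pointer must be 32-byte aligned for AVX");
}
```

With `Buffer b;`, calling `require_avx_alignment(b.data)` passes, while `require_avx_alignment(sse_only_pointer(b))` trips the assertion in a debug build instead of faulting inside the vectorized loop.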
Good catch! That was the problem in my case too. I totally overlooked the alignment requirement for AVX.

Frank

On 15/11/13 14:43, Josh Klontz wrote:
> I'm embarrassed to say my bug ended up being a user error. I was passing in
> pointers that were 16-byte aligned instead of 32. Explains why they worked
> fine for SSE but not AVX :) Sorry for the noise!
On 15 November 2013 20:05, Frank Winter <fwinter at jlab.org> wrote:
> Good catch! That was the problem in my case too. I totally
> overlooked the alignment requirement for AVX.

Wow! Two bugs closed without even looking at them! I must be a wizard! :D

Good work Josh, thanks for letting us know.

cheers,
--renato
On 15 November 2013 20:05, Frank Winter <fwinter at jlab.org> wrote:
> Good catch! That was the problem in my case too. I totally
> overlooked the alignment requirement for AVX.

I wonder if the validation mechanism shouldn't have caught it earlier... Do you guys run validate on the modules before JIT-ing?

--renato