search for: movqda

Displaying 4 results from an estimated 4 matches for "movqda".

Did you mean: movdqa
2017 Aug 18
2
[PATCH] fix alignment exceptions
We see the MOVQ instruction but this patch deliberately uses it rather than MOVQDA (load 128-bits aligned). We were seeing that with the trace below, the final invocation is not 128-bit aligned but MOVQDA insists on it (the calling function was pitch_sse4_1.c:90, in the 4-way N - i >= 4 loop). 07-31 11:00:13.469 210 2540 D opus_sse1: RBE celt_inner...
2017 Aug 18
1
[PATCH] fix alignment exceptions
...xmm2 = mem[0],zero ! movq (%esi,%edi,2), %xmm1 # xmm1 = mem[0],zero addl $4, %edi movdqa %xmm2, %xmm3 pmullw %xmm1, %xmm2 On Fri, Aug 18, 2017 at 12:11 PM, Felicia Lim <flim at google.com> wrote: > We see the MOVQ instruction but this patch deliberately uses it rather > than MOVQDA (load 128-bits aligned). We were seeing that with the trace > below, the final invocation is not 128-bit aligned but MOVQDA insists on it > (the calling function was pitch_sse4_1.c:90, in the 4-way N - i >= 4 > loop). > > 07-31 11:00:13.469 210 2540 D o...
2017 Aug 18
2
[PATCH] fix alignment exceptions
Hi, Please find attached a patch to fix alignment exceptions. Without this change, we were seeing occasional alignment faults when using this with clang. Thanks, Felicia -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.xiph.org/pipermail/opus/attachments/20170818/e0d6bb06/attachment.html> -------------- next part -------------- A non-text
2017 Aug 22
0
[PATCH] fix alignment exceptions
...ovq (%esi,%edi,2), %xmm1 # xmm1 = mem[0],zero addl $4, %edi movdqa %xmm2, %xmm3 pmullw %xmm1, %xmm2 On Fri, Aug 18, 2017 at 12:11 PM, Felicia Lim <flim at google.com> wrote: We see the MOVQ instruction but this patch deliberately uses it rather than MOVQDA (load 128-bits aligned). We were seeing that with the trace below, the final invocation is not 128-bit aligned but MOVQDA insists on it (the calling function was pitch_sse4_1.c:90, in the 4-way N - i >= 4 loop). 07-31 11:00:13.469 210 2540 D opus_sse1: RBE celt_in...
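
The alignment issue discussed in these messages can be sketched in C with SSE2 intrinsics. This is only an illustration, not the patch attached to the thread: the helper names is_aligned16, load128_checked and dot4_s16 are hypothetical. The aligned 128-bit load (MOVDQA, written "MOVQDA" in the messages) corresponds to _mm_load_si128, which needs a 16-byte aligned address, while _mm_loadl_epi64 reads only 64 bits, has no alignment requirement, and is typically compiled to MOVQ.

```c
#include <assert.h>
#include <emmintrin.h>  /* SSE2 intrinsics */
#include <stdint.h>

/* Returns 1 if p lies on a 16-byte boundary, as an aligned 128-bit load requires. */
static int is_aligned16(const void *p)
{
    return ((uintptr_t)p & 15u) == 0;
}

/* Hypothetical wrapper: aligned 128-bit load, guarded by an assertion
   so that a misaligned pointer is caught before the load can fault. */
static __m128i load128_checked(const void *p)
{
    assert(is_aligned16(p));
    return _mm_load_si128((const __m128i *)p);
}

/* Hypothetical 4-way step: sum of products of four 16-bit samples.
   _mm_loadl_epi64 reads 64 bits, zeroes the upper half of the register,
   and accepts any address. */
static int32_t dot4_s16(const int16_t *x, const int16_t *y)
{
    __m128i vx = _mm_loadl_epi64((const __m128i *)x);
    __m128i vy = _mm_loadl_epi64((const __m128i *)y);
    /* Multiply 16-bit lanes and add adjacent pairs into 32-bit lanes;
       only the two low lanes are non-zero here. */
    __m128i prod = _mm_madd_epi16(vx, vy);
    return _mm_cvtsi128_si32(prod)
         + _mm_cvtsi128_si32(_mm_srli_si128(prod, 4));
}
```

Calling dot4_s16 on a pointer that is offset by one int16_t from an aligned buffer is fine, whereas passing the same pointer to load128_checked would trip the assertion.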