Displaying 4 results from an estimated 4 matches for "movqda".
Did you mean:
movdqa
2017 Aug 18
2
[PATCH] fix alignment exceptions
We see the MOVQ instruction but this patch deliberately uses it rather than
MOVQDA (load 128-bits aligned). We were seeing that with the trace below,
the final invocation is not 128-bit aligned but MOVQDA insists on it (the
calling function was pitch_sse4_1.c:90, in the 4-way N - i >= 4 loop).
07-31 11:00:13.469 210 2540 D opus_sse1: RBE
celt_inner...
2017 Aug 18
1
[PATCH] fix alignment exceptions
...xmm2 = mem[0],zero
! movq (%esi,%edi,2), %xmm1 # xmm1 = mem[0],zero
addl $4, %edi
movdqa %xmm2, %xmm3
pmullw %xmm1, %xmm2
On Fri, Aug 18, 2017 at 12:11 PM, Felicia Lim <flim at google.com> wrote:
> We see the MOVQ instruction but this patch deliberately uses it rather
> than MOVQDA (load 128-bits aligned). We were seeing that with the trace
> below, the final invocation is not 128-bit aligned but MOVQDA insists on it
> (the calling function was pitch_sse4_1.c:90, in the 4-way N - i >= 4
> loop).
>
> 07-31 11:00:13.469 210 2540 D o...
2017 Aug 18
2
[PATCH] fix alignment exceptions
Hi,
Please find attached a patch to fix alignment exceptions. Without this
change, we were seeing occasional alignment faults when using this with
clang.
Thanks,
Felicia
2017 Aug 22
0
[PATCH] fix alignment exceptions
...ovq (%esi,%edi,2), %xmm1 # xmm1 = mem[0],zero
addl $4, %edi
movdqa %xmm2, %xmm3
pmullw %xmm1, %xmm2
On Fri, Aug 18, 2017 at 12:11 PM, Felicia Lim <flim at google.com> wrote:
We see the MOVQ instruction but this patch deliberately uses it rather than MOVQDA (load 128-bits aligned). We were seeing that with the trace below, the final invocation is not 128-bit aligned but MOVQDA insists on it (the calling function was pitch_sse4_1.c:90, in the 4-way N - i >= 4 loop).
07-31 11:00:13.469 210 2540 D opus_sse1: RBE celt_in...
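
For readers following the thread, here is a rough sketch of the distinction being discussed. It is not the attached patch and not the code in pitch_sse4_1.c. The function names (is_aligned_16, dot_movq) are made up for illustration. The idea: MOVDQA (spelled MOVQDA in the messages above) faults if its memory operand is not 16-byte aligned, while a 64-bit MOVQ load has no such requirement. In C with SSE2 intrinsics, _mm_loadl_epi64 is the usual way to get a MOVQ load, assuming the compiler does not widen it.

```c
#include <assert.h>
#include <stdint.h>
#if defined(__SSE2__)
#include <emmintrin.h>   /* SSE2 intrinsics */
#endif

/* Hypothetical helper: nonzero if p sits on a 16-byte (128-bit) boundary. */
static int is_aligned_16(const void *p)
{
    return ((uintptr_t)p & 15u) == 0;
}

/*
 * Hypothetical illustration: dot product of n 16-bit samples, four at a
 * time. Each step loads only 64 bits with _mm_loadl_epi64 (a MOVQ-style
 * load), so x and y may be at any address. Using _mm_load_si128 (MOVDQA)
 * here instead would fault whenever x + i or y + i is not 16-byte aligned.
 */
static int32_t dot_movq(const int16_t *x, const int16_t *y, int n)
{
    int32_t sum = 0;
    int i;

    assert(n % 4 == 0);  /* this sketch handles whole groups of four only */

#if defined(__SSE2__)
    {
        __m128i acc = _mm_setzero_si128();
        for (i = 0; i < n; i += 4) {
            /* Low 64 bits come from memory, high 64 bits are zeroed. */
            __m128i vx = _mm_loadl_epi64((const __m128i *)(x + i));
            __m128i vy = _mm_loadl_epi64((const __m128i *)(y + i));
            /* Multiply 16-bit lanes and add adjacent pairs into 32-bit lanes. */
            acc = _mm_add_epi32(acc, _mm_madd_epi16(vx, vy));
        }
        /* Only 32-bit lanes 0 and 1 can be nonzero; add them together. */
        sum = _mm_cvtsi128_si32(acc)
            + _mm_cvtsi128_si32(_mm_srli_si128(acc, 4));
    }
#else
    /* Scalar fallback for targets without SSE2. */
    for (i = 0; i < n; i++)
        sum += (int32_t)x[i] * y[i];
#endif
    return sum;
}
```

If all 128 bits are really needed from a pointer that may be misaligned, _mm_loadu_si128 (MOVDQU) is the usual alternative to _mm_load_si128.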