Displaying 6 results from an estimated 6 matches for "1a0h".
Did you mean:
10h
2008 Jul 14
5
[LLVMdev] Spilled variables using unaligned moves
...aligned moves even though the stack is
aligned. This seems like an optimization opportunity.
The attached replacement of fibonacci.cpp generates x86 code like this:
03A70010 push ebp
03A70011 mov ebp,esp
03A70013 and esp,0FFFFFFF0h
03A70019 sub esp,1A0h
...
03A7006C movups xmmword ptr [esp+180h],xmm7
...
03A70229 mulps xmm1,xmmword ptr [esp+180h]
...
03A70682 movups xmm0,xmmword ptr [esp+180h]
Note how stores and loads use unaligned moves while it could use aligned
moves. It's also interesting that the multiply doe...
2008 Jul 14
0
[LLVMdev] Spilled variables using unaligned moves
...gt; stack is aligned. This seems like an optimization opportunity.
>
> The attached replacement of fibonacci.cpp generates x86 code like
> this:
>
> 03A70010 push ebp
> 03A70011 mov ebp,esp
> 03A70013 and esp,0FFFFFFF0h
> 03A70019 sub esp,1A0h
> ...
> 03A7006C movups xmmword ptr [esp+180h],xmm7
> ...
> 03A70229 mulps xmm1,xmmword ptr [esp+180h]
> ...
> 03A70682 movups xmm0,xmmword ptr [esp+180h]
>
> Note how stores and loads use unaligned moves while it could use
> aligned moves. It’s also...
2008 Jul 14
0
[LLVMdev] Spilled variables using unaligned moves
...is this? Linux doesn't have a 16-byte aligned stack.
-Chris
>
> The attached replacement of fibonacci.cpp generates x86 code like
> this:
>
> 03A70010 push ebp
> 03A70011 mov ebp,esp
> 03A70013 and esp,0FFFFFFF0h
> 03A70019 sub esp,1A0h
> ...
> 03A7006C movups xmmword ptr [esp+180h],xmm7
> ...
> 03A70229 mulps xmm1,xmmword ptr [esp+180h]
> ...
> 03A70682 movups xmm0,xmmword ptr [esp+180h]
>
> Note how stores and loads use unaligned moves while it could use
> aligned moves. It’s also...
2008 Jul 15
1
[LLVMdev] Spilled variables using unaligned moves
...use aligned moves even though the stack is
aligned. This seems like an optimization opportunity.
The attached replacement of fibonacci.cpp generates x86 code like this:
03A70010 push ebp
03A70011 mov ebp,esp
03A70013 and esp,0FFFFFFF0h
03A70019 sub esp,1A0h
...
03A7006C movups xmmword ptr [esp+180h],xmm7
...
03A70229 mulps xmm1,xmmword ptr [esp+180h]
...
03A70682 movups xmm0,xmmword ptr [esp+180h]
Note how stores and loads use unaligned moves while it could use aligned
moves. It's also interesting that the multiply doe...
2013 Jul 19
0
[LLVMdev] llvm.x86.sse2.sqrt.pd not using sqrtpd, calling a function that modifies ECX
...B068D movapd xmm0,xmmword ptr [eax+330130h]
002B0695 movaps xmmword ptr [esp+1C0h],xmm0
002B069D movapd xmm0,xmmword ptr [eax+330140h]
002B06A5 movaps xmmword ptr [esp+1B0h],xmm0
002B06AD movapd xmm0,xmmword ptr [eax+330150h]
002B06B5 movaps xmmword ptr [esp+1A0h],xmm0
002B06BD movaps xmm0,xmmword ptr [esp+230h]
002B06C5 mulpd xmm0,xmm5
002B06C9 movaps xmm5,xmmword ptr [esp+240h]
002B06D1 mulpd xmm5,xmm4
002B06D5 movaps xmm4,xmmword ptr [esp+250h]
002B06DD mulpd xmm4,xmm1
002B06E1 addpd xmm4,xmm5
002B06E...
2013 Jul 19
4
[LLVMdev] SIMD instructions and memory alignment on X86
Hmm, I'm not able to get those .ll files to compile if I disable SSE and I
end up with SSE instructions(including sqrtpd) if I don't disable it.
On Thu, Jul 18, 2013 at 10:53 PM, Peter Newman <peter at uformia.com> wrote:
> Is there something specifically required to enable SSE? If it's not
> detected as available (based from the target triple?) then I don't think