Displaying 2 results from an estimated 2 matches for "_intfoo".
2010 May 11
[LLVMdev] How does SSEDomainFix work?
...2> @intfoo(<4 x i32> %x, <4 x i32> %y, <4 x i32> %z)
nounwind readnone {
entry:
%0 = add <4 x i32> %x, %z
%not = xor <4 x i32> %z, <i32 -1, i32 -1, i32 -1, i32 -1>
%1 = and <4 x i32> %not, %y
%2 = xor <4 x i32> %0, %1
ret <4 x i32> %2
}
_intfoo:
movdqa %xmm0, %xmm3
paddd %xmm2, %xmm3
pandn %xmm1, %xmm2
movdqa %xmm2, %xmm0
pxor %xmm3, %xmm0
ret
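All the instructions moved to the integer domain because the add forced them there: paddd has no floating-point-domain equivalent, so the moves and logic ops around it were switched to movdqa, pandn and pxor.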
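To make the data flow easier to follow, here is a minimal Python sketch of what the IR above computes per lane, assuming each lane is an unsigned 32-bit value. The function name `intfoo` is borrowed from the IR only for readability, and `MASK` is a helper introduced for this illustration. Nothing here is part of LLVM. The comments map each step to the instruction it became in the assembly.

```python
MASK = 0xFFFFFFFF  # assumption: lanes are unsigned 32-bit values, like i32


def intfoo(x, y, z):
    """Lane-wise model of the IR: (x + z) ^ (~z & y) on a 4-element vector."""
    result = []
    for xi, yi, zi in zip(x, y, z):
        added = (xi + zi) & MASK      # %0 = add        -> paddd (integer only)
        andn = (~zi & MASK) & yi      # %not, %1 = and  -> pandn
        result.append(added ^ andn)   # %2 = xor        -> pxor
    return result


if __name__ == "__main__":
    print(intfoo([1, 2, 3, 4], [0xF0] * 4, [0x0F, 0, 0xFFFFFFFF, 1]))
```

Only the first step is tied to the integer domain. The and-not and xor steps are bitwise and could be encoded in either domain, which is why the add decides where the whole chain ends up.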
> Please tell me if something would be wrong for me.
You should measure if LLVM's code is actually slower th...
2010 May 11
[LLVMdev] How does SSEDomainFix work?
Hello. This is my first post.
I have tried the SSE execution domain fixup pass,
but I am not able to see any improvement.
I expected the example below to use MOVDQA, PAND, etc.
(On Nehalem, ANDPS is much slower than PAND.)
Please tell me if something would be wrong for me.
Thank you.
Takumi
Host: i386-mingw32
Build: trunk at 103373
foo.ll:
define <4 x i32> @foo(<4 x i32> %x,