Displaying 5 results from an estimated 5 matches for "pandn".
Did you mean:
panda
2015 Jul 27
3
[LLVMdev] i1* function argument on x86-64
I am running into a problem with 'i1*' as a function's argument which
seems to have appeared since I switched to LLVM 3.6 (but can have other
source, of course). If I look at the assembler that the MCJIT generates
for an x86-64 target I see that the array 'i1*' is taken as a sequence
of 1 bit wide elements. (I guess that's correct). However, I used to
call the function
2006 May 25
2
Compilation issues with s390
Hi all,
I'm trying to compile asterisk on the mainframe (s390 / s390x) and I am
running into issues. I was wondering if somebody could give a hand?
I'm thinking that I should be able to do this. I have noticed that Debian
even has binary RPM's out for Asterisk now. I'm trying to do this on SuSE
SLES8 (with the 2.4 kernel).
What I see is, an issue that arch=s390 isn't
2010 May 11
0
[LLVMdev] How does SSEDomainFix work?
...y, <4 x i32> %z)
nounwind readnone {
entry:
%0 = add <4 x i32> %x, %z
%not = xor <4 x i32> %z, <i32 -1, i32 -1, i32 -1, i32 -1>
%1 = and <4 x i32> %not, %y
%2 = xor <4 x i32> %0, %1
ret <4 x i32> %2
}
_intfoo:
movdqa %xmm0, %xmm3
paddd %xmm2, %xmm3
pandn %xmm1, %xmm2
movdqa %xmm2, %xmm0
pxor %xmm3, %xmm0
ret
All the instructions moved to the int domain because the add forced them.
> Please tell me if something would be wrong for me.
You should measure if LLVM's code is actually slower that the code you want. If it is, I would like to h...
2010 May 11
2
[LLVMdev] How does SSEDomainFix work?
Hello. This is my 1st post.
I have tried SSE execution domain fixup pass.
But I am not able to see any improvements.
I expect for the example below to use MOVDQA, PAND &c.
(On nehalem, ANDPS is extremely slower than PAND)
Please tell me if something would be wrong for me.
Thank you.
Takumi
Host: i386-mingw32
Build: trunk at 103373
foo.ll:
define <4 x i32> @foo(<4 x i32> %x,
2010 Aug 02
0
[LLVMdev] Register Allocation ERROR! Ran out of registers during register allocation!
...movdqa %xmm0, %xmm1
pabsw %xmm0, %xmm0
psubusw %xmm6, %xmm0
pmulhw %xmm5, %xmm0
por %xmm0, %xmm4
psignw %xmm1, %xmm0
movdqa %xmm0, ($5, %eax)
pcmpeqw %xmm7, %xmm0
movdqa ($4, %eax), %xmm1
movdqa %xmm7, ($1, %eax)
pandn %xmm1, %xmm0
pmaxsw %xmm0, %xmm3
add $$16, %eax
js 1b
movhlps %xmm3, %xmm0
pmaxsw %xmm0, %xmm3
pshuflw $$0x0E, %xmm3, %xmm0
pmaxsw %xmm0, %xmm3
pshuflw $$0x01, %xmm3, %xmm0
pmaxsw %xmm0, %xmm3
movd %xmm3, %eax...