Displaying 1 result from an estimated 1 matches for "broadcast_16".
2010 Oct 28
2
[LLVMdev] llvm 2.8 fixes?
...d generate inefficient code (couldn't
really come up with anything which actually generated correct code which
didn't require at least a pshufb).
For reference, this is the bug in question:
http://llvm.org/bugs/show_bug.cgi?id=8381
Here's also a short example:
define <8 x i16> @broadcast_16(<8 x i16> %var1, <8 x i16> %var2) {
entry:
%0 = shufflevector <8 x i16> %var2, <8 x i16> undef, <8 x i32>
zeroinitializer
ret <8 x i16> %0
}
Which miscompiles badly to
punpcklwd %xmm0, %xmm1
pshufd $0, %xmm1, %xmm0
ret
(This happens for all similar...