Displaying 3 results from an estimated 3 matches for "is36_d".
Did you mean:
is26_d
2015 Jul 24
1
[LLVMdev] SIMD for sdiv <2 x i64>
This snippet of IR is interesting:
%sub.ptr.div.iS37_D = sdiv <2 x i64> %sub.ptr.sub.iS36_D, <i64 24,
i64 24>
%cmp10S38_D = icmp ugt <2 x i64> %sub.ptr.div.iS37_D,
%splatInsMapS1_D.splat
%zextS39_D = sext <2 x i1> %cmp10S38_D to <2 x i64>
%BCS39_D = bitcast <2 x i64> %zextS39_D to i128
%mskS39_D = icmp ne i128 %BCS39_D, 0
br i1 %mskS39_D, lab...
2015 Jul 24
0
[LLVMdev] SIMD for sdiv <2 x i64>
...voke.cont
invoke.cont: ; preds =
%if.then.i.i.i.i.i.i, %if.then4
%sub.ptr.rhs.cast.i = ptrtoint %class.Vector* %__position.coerce to i64
%sub.ptr.rhs.cast.iS35_D = ptrtoint <2 x %class.Vector*>
%splatInsMapS35_D.splat to <2 x i64>
%sub.ptr.sub.iS36_D = sub <2 x i64> %sub.ptr.rhs.castS8_D,
%sub.ptr.rhs.cast.iS35_D
%sub.ptr.div.iS37_D = sdiv <2 x i64> %sub.ptr.sub.iS36_D, <i64 24, i64 24>
%extractS196_D = extractelement <2 x i64> %sub.ptr.div.iS37_D, i32 1
%cmp10S38_D = icmp ugt <2 x i64> %sub.ptr.div.iS37_D,
%...
2015 Jul 24
2
[LLVMdev] SIMD for sdiv <2 x i64>
On 07/24/2015 03:42 AM, Benjamin Kramer wrote:
>> On 24.07.2015, at 08:06, zhi chen <zchenhn at gmail.com> wrote:
>>
>> It seems that that it's hard to vectorize int64 in LLVM. For example, LLVM 3.4 generates very complicated code for the following IR. I am running on a Haswell processor. Is it because there is no alternative AVX/2 instructions for int64? The same thing