Displaying 7 results from an estimated 7 matches for "vecinit5".
2012 Feb 28
1
[LLVMdev] How to vectorize a vector type cast?
...= insertelement <4 x float> undef, float %conv, i32 0
%2 = extractelement <4 x i8> %0, i32 1
%conv2 = uitofp i8 %2 to float
%vecinit3 = insertelement <4 x float> %vecinit, float %conv2, i32 1
%3 = extractelement <4 x i8> %0, i32 2
%conv4 = uitofp i8 %3 to float
%vecinit5 = insertelement <4 x float> %vecinit3, float %conv4, i32 2
%4 = extractelement <4 x i8> %0, i32 3
%conv6 = uitofp i8 %4 to float
%vecinit7 = insertelement <4 x float> %vecinit5, float %conv6, i32 3
ret <4 x float> %vecinit7
Which does the cast as a sequence of scala...
2017 Sep 13
2
RFC phantom memory intrinsic
...s double, double* %ptr, i64 %add
%1 = load double, double* %arrayidx1, align 8
%vecinit2 = insertelement <4 x double> %vecinit, double %1, i32 1
%add3 = add i64 %i, 2
%arrayidx4 = getelementptr inbounds double, double* %ptr, i64 %add3
%2 = load double, double* %arrayidx4, align 8
%vecinit5 = insertelement <4 x double> %vecinit2, double %2, i32 2
%add6 = add i64 %i, 3
%arrayidx7 = getelementptr inbounds double, double* %ptr, i64 %add6
%3 = load double, double* %arrayidx7, align 8
%vecinit8 = insertelement <4 x double> %vecinit5, double %3, i32 3
%shuffle = shuffl...
2017 Sep 13
2
RFC phantom memory intrinsic
...e, double* %arrayidx1, align 8
>> %vecinit2 = insertelement <4 x double> %vecinit, double %1, i32 1
>> %add3 = add i64 %i, 2
>> %arrayidx4 = getelementptr inbounds double, double* %ptr, i64 %add3
>> %2 = load double, double* %arrayidx4, align 8
>> %vecinit5 = insertelement <4 x double> %vecinit2, double %2, i32 2
>> %add6 = add i64 %i, 3
>> %arrayidx7 = getelementptr inbounds double, double* %ptr, i64 %add6
>> %3 = load double, double* %arrayidx7, align 8
>> %vecinit8 = insertelement <4 x double> %vecini...
2017 Sep 26
0
RFC phantom memory intrinsic
...ign 8
>>> %vecinit2 = insertelement <4 x double> %vecinit, double %1, i32 1
>>> %add3 = add i64 %i, 2
>>> %arrayidx4 = getelementptr inbounds double, double* %ptr, i64 %add3
>>> %2 = load double, double* %arrayidx4, align 8
>>> %vecinit5 = insertelement <4 x double> %vecinit2, double %2, i32 2
>>> %add6 = add i64 %i, 3
>>> %arrayidx7 = getelementptr inbounds double, double* %ptr, i64 %add6
>>> %3 = load double, double* %arrayidx7, align 8
>>> %vecinit8 = insertelement <4...
2017 Sep 26
2
RFC phantom memory intrinsic
...> %vecinit2 = insertelement <4 x double> %vecinit, double %1, i32 1
>>>> %add3 = add i64 %i, 2
>>>> %arrayidx4 = getelementptr inbounds double, double* %ptr, i64 %add3
>>>> %2 = load double, double* %arrayidx4, align 8
>>>> %vecinit5 = insertelement <4 x double> %vecinit2, double %2, i32 2
>>>> %add6 = add i64 %i, 3
>>>> %arrayidx7 = getelementptr inbounds double, double* %ptr, i64 %add6
>>>> %3 = load double, double* %arrayidx7, align 8
>>>> %vecinit8 = inse...
2017 Sep 26
0
RFC phantom memory intrinsic
...nsertelement <4 x double> %vecinit, double %1, i32 1
>>>>> %add3 = add i64 %i, 2
>>>>> %arrayidx4 = getelementptr inbounds double, double* %ptr, i64 %add3
>>>>> %2 = load double, double* %arrayidx4, align 8
>>>>> %vecinit5 = insertelement <4 x double> %vecinit2, double %2, i32 2
>>>>> %add6 = add i64 %i, 3
>>>>> %arrayidx7 = getelementptr inbounds double, double* %ptr, i64 %add6
>>>>> %3 = load double, double* %arrayidx7, align 8
>>>>>...
2017 Sep 12
3
RFC phantom memory intrinsic
Hi,
To solve PR21780, I plan to add new functionality that restores
memory operations that were once deleted. In this particular case these
are the load operations that InstCombine deleted. Note that once a
load has been removed there is no way to restore it, and that
prevents us from vectorizing the shuffle operation. There are probably
more similar issues where this approach could