Displaying 4 results from an estimated 4 matches for "d60262".
2019 Apr 04
5
[RFC] Changes to llvm.experimental.vector.reduce intrinsics
...ics. In this proposal the intrinsic is prefixed with a 'v2' for the time being, with the expectation this will be dropped when we remove 'experimental' from the reduction intrinsics in the future.
[Option B] Having separate ordered and unordered intrinsics (https://reviews.llvm.org/D60262).
declare float @llvm.experimental.vector.reduce.ordered.fadd.f32.v4f32(float %start_value, <4 x float> %vec)
declare float @llvm.experimental.vector.reduce.unordered.fadd.f32.v4f32(<4 x float> %vec)
This will mean that the behaviour is explicit from the intrinsic and the use of &...
2019 May 16
4
[RFC] Changes to llvm.experimental.vector.reduce intrinsics
...t ordered and unordered reductions the same (in that they have the same arguments). Fast-math flags determine whether or not the intrinsic needs to be evaluated in strict order. Codegen for non-strict reductions should be able to fold away the identity-value.
For Option B (https://reviews.llvm.org/D60262), David made the argument that making the reduction-order explicit (as opposed to deducing this from fast-math flags) would ensure the ordering is always as expected, even when FMF on the call sites are dropped for some reason.
Is it correct that I sensed a slight preference for Option A? i.e. R...
2019 Apr 10
2
[RFC] Changes to llvm.experimental.vector.reduce intrinsics
...is prefixed with a 'v2' for the time being, with the expectation this will be dropped when we remove 'experimental' from the reduction intrinsics in the future.
>>>>
>>>> [Option B] Having separate ordered and unordered intrinsics (https://reviews.llvm.org/D60262).
>>>>
>>>> declare float @llvm.experimental.vector.reduce.ordered.fadd.f32.v4f32(float %start_value, <4 x float> %vec)
>>>> declare float @llvm.experimental.vector.reduce.unordered.fadd.f32.v4f32(<4 x float> %vec)
>>>>
>>>...
2019 Apr 05
4
[RFC] Changes to llvm.experimental.vector.reduce intrinsics
...with a 'v2' for
>> the time being, with the expectation this will be dropped when we
>> remove 'experimental' from the reduction intrinsics in the future.
>>
>> [Option B] Having separate ordered and unordered intrinsics
>> (https://reviews.llvm.org/D60262).
>>
>> declare float
>> @llvm.experimental.vector.reduce.ordered.fadd.f32.v4f32(float
>> %start_value, <4 x float> %vec)
>>
>> declare float
>> @llvm.experimental.vector.reduce.unordered.fadd.f32.v4f32(<4 x float>
>> %vec)
>>...