search for: d91353

Displaying 6 results from an estimated 6 matches for "d91353".

Did you mean: d91354
2020 Nov 12
5
Complex proposal v3 + roundtable agenda
...ontends to emit. > > To better illustrate what this approach could look like, I put up a set of rough patches that introduce a @llvm.complex.multiply intrinsic (https://reviews.llvm.org/D91347), replace a set of fadd/fsub/fmul instructions with @llvm.complex.multiply (https://reviews.llvm.org/D91353) and lower the intrinsic for FCMLA on AArch64 (https://reviews.llvm.org/D91354). Note that those are just rough proof-of-concept patches. > > Cheers, > Florian Hi Florian, The proposed experimental intrinsics are a difficult detour to accept for performance reasons. With a complex type,...
2020 Nov 12
0
Complex proposal v3 + roundtable agenda
...ontends to emit. > > To better illustrate what this approach could look like, I put up a set of rough patches that introduce a @llvm.complex.multiply intrinsic (https://reviews.llvm.org/D91347), replace a set of fadd/fsub/fmul instructions with @llvm.complex.multiply (https://reviews.llvm.org/D91353) and lower the intrinsic for FCMLA on AArch64 (https://reviews.llvm.org/D91354). Note that those are just rough proof-of-concept patches. > > Cheers, > Florian Hi Florian, The proposed experimental intrinsics are a difficult detour to accept for performance reasons. With a complex type,...
2020 Nov 12
0
Complex proposal v3 + roundtable agenda
...o emit. >> >> To better illustrate what this approach could look like, I put up a set of rough patches that introduce a @llvm.complex.multiply intrinsic (https://reviews.llvm.org/D91347), replace a set of fadd/fsub/fmul instructions with @llvm.complex.multiply (https://reviews.llvm.org/D91353) and lower the intrinsic for FCMLA on AArch64 (https://reviews.llvm.org/D91354). Note that those are just rough proof-of-concept patches. >> >> Cheers, >> Florian > > > The proposed experimental intrinsics are a difficult detour to accept > for performance reasons....
2020 Nov 12
0
Complex proposal v3 + roundtable agenda
...ustrate what this approach could look like, I put up a set of rough patches that introduce a @llvm.complex.multiply intrinsic (https://reviews.llvm.org/D91347 <https://reviews.llvm.org/D91347>), replace a set of fadd/fsub/fmul instructions with @llvm.complex.multiply (https://reviews.llvm.org/D91353 <https://reviews.llvm.org/D91353>) and lower the intrinsic for FCMLA on AArch64 (https://reviews.llvm.org/D91354 <https://reviews.llvm.org/D91354>). Note that those are just rough proof-of-concept patches. Cheers, Florian [1] https://developer.arm.com/docs/ddi0596/h/simd-and-floating...
2019 Oct 22
4
Complex proposal v3 + roundtable agenda
Ahead of the Wednesday’s roundtable at the developers’ conference, here is version three of the proposal for first-class complex types in LLVM. I was not able to add Krzysztof Parzyszek’s suggestion of a “cunzip” intrinsic returning two vectors as I could not find examples of intrinsics that return two values at the IR level. The Hexagon intrinsics declared to return two values do not actually
2020 Nov 12
2
Complex proposal v3 + roundtable agenda
...ontends to emit. > > To better illustrate what this approach could look like, I put up a set of rough patches that introduce a @llvm.complex.multiply intrinsic (https://reviews.llvm.org/D91347), replace a set of fadd/fsub/fmul instructions with @llvm.complex.multiply (https://reviews.llvm.org/D91353) and lower the intrinsic for FCMLA on AArch64 (https://reviews.llvm.org/D91354). Note that those are just rough proof-of-concept patches. > > Cheers, > Florian > > > > The proposed experimental intrinsics are a difficult detour to accept > for performance reasons. With a co...