Displaying 4 results from an estimated 4 matches for "mask_readwrite".
2019 Aug 08
4
[LLVM] (RFC) Addition/Support of new Vectorization Pragmas in LLVM
...titute of Technology(IIT), Hyderabad, we
would like to propose the addition of the following pragmas in LLVM that
aide in (or possibly increase the scope of) vectorization in LLVM (in
comparison with other compilers).
1.
ivdep
2.
Nontemporal
3.
[no]vecremainder
4.
[no]mask_readwrite
5.
[un]aligned
Could you please check the following Google document for the semantic
description of these pragmas:
https://docs.google.com/document/d/1YjGnyzWFKJvqbpCsZicCUczzU8HlLHkmG9MssUw-R1A/edit?usp=sharing
It would be great if you could please review the above document and suggest...
2019 Aug 08
3
[LLVM] (RFC) Addition/Support of new Vectorization Pragmas in LLVM
...vectorizer to attempt to vectorize the remainder loop, or should the vectorizer use a different method?
Something like that. There were patches posted at some point to enable tail-loop vectorization. At this point, I imagine that you'd construct a VPlan with the vectorized tail.
*
* mask_readwrite/nomask_readwrite: Is it a good idea to implement a pragma that will generate mask intrinsics in the IR? What other architectures (except x86) has support for masked read/writes?
ARM SVE might also fall into this category.
*
Reference:https://llvm.org/devmtg/2015-04/slides/MaskedIntrinsics.p...
2019 Aug 09
3
[LLVM] (RFC) Addition/Support of new Vectorization Pragmas in LLVM
...gt;>
>>
>> Something like that. There were patches posted at some point to enable
>> tail-loop vectorization. At this point, I imagine that you'd construct a
>> VPlan with the vectorized tail.
>>
>>
>>
>> -
>> -
>>
>> mask_readwrite/nomask_readwrite: Is it a good idea to implement a
>> pragma that will generate mask intrinsics in the IR? What other
>> architectures (except x86) has support for masked read/writes?
>>
>>
>> ARM SVE might also fall into this category.
>>
>>
>>...
2019 Aug 13
2
[LLVM] (RFC) Addition/Support of new Vectorization Pragmas in LLVM
...vectorizer to attempt to vectorize the remainder loop, or should the vectorizer use a different method?
Something like that. There were patches posted at some point to enable tail-loop vectorization. At this point, I imagine that you'd construct a VPlan with the vectorized tail.
*
* mask_readwrite/nomask_readwrite: Is it a good idea to implement a pragma that will generate mask intrinsics in the IR? What other architectures (except x86) has support for masked read/writes?
ARM SVE might also fall into this category.
*
Reference:https://llvm.org/devmtg/2015-04/slides/MaskedIntrinsics.p...