Displaying 6 results from an estimated 6 matches for "d57460".
2019 Jan 31
2
[RFC] Late (OpenMP) GPU code "SPMD-zation"
...h statement in the user code (module) [1, line 111] with
a fallback if there are potentially [1, line 212] hidden parallel
regions.

Does that make sense?

[0] https://reviews.llvm.org/D57460#change-e9Ljd9RgdWYz
[1] https://reviews.llvm.org/D57460#change-8gnnGNfJVR4B...
2019 Mar 13
2
[RFC] Late (OpenMP) GPU code "SPMD-zation"
...ea adding of one more class for NVPTX codegen.
All your changes should be on top of the existing solution.
-------------
Best regards,
Alexey Bataev
13.03.2019 15:08, Doerfert, Johannes wrote:
> Please consider reviewing the code for the proposed approach here:
> https://reviews.llvm.org/D57460
>
> Initial tests, e.g., on the nw (needleman-wunsch) benchmark in the
> rodinia 3.1 benchmark suite, showed 30% improvement after SPMD mode was
> enabled automatically. The code in nw is conceptually equivalent to the
> first example in the "to_SPMD_mode.ll" test case that...
2019 Mar 13
2
[RFC] Late (OpenMP) GPU code "SPMD-zation"
...X codegen. All your changes should be on top of the existing
> solution.
>
> -------------
> Best regards,
> Alexey Bataev
> 13.03.2019 15:08, Doerfert, Johannes wrote:
>> Please consider reviewing the code for the proposed approach here:
>> https://reviews.llvm.org/D57460
>>
>> Initial tests, e.g., on the nw (needleman-wunsch) benchmark in the
>> rodinia 3.1 benchmark suite, showed 30% improvement after SPMD mode was
>> enabled automatically. The code in nw is conceptually equivalent to the
>> first example in the "to_SPMD_mode.ll"...
2019 Jan 22
7
[RFC] Late (OpenMP) GPU code "SPMD-zation"
Where we are
------------
Currently, when we generate OpenMP target offloading code for GPUs, we
use sufficient syntactic criteria to decide between two execution modes:
1) SPMD -- All target threads (in an OpenMP team) run all the code.
2) "Guarded" -- The master thread (of an OpenMP team) runs the user
code. If an OpenMP distribute region is encountered,
2019 Mar 13
4
[RFC] Late (OpenMP) GPU code "SPMD-zation"
...>>> solution.
>>>
>>> -------------
>>> Best regards,
>>> Alexey Bataev
>>> 13.03.2019 15:08, Doerfert, Johannes wrote:
>>>> Please consider reviewing the code for the proposed approach here:
>>>> https://reviews.llvm.org/D57460
>>>>
>>>> Initial tests, e.g., on the nw (needleman-wunsch) benchmark in the
>>>> rodinia 3.1 benchmark suite, showed 30% improvement after SPMD mode was
>>>> enabled automatically. The code in nw is conceptually equivalent to the
>>>> fi...
2019 Mar 13
3
[RFC] Late (OpenMP) GPU code "SPMD-zation"
...>>> solution.
>>>
>>> -------------
>>> Best regards,
>>> Alexey Bataev
>>> 13.03.2019 15:08, Doerfert, Johannes wrote:
>>>> Please consider reviewing the code for the proposed approach here:
>>>> https://reviews.llvm.org/D57460
>>>>
>>>> Initial tests, e.g., on the nw (needleman-wunsch) benchmark in the
>>>> rodinia 3.1 benchmark suite, showed 30% improvement after SPMD mode was
>>>> enabled automatically. The code in nw is conceptually equivalent to the
>>>> fir...