search for: d57460

Displaying 6 results from an estimated 6 matches for "d57460".

2019 Jan 31
2
[RFC] Late (OpenMP) GPU code "SPMD-zation"
...h statement in the user code (module) [1, line 111] with<br>a fallback if there are potentially [1, line 212] hidden parallel<br>regions.<br><br><br>Does that make sense?<br><br><br>[0] </font></tt><a href="https://reviews.llvm.org/D57460#change-e9Ljd9RgdWYz"><tt><font size=2>https://reviews.llvm.org/D57460#change-e9Ljd9RgdWYz</font></tt></a><tt><font size=2><br>[1] </font></tt><a href="https://reviews.llvm.org/D57460#change-8gnnGNfJVR4B"><tt>&...
2019 Mar 13
2
[RFC] Late (OpenMP) GPU code "SPMD-zation"
...ea adding of one more class for NVPTX codegen. All your changes should be on top of the eixisting solution. ------------- Best regards, Alexey Bataev 13.03.2019 15:08, Doerfert, Johannes пишет: > Please consider reviewing the code for the proposed approach here: > https://reviews.llvm.org/D57460 > > Initial tests, e.g., on the nw (needleman-wunsch) benchmark in the > rodinia 3.1 benchmark suite, showed 30% improvement after SPMD mode was > enabled automatically. The code in nw is conceptually equivalent to the > first example in the "to_SPMD_mode.ll" test case that...
2019 Mar 13
2
[RFC] Late (OpenMP) GPU code "SPMD-zation"
...X codegen. All your changes should be on top of the eixisting > solution. > > ------------- > Best regards, > Alexey Bataev > 13.03.2019 15:08, Doerfert, Johannes пишет: >> Please consider reviewing the code for the proposed approach here: >> https://reviews.llvm.org/D57460 >> >> Initial tests, e.g., on the nw (needleman-wunsch) benchmark in the >> rodinia 3.1 benchmark suite, showed 30% improvement after SPMD mode was >> enabled automatically. The code in nw is conceptually equivalent to the >> first example in the "to_SPMD_mode.ll&...
2019 Jan 22
7
[RFC] Late (OpenMP) GPU code "SPMD-zation"
Where we are ------------ Currently, when we generate OpenMP target offloading code for GPUs, we use sufficient syntactic criteria to decide between two execution modes: 1) SPMD -- All target threads (in an OpenMP team) run all the code. 2) "Guarded" -- The master thread (of an OpenMP team) runs the user code. If an OpenMP distribute region is encountered,
2019 Mar 13
4
[RFC] Late (OpenMP) GPU code "SPMD-zation"
...t;>> solution. >>> >>> ------------- >>> Best regards, >>> Alexey Bataev >>> 13.03.2019 15:08, Doerfert, Johannes пишет: >>>> Please consider reviewing the code for the proposed approach here: >>>> https://reviews.llvm.org/D57460 >>>> >>>> Initial tests, e.g., on the nw (needleman-wunsch) benchmark in the >>>> rodinia 3.1 benchmark suite, showed 30% improvement after SPMD mode was >>>> enabled automatically. The code in nw is conceptually equivalent to the >>>> fi...
2019 Mar 13
3
[RFC] Late (OpenMP) GPU code "SPMD-zation"
...gt;>> solution. >>> >>> ------------- >>> Best regards, >>> Alexey Bataev >>> 13.03.2019 15:08, Doerfert, Johannes пишет: >>>> Please consider reviewing the code for the proposed approach here: >>>> https://reviews.llvm.org/D57460 >>>> >>>> Initial tests, e.g., on the nw (needleman-wunsch) benchmark in the >>>> rodinia 3.1 benchmark suite, showed 30% improvement after SPMD mode was >>>> enabled automatically. The code in nw is conceptually equivalent to the >>>> fir...