search for: spmd

Displaying 20 results from an estimated 70 matches for "spmd".

2019 Jan 23
1
[RFC] Late (OpenMP) GPU code "SPMD-zation"
...dopting the same code generation strategy. The proposal is affecting us. It would be nice to know more details about the proposal, so we can prepare ourselves to adapt flang (if everything goes on the way). Have you found a solution for data sharing? How are you going to manage data sharing for SPMD and non-SPMD? From: cfe-dev <cfe-dev-bounces at lists.llvm.org> On Behalf Of Doerfert, Johannes Rudolf via cfe-dev Sent: Wednesday, January 23, 2019 12:50 AM To: Alexey Bataev <a.bataev at outlook.com> Cc: llvm-dev <llvm-dev at lists.llvm.org>; cfe-dev at lists.llvm.org; openmp-d...
2019 Jan 22
3
[RFC] Late (OpenMP) GPU code "SPMD-zation"
...hide (most of) the differences in the runtime. If I understand you correctly, you say the data sharing code looks very different and the differences cannot be hidden, correct? It would be helpful for me to understand your point if you could give me a piece of OpenMP for which the data sharing in SPMD mode and "guarded" mode are as different as possible. I can compile it in both modes myself so high-level OpenMP is fine (I will disable SPMD mode manually in the source if necessary). Thanks, Johannes ________________________________ From: llvm-dev <llvm-dev-bounces at lists....
2019 Jan 22
2
[RFC] Late (OpenMP) GPU code "SPMD-zation"
...___ From: Alexey Bataev <a.bataev at outlook.com> Sent: Tuesday, January 22, 2019 12:52:42 PM To: Doerfert, Johannes Rudolf; cfe-dev at lists.llvm.org Cc: openmp-dev at lists.llvm.org; LLVM-Dev; Finkel, Hal J.; Alexey Bataev; Arpith Chacko Jacob Subject: Re: [RFC] Late (OpenMP) GPU code "SPMD-zation" The globalization for the local variables, for example. It must be implemented in the compiler to get good performance, not in the runtime. ------------- Best regards, Alexey Bataev 22.01.2019 13:43, Doerfert, Johannes Rudolf wrote: Could you elaborate on what you refer to wrt...
2019 Jan 22
7
[RFC] Late (OpenMP) GPU code "SPMD-zation"
Where we are ------------ Currently, when we generate OpenMP target offloading code for GPUs, we use sufficient syntactic criteria to decide between two execution modes: 1) SPMD -- All target threads (in an OpenMP team) run all the code. 2) "Guarded" -- The master thread (of an OpenMP team) runs the user code. If an OpenMP distribute region is encountered, thus if all threads (in the OpenMP team) are supposed to...
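The two execution modes named in this snippet can be illustrated with a small OpenMP C sketch. It is a hand-written example, not code from the RFC: the function names are hypothetical, and which mode a particular compiler actually selects for each region is an assumption here. The first function uses a combined construct in which every thread runs the loop; the second has sequential code inside the target region ahead of the parallel loop, which is the kind of region that needs a single thread to run the user code.

```c
#include <assert.h>

/* Combined construct: all threads of the team execute the loop, so the
   region is a candidate for SPMD mode based on its syntax alone. */
static void scale_spmd(const int *in, int *out, int n) {
    assert(n > 0);
#pragma omp target teams distribute parallel for map(to: in[0:n]) map(from: out[0:n])
    for (int i = 0; i < n; ++i)
        out[i] = 2 * in[i];
}

/* Sequential code precedes the parallel loop inside the target region.
   One thread has to run that code alone, which is the "guarded" case. */
static int scale_guarded(const int *in, int *out, int n) {
    int offset = 0;
    assert(n > 0);                 /* in[0] is read below */
#pragma omp target map(to: in[0:n]) map(from: out[0:n]) map(tofrom: offset)
    {
        offset = in[0] + 1;        /* executed once, not by every thread */
#pragma omp parallel for
        for (int i = 0; i < n; ++i)
            out[i] = 2 * in[i] + offset;
    }
    return offset;
}
```

Both functions also compile and run serially when OpenMP is disabled, since the pragmas are then ignored.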
2019 Jan 22
2
[RFC] Late (OpenMP) GPU code "SPMD-zation"
...___ From: Alexey Bataev <a.bataev at outlook.com> Sent: Tuesday, January 22, 2019 12:34:01 PM To: Doerfert, Johannes Rudolf; cfe-dev at lists.llvm.org Cc: openmp-dev at lists.llvm.org; LLVM-Dev; Finkel, Hal J.; Alexey Bataev; Arpith Chacko Jacob Subject: Re: [RFC] Late (OpenMP) GPU code "SPMD-zation" ------------- Best regards, Alexey Bataev 22.01.2019 13:17, Doerfert, Johannes Rudolf wrote: Where we are ------------ Currently, when we generate OpenMP target offloading code for GPUs, we use sufficient syntactic criteria to decide between two execution modes: 1) SPMD --...
2019 Mar 13
2
[RFC] Late (OpenMP) GPU code "SPMD-zation"
...aev 13.03.2019 15:08, Doerfert, Johannes wrote: > Please consider reviewing the code for the proposed approach here: > https://reviews.llvm.org/D57460 > > Initial tests, e.g., on the nw (needleman-wunsch) benchmark in the > rodinia 3.1 benchmark suite, showed 30% improvement after SPMD mode was > enabled automatically. The code in nw is conceptually equivalent to the > first example in the "to_SPMD_mode.ll" test case that can be found here: > https://reviews.llvm.org/D57460#change-sBfg7kuN4Bid > > The implementation is missing key features but one shoul...
2019 Jan 31
2
[RFC] Late (OpenMP) GPU code "SPMD-zation"
...Thank you for the explanation. I think we need to clarify some details about code generation in Clang today: 1. non-SPMD mode, or generic mode, uses the master-worker code gen scheme where the master thread and the worker threads are disjoint sets of threads (when one set runs the other set is blocked and doesn't participate in the execution): ...
2011 Jun 22
0
[LLVMdev] Announcing ispc: a SPMD-on-SIMD compiler built on top of LLVM
Today Intel launched ispc, the Intel SPMD Program Compiler, an open-source compiler built on top of LLVM. It is available in both source and binary form from http://ispc.github.com/, under a BSD license. The goal of ispc is to provide a high-performance implementation of a C-based SPMD language for CPUs. The SPMD model has of course bee...
2019 Mar 13
2
[RFC] Late (OpenMP) GPU code "SPMD-zation"
...<a.bataev at outlook.com> > *Sent:* Wednesday, March 13, 2019 2:15:39 PM > *To:* Doerfert, Johannes; cfe-dev at lists.llvm.org > *Cc:* openmp-dev at lists.llvm.org; LLVM-Dev; Finkel, Hal J.; Alexey > Bataev; Arpith Chacko Jacob > *Subject:* Re: [RFC] Late (OpenMP) GPU code "SPMD-zation" > > > There are tooooooo(!) many changes, I don't know who's going to review > sooooo big a patch. You definitely need to split it into several smaller > patches. Also, I don't like the idea of adding one more class for > NVPTX codegen. All your changes should...
2015 May 13
8
[LLVMdev] RFC: Convergent attribute
Below is a proposal for a new "convergent" intrinsic attribute and MachineInstr property, needed for correctly modeling many SPMD/SIMT programming models in LLVM. Comments and feedback welcome. —Owen In order to make LLVM more suitable for programming models variously called SPMD and SIMT, we would like to propose a new intrinsic and MachineInstr annotation called "convergent", which will be used to impose ce...
2015 Jul 06
4
[LLVMdev] SPMD Autovectorizer
Hi, Are there any plans to integrate an autovectorizer for SPMD programs into LLVM? For example, there were previous discussions about integrating the whole function vectorizer (WFV) from Ralf Karrenberg into LLVM. Thanks, Zack -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachmen...
2019 Mar 13
4
[RFC] Late (OpenMP) GPU code "SPMD-zation"
...into the definition and the implementation, though that does not make > sense to me. > > The second contains the code generation. It is very much like the NVPTX > code generation except that it does not contain logic. > > The third is the LLVM pass which could be split into two, SPMD-mode and > state machine creation. I'll wait for feedback on the other patches > until I go ahead. > > >>>> Also, I don't like the idea of adding one more class for NVPTX >>> codegen. All your changes should be on top of the existing solution. >>>...
2019 Mar 13
3
[RFC] Late (OpenMP) GPU code "SPMD-zation"
...; Sent: Wednesday, March 13, 2019 4:33:03 PM To: Doerfert, Johannes Cc: Alexey Bataev; cfe-dev at lists.llvm.org; openmp-dev at lists.llvm.org; llvm-dev; Finkel, Hal J. Subject: Re: [RFC] Late (OpenMP) GPU code "SPMD-zation" 1. You don't need to implement everything in a single patch. The development process is a step-by-step process, where you commit something in small pieces. The code does not have to be fully functional; you may start from some basic features. Currently it is very hard to review. 2. I rathe...
2015 Jul 07
2
[LLVMdev] SPMD Autovectorizer
On 07/07/2015 01:32 PM, Renato Golin wrote: > Wouldn't OpenMP account for some of that? At least on a single > machine, could you have both parallel and simd optimisations done on > the same loop? The point in SPMD program description (e.g. CUDA or OpenCL C) autovectorization is to produce something like OpenMP parallel loops or SIMD pragmas automatically from the single thread/WI description, adhering to its barrier synchronization semantics etc. That is, the output of this pass could be also converted to O...
2015 Jul 07
2
[LLVMdev] SPMD Autovectorizer
...e model may not even apply > here. If you need write back loops, non-trivial synchronization > barriers between cores and other crazy stuff, adding all that to the > vectorizer would be bloating code beyond usability. On the other hand, > maybe not. Instead of implementing a monolithic SPMD-specific kernel vectorizer with lots of code duplicated from simple loop vectorizers, what pocl does is quite the opposite. All it does is identify the parallel regions between barriers, mark them as parallel loops, and let the other passes do what they like with the loops. Currently we apply inner...
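The idea in this snippet, turning the regions between barriers into loops over work-items, can be sketched by hand in plain C. This is an illustration only and not pocl output: the kernel, the function name run_work_group and the constant WG_SIZE are hypothetical, and the parallel-loop metadata the snippet mentions exists at the LLVM IR level, so it is not shown.

```c
#include <assert.h>

#define WG_SIZE 4  /* hypothetical work-group size */

/* Kernel as written for a single work-item (OpenCL-like pseudocode):
 *   tmp[lid] = in[lid] * 2;
 *   barrier();
 *   out[lid] = tmp[(lid + 1) % WG_SIZE];
 */
static void run_work_group(const int *in, int *out) {
    int tmp[WG_SIZE];
    assert(in && out);

    /* Parallel region 1: the code before the barrier, once per work-item.
       Iterations are independent, so a loop vectorizer may treat the loop
       as parallel. */
    for (int lid = 0; lid < WG_SIZE; ++lid)
        tmp[lid] = in[lid] * 2;

    /* The barrier becomes the boundary between the two loops: region 1 has
       finished for every work-item before region 2 starts. */

    /* Parallel region 2: the code after the barrier. */
    for (int lid = 0; lid < WG_SIZE; ++lid)
        out[lid] = tmp[(lid + 1) % WG_SIZE];
}
```

Splitting at the barrier is what keeps the read of a neighbouring work-item's tmp entry correct without any runtime synchronization.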
2006 May 06
1
IPsec with racoon2
Hi, I'm trying to get IPsec running between 2 FreeBSD (VMware) boxes, using racoon2. spmd and iked start up okay, but I get an error when I try a ping across the tunnel. /var/log/messages shows: May 5 13:52:36 biosa-vm4 iked: [INTERNAL_ERR]: if_spmd.c:726: SLID failed: 550 Operation failed May 5 13:52:36 biosa-vm4 iked: [INTERNAL_ERR]: isakmp.c:647:isakmp_initiate_cont(): 0:172.20.36....
2015 May 14
2
[LLVMdev] RFC: Convergent attribute
...ich would be very hard to address doesn't mean it isn't a limitation. :) > > Philip > > On 05/13/2015 01:17 PM, Owen Anderson wrote: >> Below is a proposal for a new "convergent" intrinsic attribute and MachineInstr property, needed for correctly modeling many SPMD/SIMT programming models in LLVM. Comments and feedback welcome. >> >> —Owen >> >> >> >> >> >> In order to make LLVM more suitable for programming models variously called SPMD >> and SIMT, we would like to propose a new intrinsic and Machi...
2015 Aug 14
2
[LLVMdev] RFC: Convergent attribute
...convergent. > > Jingyue > > On Wed, May 13, 2015 at 1:17 PM, Owen Anderson <resistor at mac.com> wrote: > Below is a proposal for a new "convergent" intrinsic attribute and MachineInstr property, needed for correctly modeling many SPMD/SIMT programming models in LLVM. Comments and feedback welcome. > > —Owen > > > > > > In order to make LLVM more suitable for programming models variously called SPMD > and SIMT, we would like to propose a new intrinsic and MachineInstr annotation > called "...
2015 Jul 07
2
[LLVMdev] SPMD Autovectorizer
On 07/07/2015 09:30 PM, C Bergström wrote: > If you're going to "autopar" (turn a loop into threads which run on > many cores or something) then please don't add a dependency on OMP. I wouldn't, but simply utilize the parallel loop metadata that was originally designed for this purpose. What is done with that MD is up to other passes. -- --Pekka
2015 Aug 14
2
[LLVMdev] RFC: Convergent attribute
...or the semantics of convergent. > > Jingyue > > On Wed, May 13, 2015 at 1:17 PM, Owen Anderson <resistor at mac.com> wrote: > >> Below is a proposal for a new "convergent" intrinsic attribute and >> MachineInstr property, needed for correctly modeling many SPMD/SIMT >> programming models in LLVM. Comments and feedback welcome. >> >> —Owen >> >> >> >> >> >> In order to make LLVM more suitable for programming models variously >> called SPMD >> and SIMT, we would like to propose a new intrins...