search for: __tgt_target

Displaying 6 results from an estimated 6 matches for "__tgt_target".

2018 Apr 25
0
[LLVM][RFC] Representing the target device information in the LLVM IR
On 4/25/2018 3:05 PM, Lin, Jin via llvm-dev wrote: > > RFC: Representing the target device information in the LLVM IR > > =========================================================================== > > Why this RFC change? > > ================= > > The target device information needs to be passed to the LLVM backend > when OpenMP backend outlining is enabled. For
2018 Apr 25
2
[LLVM][RFC] Representing the target device information in the LLVM IR
...rresponding IR on the host side. %0 = load double, double* @gg, align 8, !tbaa !3 %1 = bitcast double %0 to i64 ... %12 = getelementptr inbounds [4 x i8*], [4 x i8*]* %.offload_baseptrs, i32 0, i32 2 %13 = bitcast i8** %12 to i64* store i64 %1, i64* %13, align 8 ... %20 = call i32 @__tgt_target(i64 -1, i8* @.omp_offload.region_id, i32 4, i8** %4, i8** %6, i64* getelementptr inbounds ([4 x i64], [4 x i64]* @.offload_sizes, i32 0, i32 0), i64* getelementptr inbounds ([4 x i64], [4 x i64]* @.offload_maptypes, i32 0, i32 0)) Thanks, Jin From: Friedman, Eli [mailto:efriedma at codeaurora.or...
2018 Apr 25
2
[LLVM][RFC] Representing the target device information in the LLVM IR
RFC: Representing the target device information in the LLVM IR =========================================================================== Why this RFC change? ================= The target device information needs to be passed to the LLVM backend when OpenMP backend outlining is enabled. For example, for multiple target devices, the target compilation has to generate a single host to support all
2020 Mar 29
2
GSoC Interested student
...there are pending tasks, and then call its synchronous versions. So, is it a good idea to wrap them in functions that call them as asynchronous (using std::async), then return an object to wait on?. Then, waiting on the object returned. 2. I'm thinking on replacing the data transfer inside __tgt_target_teams with __tgt_target_data_begin. The problem with this is that __tgt_target_data_begin does not initialize [first]private variables, and doesn't check for lambda mappings. Therefore, my idea to go around this problem is to create another wrapper function F, that calls __tgt_target_data_...
2020 Apr 07
2
GSoC Interested student
Hi Johannes, In order to create the testcases with the FIXMEs showing how it should look after the transformation, I've been trying to come up with a way of splitting the runtime call __tgt_target_teams based on Silei's recent patch D77005. I like the idea of using its asynchronous version __tgt_target_teams_nowait, but I think we would have to modify it, either to return the async_info struct: __tgt_async_info __tgt_target_teams_nowait(...) // This would break the current canonical...
2020 Mar 25
2
GSoC Interested student
Hi Johannes, Sorry about the failing tests. I completely forgot about running all of them, I only ran the tests for the openmp ipo transforms. I found the GH issue in cache. I can post it here if you want. I analyzed these runtime calls, __tgt_target_teams_[nowait]. What I don't understand about the problem we want to solve (splitting the memory transfer into issue and wait and legally moving code inside) is that the memory transfer is made inside those runtime calls. Therefore, the runtime calls to the memory transfer cannot be analyze...