search for: gpu_kernel

Displaying 6 results from an estimated 6 matches for "gpu_kernel".

2012 Mar 30
0
[LLVMdev] Why this fails on X86_64 host?
hi justn, I have a llvm ir file which is generated by my own code generator. When I run *llc -march=ptx64 ./gpu_kernel.ll * on it, the following error was given LLVM ERROR: Cannot select: 0x269a7a0: ch = store 0x2666370, 0x2697760, 0x269a2a0, 0x2698d90<ST4[%p_arrayidx5], trunc to i32> [ID=20] 0x2697760: i64 = add 0x2699ea0, 0x2699590 [ORD=23] [ID=16] 0x2699ea0: i64 = shl 0x2699fa0, 0x269a6a0 [ORD=22]...
2012 Jul 26
0
[LLVMdev] [PROPOSAL] LLVM multi-module support
...------------------------------------------------------ > target datalayout = ... > target triple = "x86_64-unknown-linux-gnu" > > @llvm_kernel = private unnamed_addr constant llvm_kernel { > target triple = nvptx64-unknown-unknown > define internal ptx_kernel void @gpu_kernel(i8* %Array) { > ... > } > } > ------------------------------------------------------------------------ > > By default the global will be compiled to a llvm string stored in the > object file. We could also think about translating it to PTX or AMD's > HSA-IL, su...
2012 Jul 26
6
[LLVMdev] [PROPOSAL] LLVM multi-module support
...ariables. ------------------------------------------------------------------------ target datalayout = ... target triple = "x86_64-unknown-linux-gnu" @llvm_kernel = private unnamed_addr constant llvm_kernel { target triple = nvptx64-unknown-unknown define internal ptx_kernel void @gpu_kernel(i8* %Array) { ... } } ------------------------------------------------------------------------ By default the global will be compiled to a llvm string stored in the object file. We could also think about translating it to PTX or AMD's HSA-IL, such that e.g. PTX can be passed to a run...
2012 Jul 26
0
[LLVMdev] [PROPOSAL] LLVM multi-module support
...---------------------------------------------------- > target datalayout = ... > target triple = "x86_64-unknown-linux-gnu" > > @llvm_kernel = private unnamed_addr constant llvm_kernel { > target triple = nvptx64-unknown-unknown > define internal ptx_kernel void @gpu_kernel(i8* %Array) { > ... > } > } > ------------------------------------------------------------------------ > > By default the global will be compiled to a llvm string stored in the > object file. We could also think about translating it to PTX or AMD's > HSA-IL, su...
2012 Jul 26
7
[LLVMdev] [PROPOSAL] LLVM multi-module support
...---------------------- > > target datalayout = ... > > target triple = "x86_64-unknown-linux-gnu" > > > > @llvm_kernel = private unnamed_addr constant llvm_kernel { > > target triple = nvptx64-unknown-unknown > > define internal ptx_kernel void @gpu_kernel(i8* %Array) { > > ... > > } > > } > > ------------------------------------------------------------------------ > > > > By default the global will be compiled to a llvm string stored in the > > object file. We could also think about translating it...
2012 Jul 26
0
[LLVMdev] [PROPOSAL] LLVM multi-module support
...gt;> > target datalayout = ... >> > target triple = "x86_64-unknown-linux-gnu" >> > >> > @llvm_kernel = private unnamed_addr constant llvm_kernel { >> > target triple = nvptx64-unknown-unknown >> > define internal ptx_kernel void @gpu_kernel(i8* %Array) { >> > ... >> > } >> > } >> > ------------------------------------------------------------------------ >> > >> > By default the global will be compiled to a llvm string stored in the >> > object file. We could also...