Displaying 6 results from an estimated 6 matches for "gpu_kernel".
2012 Mar 30
0
[LLVMdev] Why this fails on X86_64 host?
hi justn,
I have a llvm ir file which is generated by my own code generator.
When I run
*llc -march=ptx64 ./gpu_kernel.ll *
on it, the following error was given
LLVM ERROR: Cannot select: 0x269a7a0: ch = store 0x2666370, 0x2697760,
0x269a2a0, 0x2698d90<ST4[%p_arrayidx5], trunc to i32> [ID=20]
0x2697760: i64 = add 0x2699ea0, 0x2699590 [ORD=23] [ID=16]
0x2699ea0: i64 = shl 0x2699fa0, 0x269a6a0 [ORD=22]...
2012 Jul 26
0
[LLVMdev] [PROPOSAL] LLVM multi-module support
...------------------------------------------------------
> target datalayout = ...
> target triple = "x86_64-unknown-linux-gnu"
>
> @llvm_kernel = private unnamed_addr constant llvm_kernel {
> target triple = nvptx64-unknown-unknown
> define internal ptx_kernel void @gpu_kernel(i8* %Array) {
> ...
> }
> }
> ------------------------------------------------------------------------
>
> By default the global will be compiled to a llvm string stored in the
> object file. We could also think about translating it to PTX or AMD's
> HSA-IL, su...
2012 Jul 26
6
[LLVMdev] [PROPOSAL] LLVM multi-module support
...ariables.
------------------------------------------------------------------------
target datalayout = ...
target triple = "x86_64-unknown-linux-gnu"
@llvm_kernel = private unnamed_addr constant llvm_kernel {
target triple = nvptx64-unknown-unknown
define internal ptx_kernel void @gpu_kernel(i8* %Array) {
...
}
}
------------------------------------------------------------------------
By default the global will be compiled to a llvm string stored in the
object file. We could also think about translating it to PTX or AMD's
HSA-IL, such that e.g. PTX can be passed to a run...
2012 Jul 26
0
[LLVMdev] [PROPOSAL] LLVM multi-module support
...----------------------------------------------------
> target datalayout = ...
> target triple = "x86_64-unknown-linux-gnu"
>
> @llvm_kernel = private unnamed_addr constant llvm_kernel {
> target triple = nvptx64-unknown-unknown
> define internal ptx_kernel void @gpu_kernel(i8* %Array) {
> ...
> }
> }
> ------------------------------------------------------------------------
>
> By default the global will be compiled to a llvm string stored in the
> object file. We could also think about translating it to PTX or AMD's
> HSA-IL, su...
2012 Jul 26
7
[LLVMdev] [PROPOSAL] LLVM multi-module support
...----------------------
> > target datalayout = ...
> > target triple = "x86_64-unknown-linux-gnu"
> >
> > @llvm_kernel = private unnamed_addr constant llvm_kernel {
> > target triple = nvptx64-unknown-unknown
> > define internal ptx_kernel void @gpu_kernel(i8* %Array) {
> > ...
> > }
> > }
> > ------------------------------------------------------------------------
> >
> > By default the global will be compiled to a llvm string stored in the
> > object file. We could also think about translating it...
2012 Jul 26
0
[LLVMdev] [PROPOSAL] LLVM multi-module support
...gt;> > target datalayout = ...
>> > target triple = "x86_64-unknown-linux-gnu"
>> >
>> > @llvm_kernel = private unnamed_addr constant llvm_kernel {
>> > target triple = nvptx64-unknown-unknown
>> > define internal ptx_kernel void @gpu_kernel(i8* %Array) {
>> > ...
>> > }
>> > }
>> > ------------------------------------------------------------------------
>> >
>> > By default the global will be compiled to a llvm string stored in the
>> > object file. We could also...