2017 Nov 14
1
OrcJIT + CUDA Prototype for Cling
...//github.com/llvm-mirror/clang/tree/master/examples/clang-interpreter
> <https://github.com/llvm-mirror/clang/tree/master/examples/clang-interpreter>
>
> I also pushed the source code to GitHub, along with installation
> instructions and examples:
> https://github.com/SimeonEhrig/CUDA-Runtime-Interpreter
>
> The device code can be generated as PTX with either clang's
> CUDA frontend or NVCC.
>
> Here is the workflow in five stages:
>
> 1. generating...
2017 Sep 27
2
OrcJIT + CUDA Prototype for Cling
...t by changing the define INTERPRET from 1 to 0.
=== Implementation
The prototype is based on the clang example in
https://github.com/llvm-mirror/clang/tree/master/examples/clang-interpreter
I also pushed the source code to GitHub, along with installation
instructions and examples:
https://github.com/SimeonEhrig/CUDA-Runtime-Interpreter
The device code can be generated as PTX with either clang's CUDA
frontend or NVCC.
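
To make the two options concrete, here is a minimal sketch of the command lines that would typically produce PTX with each tool. It is not taken from the prototype: the helper names `ptx_command` and `generate_ptx`, the file names, and the `sm_50` target are assumptions for illustration only. The flags used (`-ptx` and `-arch` for nvcc; `-S`, `--cuda-device-only` and `--cuda-gpu-arch` for clang) are standard options of those tools.

```python
import shutil
import subprocess


def ptx_command(tool, source, output, arch="sm_50"):
    """Build the argv list that asks `tool` to emit PTX for `source`.

    `tool` is "nvcc" or "clang". The default `arch` is an assumed
    example target, not a value from the prototype.
    """
    if tool == "nvcc":
        # nvcc: -ptx stops after generating PTX for the device code.
        return ["nvcc", "-ptx", f"-arch={arch}", source, "-o", output]
    if tool == "clang":
        # clang: compile only the device side and emit assembly, which is PTX.
        return [
            "clang++", "-S", "--cuda-device-only",
            f"--cuda-gpu-arch={arch}", source, "-o", output,
        ]
    raise ValueError(f"unknown tool: {tool}")


def generate_ptx(tool, source, output, arch="sm_50"):
    """Run the command if the compiler is on PATH; return True if it ran."""
    cmd = ptx_command(tool, source, output, arch)
    if shutil.which(cmd[0]) is None:
        return False  # compiler not installed, nothing was executed
    subprocess.run(cmd, check=True)
    return True
```

For example, `generate_ptx("clang", "kernel.cu", "kernel.ptx")` would write the PTX file if `clang++` and a CUDA installation are available, and return `False` otherwise.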
Here is the workflow in five stages:
1. generate PTX device code (a kind of NVIDIA assembly language)
2. translate PTX to SASS (the GPU machine code generated from PTX)
3. generate a fatbinary (a...