Displaying 3 results from an estimated 3 matches for "nvidiacl".
2016 Apr 08
2
LIBCLC with LLVM 3.9 Trunk
It's not clear what is actually wrong from your original message, I think
you need to give some more information as to what you are doing: Example
source, what target GPU, compiler error messages or other evidence of "it's
wrong" (llvm IR, disassembly, etc) ...
--
Mats
On 8 April 2016 at 09:55, Liu Xin via llvm-dev <llvm-dev at lists.llvm.org>
wrote:
> I built it
2015 Feb 05
2
[LLVMdev] Example for usage of LLVM/Clang/libclc
Hi,
> which works but it produces LLVM IR code for all OpenCL intrinsics
> implemented by libclc along with the kernel I am interested in, is their a
> possibility to avoid this ? and only produce the llvm code for the kernel
> required ?
Mark all functions apart from the kernel entry points with the
internal attribute and then run global dead code elimination (it
should remove most
2015 Feb 03
2
[LLVMdev] Example for usage of LLVM/Clang/libclc
Hi,
My goal is to use Clang/LLVM/libclc to compile an OpenCL kernel and
eventually generate a PTX code. I already did this but I am not sure if the
PTX code I am generating is correct (is the one that is supposed to be
generated).
For example, currently,
In OpenCL : get_global_id(0) translates to
In LLVM : %call = tail call i32 @get_global_id(i32 0) which translates
to
In PTX: