search for: cuda_runtim

Displaying 5 results from an estimated 5 matches for "cuda_runtim".

Did you mean: cuda_runtime
2013 Mar 18
2
[LLVMdev] UNREACHABLE executed! error while trying to generate PTX
Please find the .ll attached below . Yes, I am using the cuda_runtime.h from the toolkit. nbody.kernel.ll <http://llvm.1065342.n5.nabble.com/file/n56048/nbody.kernel.ll> - Uday -- View this message in context: http://llvm.1065342.n5.nabble.com/UNREACHABLE-executed-error-while-trying-to-generate-PTX-tp56026p56048.html Sent from the LLVM - Dev mailing list...
2013 Mar 18
0
[LLVMdev] UNREACHABLE executed! error while trying to generate PTX
I noticed you're using cuda_runtime.h in the source file. Where are you getting this file? From the CUDA toolkit? Since the error is in the back-end, can you just post the .ll or .bc file you are trying to compile? On Mon, Mar 18, 2013 at 12:42 AM, upit <uday_pitambare at yahoo.com> wrote: > I am trying to generate PT...
2013 Mar 18
2
[LLVMdev] UNREACHABLE executed! error while trying to generate PTX
I am trying to generate PTX code for 'nbody' sample program's kernel (nbody_kernel.cu) using clang/LLVM version 3.2. The nbody CUDA program is available in Nvidia's SDK. I am referring to https://github.com/jholewinski/llvm-ptx-samples project. Following are my commands, clang++ -O4 -S -I/usr/local/cuda/include -emit-llvm -target nvptx64 nbody_kernel.cu -o nbody_kernel.ll
2017 Jun 09
1
NVPTX Back-end: relocatable device code support for dynamic parallelism
...50 -E -x c++ -DCUDA_DOUBLE_MATH_FUNCTIONS -D__CUDACC__ -D__NVCC__ -D__CUDACC_RDC__ "-I/opt/cuda-8.0/bin/..//include" -D"__CUDACC_VER__=80061" -D"__CUDACC_VER_BUILD__=61" -D"__CUDACC_VER_MINOR__=0" -D"__CUDACC_VER_MAJOR__=8" -include "cuda_runtime.h" -m64 "../testApps/cuda_id_test.cu" > "/tmp/tmpxft_00007040_00000000-9_cuda_id_test.cpp1.ii" #$ cudafe --allow_managed --m64 --gnu_version=40805 --c++11 -tused --no_remove_unneeded_entities --device-c --gen_c_file_name "/tmp/tmpxft_00007040_00000000-4_cuda_id_t...
2013 Mar 20
0
[LLVMdev] UNREACHABLE executed! error while trying to generate PTX
...kit headers. If you want to use the __constant__ modifier from CUDA in Clang, define it like so: #define __constant__ __attribute__((address_space(2))) On Mon, Mar 18, 2013 at 6:31 PM, upit <uday_pitambare at yahoo.com> wrote: > Please find the .ll attached below . Yes, I am using the cuda_runtime.h > from > the toolkit. > nbody.kernel.ll > <http://llvm.1065342.n5.nabble.com/file/n56048/nbody.kernel.ll> > - Uday > > > > > -- > View this message in context: > http://llvm.1065342.n5.nabble.com/UNREACHABLE-executed-error-while-trying-to-generate-PTX-tp...