search for: fcuda

Displaying 20 results from an estimated 27 matches for "fcuda".

Did you mean: cuda
2016 Mar 15
2
instrumenting device code with gpucc
...When I only instrumented the device code, I used the >> following cmd to do so: >> >> "/mnt/wtf/tools/bin/clang-3.9" "-cc1" "-triple" >> "x86_64-unknown-linux-gnu" "-aux-triple" "nvptx64-nvidia-cuda" >> "-fcuda-target-overloads" "-fcuda-disable-target-call-checks" "-emit-obj" >> "-disable-free" "-main-file-name" "axpy.cu" "-mrelocation-model" >> "static" "-mthread-model" "posix" "-fmath-errno&qu...
2016 Mar 05
2
instrumenting device code with gpucc
...ad model: posix InstalledDir: /usr/local/google/home/jingyue/Work/llvm/install/bin "/usr/local/google/home/jingyue/Work/llvm/install-git/bin/clang-3.7" "-cc1" "-triple" "nvptx64-nvidia-cuda" "-aux-triple" "x86_64-unknown-linux-gnu" "-fcuda-target-overloads" "-fcuda-disable-target-call-checks" "-S" "-disable-free" "-main-file-name" " axpy.cu" "-mrelocation-model" "static" "-mthread-model" "posix" "-mdisable-fp-elim" "-fmath-er...
2016 Mar 13
2
instrumenting device code with gpucc
...nstrumented host code and fatbin together. When I only instrumented the device code, I used the following cmd to do so: "/mnt/wtf/tools/bin/clang-3.9" "-cc1" "-triple" "x86_64-unknown-linux-gnu" "-aux-triple" "nvptx64-nvidia-cuda" "-fcuda-target-overloads" "-fcuda-disable-target-call-checks" "-emit-obj" "-disable-free" "-main-file-name" "axpy.cu" "-mrelocation-model" "static" "-mthread-model" "posix" "-fmath-errno" "-masm-ve...
2013 Mar 22
2
[LLVMdev] UNREACHABLE executed! error while trying to generate PTX
Well, I tried the command line given by you and I get the following error clang++ nbody.kernel.cu -Xclang -fcuda-is-device -I/home/upitamba/llvm-3.2.src/tools/clang/test/SemaCUDA/ -Xclang -triple -Xclang nvptx64 -Xclang -target-cpu -Xclang sm_10 -S fatal error: error in backend: Cannot select: 0x334a870: v4f32 = NVPTXISD::MoveParam 0x334a770 [ORD=1] [ID=22] 0x334a770: v4f32 = TargetExternalSymbol'.PARA...
2016 Mar 10
4
instrumenting device code with gpucc
...r: /usr/local/google/home/jingyue/Work/llvm/install/bin >> "/usr/local/google/home/jingyue/Work/llvm/install-git/bin/clang-3.7" >> "-cc1" "-triple" "nvptx64-nvidia-cuda" "-aux-triple" >> "x86_64-unknown-linux-gnu" "-fcuda-target-overloads" >> "-fcuda-disable-target-call-checks" "-S" "-disable-free" "-main-file-name" " >> axpy.cu" "-mrelocation-model" "static" "-mthread-model" "posix" >> "-mdisable-fp...
2016 Mar 12
2
instrumenting device code with gpucc
Hey Jingyue, Though I tried `opt -nvvm-reflect` on both bc files, the nvvm reflect anchor didn't go away; ptxas is still complaining about the duplicate definition of of function '_ZL21__nvvm_reflect_anchorv' . Did I misused the nvvm-reflect pass? Thanks! yuanfeng On Fri, Mar 11, 2016 at 10:10 AM, Jingyue Wu <jingyue at google.com> wrote: > According to the examples you
2013 Mar 21
0
[LLVMdev] UNREACHABLE executed! error while trying to generate PTX
...nit tests, there is a cuda.h header that provides very basic support for these keywords: tests/SemaCUDA/cuda.h If you compile as CUDA (use .cu extension, or "-x cuda") and use this header, you will have basic support. You can invoke clang with something like: $ clang test1.cu -Xclang -fcuda-is-device -I ../src/clang/test/SemaCUDA -Xclang -triple -Xclang nvptx64 -Xclang -target-cpu -Xclang sm_20 -S ... assuming your clang source directory is ../src/clang, you want 64-bit PTX, and your target SM is 2.0. Adjust accordingly. Clang also knows how to map OpenCL to PTX, so you would do so...
2013 Mar 24
0
[LLVMdev] UNREACHABLE executed! error while trying to generate PTX
...in trunk, but with your code I'm now hitting a new issue. I'll get the fix in soon. On Fri, Mar 22, 2013 at 1:08 AM, upit <uday_pitambare at yahoo.com> wrote: > Well, I tried the command line given by you and I get the following error > > clang++ nbody.kernel.cu -Xclang -fcuda-is-device > -I/home/upitamba/llvm-3.2.src/tools/clang/test/SemaCUDA/ -Xclang -triple > -Xclang nvptx64 -Xclang -target-cpu -Xclang sm_10 -S > > fatal error: error in backend: Cannot select: 0x334a870: v4f32 = > NVPTXISD::MoveParam 0x334a770 [ORD=1] [ID=22] > 0x334a770: v4f32 = T...
2013 Mar 20
2
[LLVMdev] UNREACHABLE executed! error while trying to generate PTX
Thanks a lot Justin, I will remove the toolkit header. Just one last question..(maybe ;) ) If I do away with toolkit headers it says unknown type name '__device__'. Does this function qualifier have an alternative ? or I can just do away with ? -- View this message in context: http://llvm.1065342.n5.nabble.com/UNREACHABLE-executed-error-while-trying-to-generate-PTX-tp56026p56093.html
2012 Jun 12
2
[LLVMdev] [NVPTX] For linkonce_odr NVPTX generates .weak, but even newest PTXAS can't handle it
...inter - does it chain to some other printer?), and finally ptxas (both 4.2 and 5) fails to compile it to cubin. Below is the test case: > cat test3.cu __inline__ __attribute__((device)) __attribute__((used)) void test() { return; } > clang -cc1 -emit-llvm -triple ptx64-unknown-unknown -fcuda-is-device test3.cu -o test3.ll > cat test3.ll ; ModuleID = 'test3.cu' target datalayout = "e-p:64:64-i64:64:64-f64:64: 64-n1:8:16:32:64" target triple = "ptx64-unknown-unknown" @llvm.used = appending global [1 x i8*] [i8* bitcast (void ()* @_Z4testv to i8*)], section...
2012 Jul 10
2
[LLVMdev] [NVPTX] CUDA inline PTX asm definitions scoping "{" "}" is broken
....pred \t%%p2; \n\t" "setp.ne.u32 \t%%p1, %1, 0; \n\t" "vote.any.pred \t%%p2, %%p1; \n\t" "selp.s32 \t%0, 1, 0, %%p2; \n\t" "}" : "=r"(result) : "r"(a)); return result; } > clang -cc1 -emit-llvm -fcuda-is-device -triple ptx64-unknown-unknown test.cu -o test.ll > cat test.ll ; ModuleID = 'test.cu' target datalayout = "e-p:64:64-i64:64:64-f64:64:64-n1:8:16:32:64" target triple = "ptx64-unknown-unknown" define ptx_device i32 @_Z5__anyi(i32 %a) nounwind inlinehint { en...
2012 Sep 04
2
[LLVMdev] [NVPTX] Backend cannot handle array-of-arrays constant
...clang+nvptx and nvcc: > > CLANG/NVPTX: > ============= > > $ cat dayofweek.cu > __attribute__((device)) char yweek[7][4] = { "MON", "TUE", "WED", > "THU", "FRI", "SAT", "SUN" }; > $ clang -cc1 -emit-llvm -fcuda-is-device dayofweek.cu -o dayofweek.ll > $ cat dayofweek.ll > ; ModuleID = 'dayofweek.cu' > target datalayout = > "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128" > ta...
2012 Jul 10
0
[LLVMdev] [NVPTX] CUDA inline PTX asm definitions scoping "{" "}" is broken
...uot;setp.ne.u32 \t%%p1, %1, 0; \n\t" > "vote.any.pred \t%%p2, %%p1; \n\t" > "selp.s32 \t%0, 1, 0, %%p2; \n\t" > "}" : "=r"(result) : "r"(a)); > return result; > } > > > clang -cc1 -emit-llvm -fcuda-is-device -triple ptx64-unknown-unknown test.cu -o test.ll > > cat test.ll > ; ModuleID = 'test.cu' > target datalayout = "e-p:64:64-i64:64:64-f64:64:64-n1:8:16:32:64" > target triple = "ptx64-unknown-unknown" > > define ptx_device i32 @_Z5__anyi(i3...
2012 Jun 13
0
[LLVMdev] [NVPTX] For linkonce_odr NVPTX generates .weak, but even newest PTXAS can't handle it
...and finally ptxas (both 4.2 and 5) fails to compile it to cubin. > Below is the test case: > > > cat test3.cu > > __inline__ __attribute__((device)) __attribute__((used)) void test() > { > return; > } > > > clang -cc1 -emit-llvm -triple ptx64-unknown-unknown -fcuda-is-device > test3.cu -o test3.ll > > cat test3.ll > ; ModuleID = 'test3.cu' > target datalayout = "e-p:64:64-i64:64:64-f64:64: > 64-n1:8:16:32:64" > target triple = "ptx64-unknown-unknown" > > @llvm.used = appending global [1 x i8*] [i8* bitcas...
2012 Sep 04
0
[LLVMdev] [NVPTX] Backend cannot handle array-of-arrays constant
...try to process the same .cu file with clang+nvptx and nvcc: CLANG/NVPTX: ============= $ cat dayofweek.cu __attribute__((device)) char yweek[7][4] = { "MON", "TUE", "WED", "THU", "FRI", "SAT", "SUN" }; $ clang -cc1 -emit-llvm -fcuda-is-device dayofweek.cu -o dayofweek.ll $ cat dayofweek.ll ; ModuleID = 'dayofweek.cu' target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128" target triple = "x86_6...
2012 Jul 10
1
[LLVMdev] [NVPTX] CUDA inline PTX asm definitions scoping "{" "}" is broken
...quot;setp.ne.u32 \t%%p1, %1, 0; \n\t" > "vote.any.pred \t%%p2, %%p1; \n\t" > "selp.s32 \t%0, 1, 0, %%p2; \n\t" > "}" : "=r"(result) : "r"(a)); > return result; > } > > > clang -cc1 -emit-llvm -fcuda-is-device -triple ptx64-unknown-unknown > test.cu -o test.ll > > cat test.ll > ; ModuleID = 'test.cu' > target datalayout = "e-p:64:64-i64:64:64-f64:64:64-n1:8:16:32:64" > target triple = "ptx64-unknown-unknown" > > define ptx_device i32 @_Z5__any...
2012 Sep 06
0
[LLVMdev] [NVPTX] Backend cannot handle array-of-arrays constant
...>> CLANG/NVPTX: >> ============= >> >> $ cat dayofweek.cu >> __attribute__((device)) char yweek[7][4] = { "MON", "TUE", "WED", >> "THU", "FRI", "SAT", "SUN" }; >> $ clang -cc1 -emit-llvm -fcuda-is-device dayofweek.cu -o dayofweek.ll >> $ cat dayofweek.ll >> ; ModuleID = 'dayofweek.cu' >> target datalayout = >> "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S1...
2012 Sep 03
2
[LLVMdev] [NVPTX] Backend cannot handle array-of-arrays constant
Dear all, Looks like the NVPTX backend cannot handle array-of-arrays contant (please see the reporocase below). Is it supposed to work? Any ideas how to get it working? Important for our target applications. Thanks, - Dima. $ cat test.ll ; ModuleID = '__kernelgen_main_module' target datalayout =
2016 Aug 01
0
[GPUCC] link against libdevice
...64* *Found CUDA installation: /usr/local/cuda* * "/usr/local/bin/clang-3.9" -cc1 -triple nvptx64-nvidia-cuda -aux-triple x86_64-unknown-linux-gnu -S -disable-free -main-file-name scalarProd.cu -mrelocation-model static -mthread-model posix -mdisable-fp-elim -fmath-errno -no-integrated-as -fcuda-is-device -target-cpu sm_50 -v -dwarf-column-info -debugger-tuning=gdb -resource-dir /usr/local/bin/../lib/clang/3.9.0 -I ../ -I /usr/local/cuda-7.0/samples/common/inc -internal-isystem /usr/lib/gcc/x86_64-linux-gnu/4.8/../../../../include/c++/4.8 -internal-isystem /usr/lib/gcc/x86_64-linux-gnu/4.8...
2016 Aug 01
3
[GPUCC] link against libdevice
...stallation: /usr/local/cuda > "/usr/local/bin/clang-3.9" -cc1 -triple nvptx64-nvidia-cuda -aux-triple > x86_64-unknown-linux-gnu -S -disable-free -main-file-name scalarProd.cu > -mrelocation-model static -mthread-model posix -mdisable-fp-elim > -fmath-errno -no-integrated-as -fcuda-is-device -target-cpu sm_50 -v > -dwarf-column-info -debugger-tuning=gdb -resource-dir > /usr/local/bin/../lib/clang/3.9.0 -I ../ -I > /usr/local/cuda-7.0/samples/common/inc -internal-isystem > /usr/lib/gcc/x86_64-linux-gnu/4.8/../../../../include/c++/4.8 > -internal-isystem > /us...