search for: sm_10

Displaying 20 results from an estimated 27 matches for "sm_10".

2016 Jun 02
3
PTX generation from CUDA file for compute capability 1.0 (sm_10)
Hello Bergström/Eric, Thanks for the reply. The G80 (sm_10) architecture was ported to an FPGA by a group of researchers (http://www.ecs.umass.edu/ece/tessier/andryc-fpt13.pdf). Our group has further research interest in this work. I have been modifying Clang-LLVM for a couple of months and have made the required changes. But Clang-LLVM is only...
2016 Jun 02
5
PTX generation from CUDA file for compute capability 1.0 (sm_10)
Hello, When generating PTX output from a CUDA file (.cu file), the minimum target accepted by LLVM is sm_20. But I have a specific requirement to generate PTX output for compute capability 1.0 (sm_10). Is there any previous version of LLVM that supports this? Thank you, Ginu
2012 Sep 03
2
[LLVMdev] [NVPTX] Backend cannot handle array-of-arrays constant
..."MON", [3 x i8] c"TUE", [3 x i8] c"WED", [3 x i8] c"THU", [3 x i8] c"FRI", [3 x i8] c"SAT", [3 x i8] c"SUN"], align 4096 $ llc -march="nvptx" test.ll -o - // // Generated by LLVM NVPTX Back-End // .version 3.0 .target sm_10, texmode_independent .address_size 32 Unexpected Constant type UNREACHABLE executed at /home/marcusmae/rpmbuild/BUILD/llvm/lib/Target/NVPTX/NVPTXAsmPrinter.cpp:1903! 0 libLLVM-3.2svn.so 0x00007f1bcb71bf0e 1 libLLVM-3.2svn.so 0x00007f1bcb71bd0a 2 libpthread.so.0 0x00007f1bca33ccb0 3 libc.so.6...
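The same three header directives (.version, .target, .address_size) recur in most of the PTX snippets in these results. As a reading aid, here is a minimal sketch of pulling them out of generated PTX text. The function name parse_ptx_header and the regular expression are illustrative assumptions, not part of LLVM or any CUDA tool.

```python
import re

# Assumption: the header directives appear as ".name value" pairs, as in the
# snippets above. parse_ptx_header is a hypothetical helper, not an LLVM API.
HEADER_RE = re.compile(r"\.(version|target|address_size)\s+([^\s,]+)")


def parse_ptx_header(text):
    """Return the first .version, .target and .address_size values found."""
    fields = {}
    for key, value in HEADER_RE.findall(text):
        # Keep only the first occurrence of each directive.
        fields.setdefault(key, value)
    return fields


sample = (
    "// Generated by LLVM NVPTX Back-End // "
    ".version 3.0 .target sm_10, texmode_independent .address_size 32"
)
header = parse_ptx_header(sample)
print(header)
```

For the header quoted above this yields version 3.0, target sm_10 and address size 32; the texmode_independent option after the comma is deliberately ignored by this sketch.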
2012 Sep 04
2
[LLVMdev] [NVPTX] Backend cannot handle array-of-arrays constant
..."WED\00", [4 x i8] c"THU\00", [4 x i8] c"FRI\00", > [4 x i8] c"SAT\00", [4 x i8] c"SUN\00"], align 16 > $ llc -march="nvptx" dayofweek.ll -o - > // > // Generated by LLVM NVPTX Back-End > // > > .version 3.0 > .target sm_10, texmode_independent > .address_size 32 > > Unexpected Constant type > UNREACHABLE executed at > /home/marcusmae/rpmbuild/BUILD/llvm/lib/Target/NVPTX/NVPTXAsmPrinter.cpp:1903! > > NVCC > ===== > > $ nvcc -c -keep dayofweek.cu > $ cat dayofweek.ptx > >...
2013 Mar 22
2
[LLVMdev] UNREACHABLE executed! error while trying to generate PTX
Well, I tried the command line given by you and I get the following error: clang++ nbody.kernel.cu -Xclang -fcuda-is-device -I/home/upitamba/llvm-3.2.src/tools/clang/test/SemaCUDA/ -Xclang -triple -Xclang nvptx64 -Xclang -target-cpu -Xclang sm_10 -S fatal error: error in backend: Cannot select: 0x334a870: v4f32 = NVPTXISD::MoveParam 0x334a770 [ORD=1] [ID=22] 0x334a770: v4f32 = TargetExternalSymbol'.PARAM0' [ID=1] In function: computeBodyAccel Am I doing anything wrong here? Attached my new nbody.kernel.cu <http://llvm.1065...
2012 Sep 04
0
[LLVMdev] [NVPTX] Backend cannot handle array-of-arrays constant
...x i8] c"TUE\00", [4 x i8] c"WED\00", [4 x i8] c"THU\00", [4 x i8] c"FRI\00", [4 x i8] c"SAT\00", [4 x i8] c"SUN\00"], align 16 $ llc -march="nvptx" dayofweek.ll -o - // // Generated by LLVM NVPTX Back-End // .version 3.0 .target sm_10, texmode_independent .address_size 32 Unexpected Constant type UNREACHABLE executed at /home/marcusmae/rpmbuild/BUILD/llvm/lib/Target/NVPTX/NVPTXAsmPrinter.cpp:1903! NVCC ===== $ nvcc -c -keep dayofweek.cu $ cat dayofweek.ptx .global .align 1 .b8 yweek[28] = {0x4d,0x4f,0x4e,0x0,0x54,0x55,0x45,...
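The nvcc output in this snippet shows the array-of-arrays constant emitted as one flat 28-byte .b8 array. A minimal sketch of that flattening follows, assuming seven 3-letter day names each padded with a NUL byte. The helper names flatten_rows and ptx_initializer are hypothetical and only imitate the shape of the quoted line; they are not how nvcc or the NVPTX backend is implemented.

```python
# Assumption: seven day names, each stored as [4 x i8] with a trailing NUL,
# matching the LLVM IR shown in the snippet above.
DAYS = ["MON", "TUE", "WED", "THU", "FRI", "SAT", "SUN"]


def flatten_rows(rows):
    """Concatenate each row plus one NUL byte into a single flat byte list."""
    flat = []
    for row in rows:
        flat.extend(row.encode("ascii") + b"\x00")
    return flat


def ptx_initializer(name, flat):
    """Format a flat byte list in the style of the quoted nvcc .b8 line."""
    body = ",".join(hex(b) for b in flat)
    return f".global .align 1 .b8 {name}[{len(flat)}] = {{{body}}};"


flat = flatten_rows(DAYS)
line = ptx_initializer("yweek", flat)
print(line)
```

The first bytes come out as 0x4d,0x4f,0x4e,0x0,0x54,0x55,0x45, which agrees with the visible part of the nvcc output; the rest of that output is truncated in the snippet and is not reconstructed here.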
2012 Sep 06
0
[LLVMdev] [NVPTX] Backend cannot handle array-of-arrays constant
..."THU\00", [4 x i8] c"FRI\00", >> [4 x i8] c"SAT\00", [4 x i8] c"SUN\00"], align 16 >> $ llc -march="nvptx" dayofweek.ll -o - >> // >> // Generated by LLVM NVPTX Back-End >> // >> >> .version 3.0 >> .target sm_10, texmode_independent >> .address_size 32 >> >> Unexpected Constant type >> UNREACHABLE executed at >> /home/marcusmae/rpmbuild/BUILD/llvm/lib/Target/NVPTX/NVPTXAsmPrinter.cpp:1903! >> >> NVCC >> ===== >> >> $ nvcc -c -keep dayofweek.cu >...
2011 Feb 25
2
Missing R.h
Hi, I'm trying to install a module - gputools - and keep getting compile-time errors about a missing R.h. Does anyone know where this file can be found? Thanks!
2013 Mar 24
0
[LLVMdev] UNREACHABLE executed! error while trying to generate PTX
...pitambare at yahoo.com> wrote: > Well, I tried the command line given by you and I get the following error > > clang++ nbody.kernel.cu -Xclang -fcuda-is-device > -I/home/upitamba/llvm-3.2.src/tools/clang/test/SemaCUDA/ -Xclang -triple > -Xclang nvptx64 -Xclang -target-cpu -Xclang sm_10 -S > > fatal error: error in backend: Cannot select: 0x334a870: v4f32 = > NVPTXISD::MoveParam 0x334a770 [ORD=1] [ID=22] > 0x334a770: v4f32 = TargetExternalSymbol'.PARAM0' [ID=1] > In function: computeBodyAccel > > Am I doing anything wrong here ? > > Attached my...
2012 Jul 18
2
[LLVMdev] [NVPTX] PTXAS - Unimplemented feature: labels as initial values
...cst12 = hidden constant [3 x i8] c"aa\00", align 4096 @.cst2 = hidden constant [26 x i8] c"Usage: %s <nx> <ny> <nz>\0A\00", align 4096 > llc -march=nvptx64 test.ll -o test.ptx > cat test.ptx // // Generated by LLVM NVPTX Back-End // .version 3.0 .target sm_10, texmode_independent .address_size 64 .visible .global .align 4096 .b8 _2E_cst[11] = {114, 101, 100, 117, 99, 101, 46, 102, 57, 48, 0}; .visible .global .align 4096 .u64 z = _2E_cst1; .visible .global .align 4096 .b8 _2E_cst1[3] = {122, 122, 0}; .visible .global .align 4096 .u64 a = _2E_cst12; .vi...
2012 Jul 10
2
[LLVMdev] [NVPTX] CUDA inline PTX asm definitions scoping "{" "}" is broken
...oc !0 store i32 %1, i32* %result, align 4 %2 = load i32* %result, align 4 ret i32 %2 } !0 = metadata !{i32 127, i32 132, i32 166, i32 200, i32 242, i32 285, i32 327} > llc -march=nvptx64 test.ll -o test.ptx > cat test.ptx // // Generated by LLVM NVPTX Back-End // .version 3.0 .target sm_10, texmode_independent .address_size 64 // .globl _Z5__anyi .visible .global .align 4 .b8 __local_depot0[8]; .func (.reg .b32 func_retval0) _Z5__anyi( .reg .b32 _Z5__anyi_param_0 ) // @_Z5__anyi { .reg .b64 %SP; .reg .b64 %SPL;...
2012 Jul 18
0
[LLVMdev] [NVPTX] PTXAS - Unimplemented feature: labels as initial values
...cst12 = hidden constant [3 x i8] c"aa\00", align 4096 @.cst2 = hidden constant [26 x i8] c"Usage: %s <nx> <ny> <nz>\0A\00", align 4096 > llc -march=nvptx64 test.ll -o test.ptx > cat test.ptx // // Generated by LLVM NVPTX Back-End // .version 3.0 .target sm_10, texmode_independent .address_size 64 .visible .global .align 4096 .b8 _2E_cst[11] = {114, 101, 100, 117, 99, 101, 46, 102, 57, 48, 0}; .visible .global .align 4096 .u64 z = _2E_cst1; .visible .global .align 4096 .b8 _2E_cst1[3] = {122, 122, 0}; .visible .global .align 4096 .u64 a = _2E_cst12; .vi...
2012 Jul 10
0
[LLVMdev] [NVPTX] CUDA inline PTX asm definitions scoping "{" "}" is broken
...ult, align 4 > ret i32 %2 > } > > !0 = metadata !{i32 127, i32 132, i32 166, i32 200, i32 242, i32 285, i32 327} > > > llc -march=nvptx64 test.ll -o test.ptx > > cat test.ptx > // > // Generated by LLVM NVPTX Back-End > // > > .version 3.0 > .target sm_10, texmode_independent > .address_size 64 > > > // .globl _Z5__anyi > .visible .global .align 4 .b8 __local_depot0[8]; > > .func (.reg .b32 func_retval0) _Z5__anyi( > .reg .b32 _Z5__anyi_param_0 > ) // @_Z5__anyi >...
2012 Jul 10
1
[LLVMdev] [NVPTX] CUDA inline PTX asm definitions scoping "{" "}" is broken
...t, align 4 > ret i32 %2 > } > > !0 = metadata !{i32 127, i32 132, i32 166, i32 200, i32 242, i32 285, i32 > 327} > > > llc -march=nvptx64 test.ll -o test.ptx > > cat test.ptx > // > // Generated by LLVM NVPTX Back-End > // > > .version 3.0 > .target sm_10, texmode_independent > .address_size 64 > > > // .globl _Z5__anyi > .visible .global .align 4 .b8 __local_depot0[8]; > > .func (.reg .b32 func_retval0) _Z5__anyi( > .reg .b32 _Z5__anyi_param_0 > ) // @_Z5__anyi > {...
2013 Mar 01
4
[LLVMdev] NVPTX CUDA_ERROR_NO_BINARY_FOR_GPU
...> error: CUDA_ERROR_NO_BINARY_FOR_GPU. I'm running this on a 2012 MBP > > with a 640M GPU. > > > > PTX Code (for a mandelbrot calculation): > > > > // > > // Generated by LLVM NVPTX Back-End > > // > > > > .version 3.1 > > .target sm_10, texmode_independent > > .address_size 64 > > > > .func (.reg .b32 func_retval0) INT_PTX_SREG_CTAID_X > > ( > > > > ) > > ; > > .func (.reg .b32 func_retval0) INT_PTX_SREG_CTAID_Y > > ( > > > > ) > > ; > > .func (.r...
2013 Mar 21
0
[LLVMdev] UNREACHABLE executed! error while trying to generate PTX
Not really. Clang does not have a way to annotate device vs. kernel functions in C/C++ mode. You're probably better off trying to use OpenCL or CUDA mode in clang. In the clang unit tests, there is a cuda.h header that provides very basic support for these keywords: tests/SemaCUDA/cuda.h If you compile as CUDA (use .cu extension, or "-x cuda") and use this header, you will have
2013 Mar 20
2
[LLVMdev] UNREACHABLE executed! error while trying to generate PTX
Thanks a lot Justin, I will remove the toolkit header. Just one last question (maybe ;) ): if I do away with the toolkit headers, it says unknown type name '__device__'. Does this function qualifier have an alternative, or can I just do away with it?
2011 May 06
2
[LLVMdev] [PATCH ]Add Subtarget ptx23
...; RUN: llc < %s -march=ptx32 -mattr=ptx21 | grep ".version 2.1" ; RUN: llc < %s -march=ptx32 -mattr=ptx22 | grep ".version 2.2" +; RUN: llc < %s -march=ptx32 -mattr=ptx23 | grep ".version 2.3" ; RUN: llc < %s -march=ptx32 -mattr=sm10 | grep ".target sm_10" ; RUN: llc < %s -march=ptx32 -mattr=sm13 | grep ".target sm_13" ; RUN: llc < %s -march=ptx32 -mattr=sm20 | grep ".target sm_20" Index: lib/Target/PTX/PTX.td =================================================================== --- lib/Target/PTX/PTX.td (revision 1309...
2013 Mar 01
0
[LLVMdev] NVPTX CUDA_ERROR_NO_BINARY_FOR_GPU
...unning this on a 2012 > > > MBP with a 640M GPU. > > > > > > PTX Code (for a mandelbrot calculation): > > > > > > // > > > // Generated by LLVM NVPTX Back-End > > > // > > > > > > .version 3.1 > > > .target sm_10, texmode_independent > > > .address_size 64 > > > > > > .func    (.reg .b32 func_retval0) INT_PTX_SREG_CTAID_X > > > ( > > > > > > ) > > > ; > > > .func    (.reg .b32 func_retval0) INT_PTX_SREG_CTAID_Y > > > ( >...
2013 Mar 01
0
[LLVMdev] NVPTX CUDA_ERROR_NO_BINARY_FOR_GPU
...PU. I'm running this on a 2012 MBP >> > with a 640M GPU. >> > >> > PTX Code (for a mandelbrot calculation): >> > >> > // >> > // Generated by LLVM NVPTX Back-End >> > // >> > >> > .version 3.1 >> > .target sm_10, texmode_independent >> > .address_size 64 >> > >> > .func (.reg .b32 func_retval0) INT_PTX_SREG_CTAID_X >> > ( >> > >> > ) >> > ; >> > .func (.reg .b32 func_retval0) INT_PTX_SREG_CTAID_Y >> > ( >> > >>...