Displaying 20 results from an estimated 27 matches for "fcuda".
Did you mean:
cuda
2016 Mar 15
2
instrumenting device code with gpucc
...When I only instrumented the device code, I used the
>> following cmd to do so:
>>
>> "/mnt/wtf/tools/bin/clang-3.9" "-cc1" "-triple"
>> "x86_64-unknown-linux-gnu" "-aux-triple" "nvptx64-nvidia-cuda"
>> "-fcuda-target-overloads" "-fcuda-disable-target-call-checks" "-emit-obj"
>> "-disable-free" "-main-file-name" "axpy.cu" "-mrelocation-model"
>> "static" "-mthread-model" "posix" "-fmath-errno&qu...
2016 Mar 05
2
instrumenting device code with gpucc
...ad model: posix
InstalledDir: /usr/local/google/home/jingyue/Work/llvm/install/bin
"/usr/local/google/home/jingyue/Work/llvm/install-git/bin/clang-3.7"
"-cc1" "-triple" "nvptx64-nvidia-cuda" "-aux-triple"
"x86_64-unknown-linux-gnu" "-fcuda-target-overloads"
"-fcuda-disable-target-call-checks" "-S" "-disable-free" "-main-file-name" "
axpy.cu" "-mrelocation-model" "static" "-mthread-model" "posix"
"-mdisable-fp-elim" "-fmath-er...
2016 Mar 13
2
instrumenting device code with gpucc
...nstrumented host code
and fatbin together. When I only instrumented the device code, I used the
following cmd to do so:
"/mnt/wtf/tools/bin/clang-3.9" "-cc1" "-triple" "x86_64-unknown-linux-gnu"
"-aux-triple" "nvptx64-nvidia-cuda" "-fcuda-target-overloads"
"-fcuda-disable-target-call-checks" "-emit-obj" "-disable-free"
"-main-file-name" "axpy.cu" "-mrelocation-model" "static" "-mthread-model"
"posix" "-fmath-errno" "-masm-ve...
2013 Mar 22
2
[LLVMdev] UNREACHABLE executed! error while trying to generate PTX
Well, I tried the command line given by you and I get the following error
clang++ nbody.kernel.cu -Xclang -fcuda-is-device
-I/home/upitamba/llvm-3.2.src/tools/clang/test/SemaCUDA/ -Xclang -triple
-Xclang nvptx64 -Xclang -target-cpu -Xclang sm_10 -S
fatal error: error in backend: Cannot select: 0x334a870: v4f32 =
NVPTXISD::MoveParam 0x334a770 [ORD=1] [ID=22]
0x334a770: v4f32 = TargetExternalSymbol'.PARA...
2016 Mar 10
4
instrumenting device code with gpucc
...r: /usr/local/google/home/jingyue/Work/llvm/install/bin
>> "/usr/local/google/home/jingyue/Work/llvm/install-git/bin/clang-3.7"
>> "-cc1" "-triple" "nvptx64-nvidia-cuda" "-aux-triple"
>> "x86_64-unknown-linux-gnu" "-fcuda-target-overloads"
>> "-fcuda-disable-target-call-checks" "-S" "-disable-free" "-main-file-name" "
>> axpy.cu" "-mrelocation-model" "static" "-mthread-model" "posix"
>> "-mdisable-fp...
2016 Mar 12
2
instrumenting device code with gpucc
Hey Jingyue,
Though I tried `opt -nvvm-reflect` on both bc files, the nvvm reflect
anchor didn't go away; ptxas is still complaining about the duplicate
definition of of function '_ZL21__nvvm_reflect_anchorv' . Did I misused
the nvvm-reflect pass?
Thanks!
yuanfeng
On Fri, Mar 11, 2016 at 10:10 AM, Jingyue Wu <jingyue at google.com> wrote:
> According to the examples you
2013 Mar 21
0
[LLVMdev] UNREACHABLE executed! error while trying to generate PTX
...nit tests, there is a cuda.h header that provides very basic
support for these keywords: tests/SemaCUDA/cuda.h
If you compile as CUDA (use .cu extension, or "-x cuda") and use this
header, you will have basic support. You can invoke clang with something
like:
$ clang test1.cu -Xclang -fcuda-is-device -I ../src/clang/test/SemaCUDA
-Xclang -triple -Xclang nvptx64 -Xclang -target-cpu -Xclang sm_20 -S
... assuming your clang source directory is ../src/clang, you want 64-bit
PTX, and your target SM is 2.0. Adjust accordingly.
Clang also knows how to map OpenCL to PTX, so you would do so...
2013 Mar 24
0
[LLVMdev] UNREACHABLE executed! error while trying to generate PTX
...in trunk, but with your code I'm now hitting
a new issue. I'll get the fix in soon.
On Fri, Mar 22, 2013 at 1:08 AM, upit <uday_pitambare at yahoo.com> wrote:
> Well, I tried the command line given by you and I get the following error
>
> clang++ nbody.kernel.cu -Xclang -fcuda-is-device
> -I/home/upitamba/llvm-3.2.src/tools/clang/test/SemaCUDA/ -Xclang -triple
> -Xclang nvptx64 -Xclang -target-cpu -Xclang sm_10 -S
>
> fatal error: error in backend: Cannot select: 0x334a870: v4f32 =
> NVPTXISD::MoveParam 0x334a770 [ORD=1] [ID=22]
> 0x334a770: v4f32 = T...
2013 Mar 20
2
[LLVMdev] UNREACHABLE executed! error while trying to generate PTX
Thanks a lot Justin,
I will remove the toolkit header. Just one last question..(maybe ;) ) If I
do away with toolkit headers it says unknown type name '__device__'. Does
this function qualifier have an alternative ? or I can just do away with ?
--
View this message in context: http://llvm.1065342.n5.nabble.com/UNREACHABLE-executed-error-while-trying-to-generate-PTX-tp56026p56093.html
2012 Jun 12
2
[LLVMdev] [NVPTX] For linkonce_odr NVPTX generates .weak, but even newest PTXAS can't handle it
...inter - does it chain to some other
printer?), and finally ptxas (both 4.2 and 5) fails to compile it to cubin.
Below is the test case:
> cat test3.cu
__inline__ __attribute__((device)) __attribute__((used)) void test()
{
return;
}
> clang -cc1 -emit-llvm -triple ptx64-unknown-unknown -fcuda-is-device
test3.cu -o test3.ll
> cat test3.ll
; ModuleID = 'test3.cu'
target datalayout = "e-p:64:64-i64:64:64-f64:64:
64-n1:8:16:32:64"
target triple = "ptx64-unknown-unknown"
@llvm.used = appending global [1 x i8*] [i8* bitcast (void ()* @_Z4testv to
i8*)], section...
2012 Jul 10
2
[LLVMdev] [NVPTX] CUDA inline PTX asm definitions scoping "{" "}" is broken
....pred \t%%p2; \n\t"
"setp.ne.u32 \t%%p1, %1, 0; \n\t"
"vote.any.pred \t%%p2, %%p1; \n\t"
"selp.s32 \t%0, 1, 0, %%p2; \n\t"
"}" : "=r"(result) : "r"(a));
return result;
}
> clang -cc1 -emit-llvm -fcuda-is-device -triple ptx64-unknown-unknown
test.cu -o test.ll
> cat test.ll
; ModuleID = 'test.cu'
target datalayout = "e-p:64:64-i64:64:64-f64:64:64-n1:8:16:32:64"
target triple = "ptx64-unknown-unknown"
define ptx_device i32 @_Z5__anyi(i32 %a) nounwind inlinehint {
en...
2012 Sep 04
2
[LLVMdev] [NVPTX] Backend cannot handle array-of-arrays constant
...clang+nvptx and nvcc:
>
> CLANG/NVPTX:
> =============
>
> $ cat dayofweek.cu
> __attribute__((device)) char yweek[7][4] = { "MON", "TUE", "WED",
> "THU", "FRI", "SAT", "SUN" };
> $ clang -cc1 -emit-llvm -fcuda-is-device dayofweek.cu -o dayofweek.ll
> $ cat dayofweek.ll
> ; ModuleID = 'dayofweek.cu'
> target datalayout =
> "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
> ta...
2012 Jul 10
0
[LLVMdev] [NVPTX] CUDA inline PTX asm definitions scoping "{" "}" is broken
...uot;setp.ne.u32 \t%%p1, %1, 0; \n\t"
> "vote.any.pred \t%%p2, %%p1; \n\t"
> "selp.s32 \t%0, 1, 0, %%p2; \n\t"
> "}" : "=r"(result) : "r"(a));
> return result;
> }
>
> > clang -cc1 -emit-llvm -fcuda-is-device -triple ptx64-unknown-unknown test.cu -o test.ll
> > cat test.ll
> ; ModuleID = 'test.cu'
> target datalayout = "e-p:64:64-i64:64:64-f64:64:64-n1:8:16:32:64"
> target triple = "ptx64-unknown-unknown"
>
> define ptx_device i32 @_Z5__anyi(i3...
2012 Jun 13
0
[LLVMdev] [NVPTX] For linkonce_odr NVPTX generates .weak, but even newest PTXAS can't handle it
...and finally ptxas (both 4.2 and 5) fails to compile it to cubin.
> Below is the test case:
>
> > cat test3.cu
>
> __inline__ __attribute__((device)) __attribute__((used)) void test()
> {
> return;
> }
>
> > clang -cc1 -emit-llvm -triple ptx64-unknown-unknown -fcuda-is-device
> test3.cu -o test3.ll
> > cat test3.ll
> ; ModuleID = 'test3.cu'
> target datalayout = "e-p:64:64-i64:64:64-f64:64:
> 64-n1:8:16:32:64"
> target triple = "ptx64-unknown-unknown"
>
> @llvm.used = appending global [1 x i8*] [i8* bitcas...
2012 Sep 04
0
[LLVMdev] [NVPTX] Backend cannot handle array-of-arrays constant
...try to process the same
.cu file with clang+nvptx and nvcc:
CLANG/NVPTX:
=============
$ cat dayofweek.cu
__attribute__((device)) char yweek[7][4] = { "MON", "TUE", "WED",
"THU", "FRI", "SAT", "SUN" };
$ clang -cc1 -emit-llvm -fcuda-is-device dayofweek.cu -o dayofweek.ll
$ cat dayofweek.ll
; ModuleID = 'dayofweek.cu'
target datalayout =
"e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
target triple = "x86_6...
2012 Jul 10
1
[LLVMdev] [NVPTX] CUDA inline PTX asm definitions scoping "{" "}" is broken
...quot;setp.ne.u32 \t%%p1, %1, 0; \n\t"
> "vote.any.pred \t%%p2, %%p1; \n\t"
> "selp.s32 \t%0, 1, 0, %%p2; \n\t"
> "}" : "=r"(result) : "r"(a));
> return result;
> }
>
> > clang -cc1 -emit-llvm -fcuda-is-device -triple ptx64-unknown-unknown
> test.cu -o test.ll
> > cat test.ll
> ; ModuleID = 'test.cu'
> target datalayout = "e-p:64:64-i64:64:64-f64:64:64-n1:8:16:32:64"
> target triple = "ptx64-unknown-unknown"
>
> define ptx_device i32 @_Z5__any...
2012 Sep 06
0
[LLVMdev] [NVPTX] Backend cannot handle array-of-arrays constant
...>> CLANG/NVPTX:
>> =============
>>
>> $ cat dayofweek.cu
>> __attribute__((device)) char yweek[7][4] = { "MON", "TUE", "WED",
>> "THU", "FRI", "SAT", "SUN" };
>> $ clang -cc1 -emit-llvm -fcuda-is-device dayofweek.cu -o dayofweek.ll
>> $ cat dayofweek.ll
>> ; ModuleID = 'dayofweek.cu'
>> target datalayout =
>> "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S1...
2012 Sep 03
2
[LLVMdev] [NVPTX] Backend cannot handle array-of-arrays constant
Dear all,
Looks like the NVPTX backend cannot handle array-of-arrays contant
(please see the reporocase below). Is it supposed to work? Any ideas
how to get it working? Important for our target applications.
Thanks,
- Dima.
$ cat test.ll
; ModuleID = '__kernelgen_main_module'
target datalayout =
2016 Aug 01
0
[GPUCC] link against libdevice
...64*
*Found CUDA installation: /usr/local/cuda*
* "/usr/local/bin/clang-3.9" -cc1 -triple nvptx64-nvidia-cuda -aux-triple
x86_64-unknown-linux-gnu -S -disable-free -main-file-name scalarProd.cu
-mrelocation-model static -mthread-model posix -mdisable-fp-elim
-fmath-errno -no-integrated-as -fcuda-is-device -target-cpu sm_50 -v
-dwarf-column-info -debugger-tuning=gdb -resource-dir
/usr/local/bin/../lib/clang/3.9.0 -I ../ -I
/usr/local/cuda-7.0/samples/common/inc -internal-isystem
/usr/lib/gcc/x86_64-linux-gnu/4.8/../../../../include/c++/4.8
-internal-isystem
/usr/lib/gcc/x86_64-linux-gnu/4.8...
2016 Aug 01
3
[GPUCC] link against libdevice
...stallation: /usr/local/cuda
> "/usr/local/bin/clang-3.9" -cc1 -triple nvptx64-nvidia-cuda -aux-triple
> x86_64-unknown-linux-gnu -S -disable-free -main-file-name scalarProd.cu
> -mrelocation-model static -mthread-model posix -mdisable-fp-elim
> -fmath-errno -no-integrated-as -fcuda-is-device -target-cpu sm_50 -v
> -dwarf-column-info -debugger-tuning=gdb -resource-dir
> /usr/local/bin/../lib/clang/3.9.0 -I ../ -I
> /usr/local/cuda-7.0/samples/common/inc -internal-isystem
> /usr/lib/gcc/x86_64-linux-gnu/4.8/../../../../include/c++/4.8
> -internal-isystem
> /us...