Displaying 20 results from an estimated 24 matches for "axpy".
2016 Mar 05
2
instrumenting device code with gpucc
...pucc in my project, and I would be really
> grateful for your help!
>
> Currently we're trying to instrument CUDA code using LLVM 3.9, and I've
> written a pass to insert hook functions for certain function calls and
> memory accesses. For example, given a CUDA program, say, axpy.cu, I
> first compile it with
>
> clang++ -emit-llvm -c axpy.cu,
>
> which gives me two bitcode files, axpy.bc and axpy-sm_20.bc. Then I use
> opt to load my pass and insert the hook functions to axpy.bc, which works
> fine. After inspecting the instrumented axpy.bc, I notic...
2016 Mar 10
4
instrumenting device code with gpucc
...IR of your device code into one IR (via linking or direct IR emitting)
before the IR to PTX.
On Wed, Mar 9, 2016 at 4:31 PM, Yuanfeng Peng <yuanfeng.jack.peng at gmail.com>
wrote:
> Hi Jingyue,
>
> Thanks for the instructions! I instrumented the device code and got a
> binary of axpy.cu; however, the resulting executable always fails on the
> first cudaMalloc call in host code (the kernel had not even been launched
> yet), with the error code being 30 (cudaErrorUnknown). In my
> instrumentation pass, I only inserted a hook function upon each access to
> device mem...
2016 Mar 13
2
instrumenting device code with gpucc
..." "-cc1" "-triple" "x86_64-unknown-linux-gnu"
"-aux-triple" "nvptx64-nvidia-cuda" "-fcuda-target-overloads"
"-fcuda-disable-target-call-checks" "-emit-obj" "-disable-free"
"-main-file-name" "axpy.cu" "-mrelocation-model" "static" "-mthread-model"
"posix" "-fmath-errno" "-masm-verbose" "-mconstructor-aliases"
"-munwind-tables" "-fuse-init-array" "-target-cpu" "x86-64"
"-mom...
2016 Mar 15
2
instrumenting device code with gpucc
..."-triple"
>> "x86_64-unknown-linux-gnu" "-aux-triple" "nvptx64-nvidia-cuda"
>> "-fcuda-target-overloads" "-fcuda-disable-target-call-checks" "-emit-obj"
>> "-disable-free" "-main-file-name" "axpy.cu" "-mrelocation-model"
>> "static" "-mthread-model" "posix" "-fmath-errno" "-masm-verbose"
>> "-mconstructor-aliases" "-munwind-tables" "-fuse-init-array" "-target-cpu"
>> "...
2016 Mar 12
2
instrumenting device code with gpucc
...>
>>>> Hi Jingyue,
>>>>
>>>> Thank you so much for the helpful response! I didn't know that PTX
>>>> assembly cannot be linked; that's likely the reason for my issue.
>>>>
>>>> So I did the following as you suggested (axpy-sm_30.bc is the
>>>> instrumented bitcode, and cuda_hooks-sm_30.bc contains the hook functions):
>>>>
>>>> *llvm-link axpy-sm_30.bc cuda_hooks-sm_30.bc -o inst_axpy-sm_30.bc*
>>>>
>>>> *llc inst_axpy-sm_30.bc -o axpy-sm_30.s*
>>>...
2016 Oct 27
3
problem on compiling cuda program with clang++
Hi all,
I compiled the *llvm3.9* source code on the *Nvidia TX1* board, and I am now
following the document in docs/CompileCudaWithLLVM.rst to compile a
CUDA program with clang++.
When I compile `axpy.cu` using `nvcc`, *nvcc* generates the
correct binary.
However, when I compile `axpy.cu` with clang++, using the command `clang++
axpy.cu -o axpy --cuda-gpu-arch=sm_53 -L/usr/local/cuda/lib64
-lcudart_static -ldl -lrt -pthread`, *clang++* generates the following
error: `/usr/include/features.h...
2017 Aug 02
2
CUDA compilation "No available targets are compatible with this triple." problem
Yes, I followed the guide. The same error showed up:
>clang++ axpy.cu -o axpy --cuda-gpu-arch=sm_35 -L/usr/local/cuda/lib64 -I/usr/local/cuda/include -lcudart_static -ldl -lrt -pthread
error: unable to create target: 'No available targets are compatible with this triple.'
________________________________
From: Kevin Choi <code.kchoi at gmail.com>...
2017 Aug 02
2
CUDA compilation "No available targets are compatible with this triple." problem
Hi,
I have trouble compiling CUDA code with Clang. The following is a command I tried:
> clang++ axpy.cu -o axpy --cuda-gpu-arch=sm_35 --cuda-path=/usr/local/cuda
The error message is
error: unable to create target: 'No available targets are compatible with this triple.'
The info of the LLVM I'm using is as follows:
> clang++ --version
clang version 6.0.0 (http://llvm.org/git/...
2018 Mar 23
2
cuda cross compiling issue for target aarch64-linux-androideabi
...I .prebuilts/ndk/current/sources/sources/cxx-stl/gnu-libstdc++/4.9/include
\
-I
prebuilts/ndk/current/sources/cxx-stl/gnu-libstdc++/4.9/libs/arm64-v8a/include
\
-I ndk/sources/cxx-stl/system/include \
-L ./cuda/targets/aarch64-linux-androideabi/lib \
-lcudart -lcudart_static -ldl -lrt -pthread \
-o axpy axpy.cu
The clang version I am using is
Android (4667116 based on r326829) clang version 7.0.1 (
https://android.googlesource.com/toolchain/clang
76e4d638afc560ad21845fad570888bef8c18045) (
https://android.googlesource.com/toolchain/llvm
67f3e6a51d93777841e0fb6d07f71fdf343df239) (based on LLVM 7.0.1sv...
2019 Jan 23
2
Debug info for CUDA code
Hi Char, I found the problem, for some reason the last patch was applied
correctly. Just committed the fixed version. Tried to compile axpy.cu,
everything works.
-------------
Best regards,
Alexey Bataev
23.01.2019 13:37, treinz wrote:
> Hi Alexey,
>
> I tried the b7195a6 from the llvm github mirror, which does include
> your commit D46189 <https://reviews.llvm.org/D46189> (see below). But
> it can't compile...
2018 May 01
3
Compiling CUDA with clang on Windows
Dear all,
In the official document <https://llvm.org/docs/CompileCudaWithLLVM.html>,
it is mentioned that CUDA compilation is supported on Windows as of
2017-01-05. I used msys2 to install clang 5.0.1, then installed CUDA 8.0.
However, I could not compile any CUDA code with the prescribed
settings. I wonder whether anyone has successfully compiled CUDA code with
clang on Windows.
2018 Mar 23
0
cuda cross compiling issue for target aarch64-linux-androideabi
.../cxx-stl/gnu-libstdc++/4.9/include
> \
> -I
> prebuilts/ndk/current/sources/cxx-stl/gnu-libstdc++/4.9/libs/arm64-v8a/include
> \
> -I ndk/sources/cxx-stl/system/include \
> -L ./cuda/targets/aarch64-linux-androideabi/lib \
> -lcudart -lcudart_static -ldl -lrt -pthread \
> -o axpy axpy.cu
>
> clang version I am using is
> Android (4667116 based on r326829) clang version 7.0.1 (
> https://android.googlesource.com/toolchain/clang
> 76e4d638afc560ad21845fad570888bef8c18045) (
> https://android.googlesource.com/toolchain/llvm
> 67f3e6a51d93777841e0fb6d07f71f...
2019 Feb 26
1
Debug info for CUDA code
...34-0410-b5e6-96231b3b80d8
I'll check this one out.
Thanks,
Char
At 2019-01-24 03:01:18, "Alexey Bataev" <a.bataev at outlook.com> wrote:
Hi Char, I found the problem, for some reason the last patch was applied correctly. Just committed the fixed version. Tried to compile axpy.cu, everything works.
-------------
Best regards,
Alexey Bataev
23.01.2019 13:37, treinz wrote:
Hi Alexey,
I tried the b7195a6 from the llvm github mirror, which does include your commit D46189 (see below). But it can't compile the example program on https://llvm.org/docs/CompileCudaWithLL...
2016 Oct 27
0
problem on compiling cuda program with clang++
...PM, 李阳 via llvm-dev
<llvm-dev at lists.llvm.org> wrote:
> Hi all,
>
> I compiled the llvm3.9 source code on the Nvidia TX1 board, and I am now
> following the document in docs/CompileCudaWithLLVM.rst to compile a CUDA
> program with clang++.
>
> When I compile `axpy.cu` using `nvcc`, nvcc generates the
> correct binary.
>
> However, when I compile `axpy.cu` with clang++, using the command `clang++
> axpy.cu -o axpy --cuda-gpu-arch=sm_53 -L/usr/local/cuda/lib64
> -lcudart_static -ldl -lrt -pthread`, clang++ generates the following
> error...
2019 Mar 11
2
Debug info for CUDA code
...34-0410-b5e6-96231b3b80d8
I'll check this one out.
Thanks,
Char
At 2019-01-24 03:01:18, "Alexey Bataev" <a.bataev at outlook.com> wrote:
Hi Char, I found the problem, for some reason the last patch was applied correctly. Just committed the fixed version. Tried to compile axpy.cu, everything works.
-------------
Best regards,
Alexey Bataev
23.01.2019 13:37, treinz wrote:
Hi Alexey,
I tried the b7195a6 from the llvm github mirror, which does include your commit D46189 (see below). But it can't compile the example program on https://llvm.org/docs/CompileCudaWithLL...
2018 Dec 14
8
Debug info for CUDA code
Are you planning to release this as soon as it's ready, or do you want to include it in a major release? Could you let me know (maybe by replying to this thread) once the code is ready? I know it sometimes takes a while to get things into a major release. I greatly appreciate your work on this!
Thanks,
Char
At 2018-12-15 05:19:50, "Alexey Bataev" <a.bataev at outlook.com>
2016 Oct 27
0
problem on compiling cuda program with clang++
...i all,
>> >
>> > I compiled the llvm3.9 source code on the Nvidia TX1 board, and I am now
>> > following the document in docs/CompileCudaWithLLVM.rst to compile a
>> > CUDA
>> > program with clang++.
>> >
>> > When I compile `axpy.cu` using `nvcc`, nvcc generates the
>> > correct binary.
>> >
>> > However, when I compile `axpy.cu` with clang++, using the command
>> > `clang++
>> > axpy.cu -o axpy --cuda-gpu-arch=sm_53 -L/usr/local/cuda/lib64
>> > -lcudart_static -ld...
2019 Feb 26
2
Debug info for CUDA code
...34-0410-b5e6-96231b3b80d8
I'll check this one out.
Thanks,
Char
At 2019-01-24 03:01:18, "Alexey Bataev" <a.bataev at outlook.com> wrote:
Hi Char, I found the problem, for some reason the last patch was applied correctly. Just committed the fixed version. Tried to compile axpy.cu, everything works.
-------------
Best regards,
Alexey Bataev
23.01.2019 13:37, treinz wrote:
Hi Alexey,
I tried the b7195a6 from the llvm github mirror, which does include your commit D46189 (see below). But it can't compile the example program on https://llvm.org/docs/CompileCudaWithLL...
2019 Feb 27
3
Debug info for CUDA code
...34-0410-b5e6-96231b3b80d8
I'll check this one out.
Thanks,
Char
At 2019-01-24 03:01:18, "Alexey Bataev" <a.bataev at outlook.com> wrote:
Hi Char, I found the problem, for some reason the last patch was applied correctly. Just committed the fixed version. Tried to compile axpy.cu, everything works.
-------------
Best regards,
Alexey Bataev
23.01.2019 13:37, treinz wrote:
Hi Alexey,
I tried the b7195a6 from the llvm github mirror, which does include your commit D46189 (see below). But it can't compile the example program on https://llvm.org/docs/CompileCudaWithLL...
2020 Jan 15
2
Debug info for CUDA code
...34-0410-b5e6-96231b3b80d8
I'll check this one out.
Thanks,
Char
At 2019-01-24 03:01:18, "Alexey Bataev" <a.bataev at outlook.com> wrote:
Hi Char, I found the problem, for some reason the last patch was applied correctly. Just committed the fixed version. Tried to compile axpy.cu, everything works.
-------------
Best regards,
Alexey Bataev
23.01.2019 13:37, treinz wrote:
Hi Alexey,
I tried the b7195a6 from the llvm github mirror, which does include your commit D46189 (see below). But it can't compile the example program on https://llvm.org/docs/CompileCudaWithLL...