Displaying 20 results from an estimated 700 matches similar to: "Clang option to provide list of target-subarchs."
2016 Aug 01
3
[GPUCC] link against libdevice
OK, I see the problem. You were right that we weren't picking up libdevice.
CUDA 7.0 only ships with the following libdevice binaries (found in
/path/to/cuda/nvvm/libdevice):
libdevice.compute_20.10.bc libdevice.compute_30.10.bc
libdevice.compute_35.10.bc
If you ask for sm_50 with CUDA 7.0, clang can't find a matching
libdevice binary, and it will apparently silently give up and try to
2017 Feb 07
0
Clang option to provide list of target-subarchs.
Thank you for the feedback.
> How is this going to work with --target-subarchs? Is there going to be a --no-target-subarchs flag to disable subarchs? What will the semantics of this be, exactly?
The large number of subarchs expected makes an inclusive-only flag desirable and an exclusive flag impractical. Also, since subarchs will age more quickly than archs, who knows what old crufty
2016 Aug 01
0
[GPUCC] link against libdevice
Hi Justin,
Thanks for your response! The clang and LLVM I'm using were built from
source.
Below is the output of compiling with -v. Any suggestions would be
appreciated!
*clang version 3.9.0 (trunk 270145) (llvm/trunk 270133)*
*Target: x86_64-unknown-linux-gnu*
*Thread model: posix*
*InstalledDir: /usr/local/bin*
*Found candidate GCC installation: /usr/lib/gcc/x86_64-linux-gnu/4.8*
2016 Aug 01
2
[GPUCC] link against libdevice
Hi, Yuanfeng.
What version of clang are you using? CUDA is only known to work at
tip of head, so you must build clang yourself from source.
I suspect that's your problem, but if building from source doesn't fix
it, please attach the output of compiling with -v.
Regards,
-Justin
On Sun, Jul 31, 2016 at 9:24 PM, Chandler Carruth <chandlerc at google.com> wrote:
> Directly
2017 Jun 14
4
[CUDA] Lost debug information when compiling CUDA code
Hi,
I needed to debug some CUDA code in my project; however, although I used -g when compiling the source code, no source-level information is available in cuda-gdb or cuda-memcheck.
Specifically, below is what I did:
1) For a CUDA file a.cu, generate IR files: clang++ -g -emit-llvm --cuda-gpu-arch=sm_35 -c a.cu;
2) Instrument the device code a-cuda-nvptx64-nvidia-cuda-sm_35.bc (generated
2007 Apr 27
3
The virtualization patches break Voyager.
Guys, currently I am horrified by the ease with which I can find
bugs in the pending paravirtualization patches. I have barely
even looked at arch/i386 in the -mm tree and it feels like
I am tripping over significant bugs left and right.
Because no one has heeded my advice and put in a proper platform
layer on arch/i386, and we are instead doing a half-baked job
with paravirt_ops, it is still
2016 Jul 29
2
[GPUCC] link against libdevice
Hi,
I was trying to compile scalarProd.cu (from CUDA SDK) with the following
command:
* clang++ -I../ -I/usr/local/cuda-7.0/samples/common/inc
--cuda-gpu-arch=sm_50 scalarProd.cu*
but ended up with the following error:
*ptxas fatal : Unresolved extern function '__nv_mul24'*
Seems to me that libdevice was not automatically linked. I wonder what
flags I need to pass to clang to have
2017 Aug 02
2
CUDA compilation "No available targets are compatible with this triple." problem
Yes, I followed the guide. The same error showed up:
>clang++ axpy.cu -o axpy --cuda-gpu-arch=sm_35 -L/usr/local/cuda/lib64 -I/usr/local/cuda/include -lcudart_static -ldl -lrt -pthread
error: unable to create target: 'No available targets are compatible with this triple.'
________________________________
From: Kevin Choi <code.kchoi at gmail.com>
Sent: Wednesday, August 2,
2016 Mar 05
2
instrumenting device code with gpucc
On Fri, Mar 4, 2016 at 5:50 PM, Yuanfeng Peng <yuanfeng.jack.peng at gmail.com>
wrote:
> Hi Jingyue,
>
> My name is Yuanfeng Peng, I'm a PhD student at UPenn. I'm sorry to bother
> you, but I'm having trouble with gpucc in my project, and I would be really
> grateful for your help!
>
> Currently we're trying to instrument CUDA code using LLVM 3.9, and
2017 Aug 02
2
CUDA compilation "No available targets are compatible with this triple." problem
Hi,
I have trouble compiling CUDA code with Clang. The following is a command I tried:
> clang++ axpy.cu -o axpy --cuda-gpu-arch=sm_35 --cuda-path=/usr/local/cuda
The error message is
error: unable to create target: 'No available targets are compatible with this triple.'
The info of the LLVM I'm using is as follows:
> clang++ --version
clang version 6.0.0
2016 Aug 01
0
[GPUCC] link against libdevice
Directly CC-ing some folks who may be able to help.
On Fri, Jul 29, 2016 at 6:27 AM Yuanfeng Peng via llvm-dev <
llvm-dev at lists.llvm.org> wrote:
> Hi,
>
> I was trying to compile scalarProd.cu (from CUDA SDK) with the following
> command:
>
> * clang++ -I../ -I/usr/local/cuda-7.0/samples/common/inc
> --cuda-gpu-arch=sm_50 scalarProd.cu*
>
> but ended up with
2015 Aug 21
2
[CUDA/NVPTX] is inlining __syncthreads allowed?
I'm using 7.0. I am attaching the reduced example.
nvcc sync.cu -arch=sm_35 -ptx
gives
// .globl _Z3foov
.visible .entry _Z3foov(
)
{
.reg .pred %p<2>;
.reg .s32 %r<3>;
mov.u32 %r1, %tid.x;
and.b32 %r2, %r1, 1;
setp.eq.b32 %p1, %r2, 1;
@!%p1 bra BB7_2;
bra.uni
2017 Jun 09
1
NVPTX Back-end: relocatable device code support for dynamic parallelism
Hi everyone,
CUDA also allows some runtime functions to be called from device code. On
a multi-GPU system, this lets a GPU determine its own device ID
via cudaGetDevice().
Unfortunately, I cannot get this working when compiling with clang. When
compiling with nvcc, relocatable device code must be enabled
(-rdc=true) and cudadevrt is needed when linking [0]. I did not
2006 Sep 29
2
hcc not found, rcmd build
Working under Windows XP, I am compiling a package called 'pgirmess'
with the command
rcmd build --binary --auto-zip pgirmess
I get this error message after it lists: functions text html latex
example chm
....
zipping help file
hcc: not found
cp: cannot stat 'c:/TEMP/Rbuild365620874/pgirmess/chm/pgirmess.chm': No
such file or directory
make[1]: *** [chm-pgirmess] Error 1
2004 Dec 14
3
Problems with app_realtime
It seems that when setting qualify = 200 or qualify = yes in the database for
a sip friend/peer, RealTime does not update the registration status like it
should.
I also have several peers which have been offline and Asterisk still reports
them as registered, even though the registration seconds are only 200.
Asterisk Ver: CVS HEAD 12/1/2004
Layout of sip_buddies:
mysql> describe
2005 Oct 07
10
[PATCH] reenable pygrub build
The attached re-enables the pygrub build and fixes the build with older
e2fsprogs (tested on RHEL4 with e2fsprogs-1.35 and rawhide with
e2fsprogs-1.38)
Signed-off-by: Jeremy Katz <katzj@redhat.com>
Jeremy
2008 Sep 19
1
readRegistry function (PR#12937)
Full_Name: Zivan Karaman
Version: 2.7.2
OS: Windows XP
Submission from: (NULL) (195.6.68.214)
I'm puzzled by the readRegistry function.
Shouldn't the "hive" argument be something like
c("HLM", "HCR", "HCU", "HU", "HCC", "HPD") rather than
c("HLM", "HCR", "HCU", "HU",
2009 Mar 17
1
Adding labels to heatmaps from image()
Hi,
I have been trying to add labels to the rows of a heatmap produced using the image() function, but it is simply not working. Here is what I did.
A2Rplot.hclust(hcc,k=length(num),col.up="black",col.down=num,lty.up=2,lty.down=1,lwd.up=1,lwd.down=2,show.labels=FALSE)
#used the above external program to create a colored dendrogram
xsort <- x[1:nrow(x), hcc$labels[hcc$order]]
2004 Aug 11
5
Asterisk and SMP
Does anything have to be done at compile time in order for Asterisk to
take advantage of two CPUs?
Thanks