thr3ads.net - similar to: "Clang option to provide list of target-subarchs."

Displaying 20 results from an estimated 700 matches similar to: "Clang option to provide list of target-subarchs."

2016 Aug 01

[GPUCC] link against libdevice

OK, I see the problem. You were right that we weren't picking up libdevice. CUDA 7.0 only ships with the following libdevice binaries (found /path/to/cuda/nvvm/libdevice): libdevice.compute_20.10.bc libdevice.compute_30.10.bc libdevice.compute_35.10.bc If you ask for sm_50 with cuda 7.0, clang can't find a matching libdevice binary, and it will apparently silently give up and try to

Clang option to provide list of target-subarchs.

2017 Feb 07

Clang option to provide list of target-subarchs.

Thank you for the feedback. > How is this going to work with --target-subarchs? Is there going to be a --no-target-subarchs flag to disable subarchs? What will the semantics of this be, exactly? The large number of subarchs expected makes an inclusive only flag desirable and an exclusive flag impractical. Also, since subarchs will age more quickly than archs, who knows what old crufty

[GPUCC] link against libdevice

2016 Aug 01

[GPUCC] link against libdevice

Hi Justin, Thanks for your response! The clang & llvm I'm using was built from source. Below is the output of compiling with -v. Any suggestions would be appreciated! *clang version 3.9.0 (trunk 270145) (llvm/trunk 270133)* *Target: x86_64-unknown-linux-gnu* *Thread model: posix* *InstalledDir: /usr/local/bin* *Found candidate GCC installation: /usr/lib/gcc/x86_64-linux-gnu/4.8*

[GPUCC] link against libdevice

2016 Aug 01

[GPUCC] link against libdevice

Hi, Yuanfeng. What version of clang are you using? CUDA is only known to work at tip of head, so you must build clang yourself from source. I suspect that's your problem, but if building from source doesn't fix it, please attach the output of compiling with -v. Regards, -Justin On Sun, Jul 31, 2016 at 9:24 PM, Chandler Carruth <chandlerc at google.com> wrote: > Directly

[CUDA] Lost debug information when compiling CUDA code

2017 Jun 14

[CUDA] Lost debug information when compiling CUDA code

Hi, I needed to debug some CUDA code in my project; however, although I used -g when compiling the source code, no source-level information is available in cuda-gdb or cuda-memcheck. Specifically, below is what I did: 1) For a CUDA file a.cu, generate IR files: clang++ -g -emit-llvm --cuda-gpu-arch=sm_35 -c a.cu; 2) Instrument the device code a-cuda-nvptx64-nvidia-cuda-sm_35.bc (generated

The virtuailization patches break Voyager.

2007 Apr 27

The virtuailization patches break Voyager.

Guys currently I am horrified by the ease at which I can find bugs in the pending paravirtualization patches. I have barely even looked at arch/i386 in the -mm tree and it feels like I am tripping over significant bugs left and right. Because no one has heeded my advice and put in a proper platform layer on arch/i386 and we are instead doing a half baked job with paravirt_ops it is still

The virtuailization patches break Voyager.

2007 Apr 27

The virtuailization patches break Voyager.

[GPUCC] link against libdevice

2016 Jul 29

[GPUCC] link against libdevice

Hi, I was trying to compile scalarProd.cu (from CUDA SDK) with the following command: * clang++ -I../ -I/usr/local/cuda-7.0/samples/common/inc --cuda-gpu-arch=sm_50 scalarProd.cu* but ended up with the following error: *ptxas fatal : Unresolved extern function '__nv_mul24'* Seems to me that libdevice was not automatically linked. I wonder what flags I need to pass to clang to have

CUDA compilation "No available targets are compatible with this triple." problem

2017 Aug 02

CUDA compilation "No available targets are compatible with this triple." problem

Yes, I followed the guide. The same error showed up: >clang++ axpy.cu -o axpy --cuda-gpu-arch=sm_35 -L/usr/local/cuda/lib64 -I/usr/local/cuda/include -lcudart_static -ldl -lrt -pthread error: unable to create target: 'No available targets are compatible with this triple.' ________________________________ From: Kevin Choi <code.kchoi at gmail.com> Sent: Wednesday, August 2,

instrumenting device code with gpucc

2016 Mar 05

instrumenting device code with gpucc

On Fri, Mar 4, 2016 at 5:50 PM, Yuanfeng Peng <yuanfeng.jack.peng at gmail.com> wrote: > Hi Jingyue, > > My name is Yuanfeng Peng, I'm a PhD student at UPenn. I'm sorry to bother > you, but I'm having trouble with gpucc in my project, and I would be really > grateful for your help! > > Currently we're trying to instrument CUDA code using LLVM 3.9, and

CUDA compilation "No available targets are compatible with this triple." problem

2017 Aug 02

CUDA compilation "No available targets are compatible with this triple." problem

Hi, I have trouble compiling CUDA code with Clang. The following is a command I tried: > clang++ axpy.cu -o axpy --cuda-gpu-arch=sm_35 --cuda-path=/usr/local/cuda The error message is error: unable to create target: 'No available targets are compatible with this triple.' The info of the LLVM I'm using is as follows: > lang++ --version clang version 6.0.0

[GPUCC] link against libdevice

2016 Aug 01

[GPUCC] link against libdevice

Directly CC-ing some folks who may be able to help. On Fri, Jul 29, 2016 at 6:27 AM Yuanfeng Peng via llvm-dev < llvm-dev at lists.llvm.org> wrote: > Hi, > > I was trying to compile scalarProd.cu (from CUDA SDK) with the following > command: > > * clang++ -I../ -I/usr/local/cuda-7.0/samples/common/inc > --cuda-gpu-arch=sm_50 scalarProd.cu* > > but ended up with

[CUDA/NVPTX] is inlining __syncthreads allowed?

2015 Aug 21

[CUDA/NVPTX] is inlining __syncthreads allowed?

I'm using 7.0. I am attaching the reduced example. nvcc sync.cu -arch=sm_35 -ptx gives // .globl _Z3foov .visible .entry _Z3foov( ) { .reg .pred %p<2>; .reg .s32 %r<3>; mov.u32 %r1, %tid.x; and.b32 %r2, %r1, 1; setp.eq.b32 %p1, %r2, 1; @!%p1 bra BB7_2; bra.uni

NVPTX Back-end: relocatable device code support for dynamic parallelism

2017 Jun 09

NVPTX Back-end: relocatable device code support for dynamic parallelism

Hi everyone, CUDA allows to call some runtime functions also from the device code. On a multi-GPU system this allows the GPU to determine its device id on its own via cudaGetDevice(). Unfortunately i cannot get it working when compiling with clang. When compiling with nvcc relocatable device code needs to be set to true (-rdc=true) and the cudadevrt is needed when linking [0]. I did not

hcc not found, rcmd build

2006 Sep 29

hcc not found, rcmd build

Working under Windows XP, I am compiling a package called 'pgirmess' with the command rcmd build --binary --auto-zip pgirmess I have this message error after having listed: functions text html latex example chm .... zipping help file hcc: not found cp: cannot stat 'c:/TEMP/Rbuild365620874/pgirmess/chm/pgirmess.chm': No such file or directory make[1]: *** [chm-pgirmess] Error 1

Problems with app_realtime

2004 Dec 14

Problems with app_realtime

It seems that when setting qualify = 200 or qualify = yes in the database for a sip friend/peer, RealTime does not update the registration status like it should. I also have several peers which have been offline and Asterisk still reports them as registered, even though the registration seconds are only 200. Asterisk Ver: CVS HEAD 12/1/2004 Layout of sip_buddies: mysql> describe

[PATCH] reenable pygrub build

2005 Oct 07

[PATCH] reenable pygrub build

The attached re-enables the pygrub build and fixes the build with older e2fsprogs (tested on RHEL4 with e2fsprogs-1.35 and rawhide with e2fsprogs-1.38) Signed-off-by: Jeremy Katz <katzj@redhat.com> Jeremy _______________________________________________ Xen-devel mailing list Xen-devel@lists.xensource.com http://lists.xensource.com/xen-devel

readRegistry function (PR#12937)

2008 Sep 19

readRegistry function (PR#12937)

Full_Name: Zivan Karaman Version: 2.7.2 OS: Windows XP Submission from: (NULL) (195.6.68.214) I'm puzzled by the readRegistry function. Shouldn't the "hive" argument be something like c("HLM", "HCR", "HCU", "HU", "HCC", "HPD") rather than c("HLM", "HCR", "HCU", "HU",

Adding labels to heatmaps from image()

2009 Mar 17

Adding labels to heatmaps from image()

Hi, I have been trying to add labels to the rows of a heatmap produced using image() function. It is simply not working. Here is what I did. A2Rplot.hclust(hcc,k=length(num),col.up="black",col.down=num,lty.up=2,lty.down=1,lwd.up=1,lwd.down=2,show.labels=FALSE) #used the above external program to create a colored dendrogram xsort <- x[1:nrow(x), hcc$labels[hcc$order]]

Asterisk and SMP

2004 Aug 11

Asterisk and SMP

Does anything have to be done at compile time in order for Asterisk to take advantage of 2 CPU's? Thanks

similar to: Clang option to provide list of target-subarchs.