thr3ads.net - similar to: "[LLVMdev] Target/NVPTX Ownership"

Displaying 20 results from an estimated 30000 matches similar to: "[LLVMdev] Target/NVPTX Ownership"

[LLVMdev] Problem with PTX assembly printing (NVPTX backend)

2013 Feb 04

[LLVMdev] Problem with PTX assembly printing (NVPTX backend)

Hi Nikolaos, Following commands work great for me. $ clang -S -emit-llvm -target nvptx -x cl -include clc/clctypes.h ../data-types/scalar.cl $ llc -mcpu=sm_30 scalar.s You can follow Justin's blog [1]. It helped me a lot to understand where to start. [1] http://jholewinski.org/blog/llvm-3-0-ptx-backend/ Best, Ankur On Mon, Feb 4, 2013 at 11:40 PM, Justin Holewinski < justin.holewinski

[LLVMdev] RFC: Code Ownership

2012 Nov 12

[LLVMdev] RFC: Code Ownership

On the topic of code owners: Would it make sense to add target-specific code owners for the codegen targets? Some of the targets are more esoteric (e.g. NVPTX, CellSPU, AMDGPU) and may benefit from more fine-grained code ownership. On Mon, Nov 12, 2012 at 11:26 AM, Renato Golin <rengolin at systemcall.org>wrote: > On 12 November 2012 15:55, Meador Inge <meadori at

[LLVMdev] RFC: Code Ownership

2012 Nov 12

[LLVMdev] RFC: Code Ownership

On Nov 12, 2012, at 8:56 AM, Justin Holewinski <justin.holewinski at gmail.com> wrote: > On the topic of code owners: > > Would it make sense to add target-specific code owners for the codegen targets? Some of the targets are more esoteric (e.g. NVPTX, CellSPU, AMDGPU) and may benefit from more fine-grained code ownership. It definitely makes sense to have code owners for each

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

2013 Jun 05

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

Thanks for the info! I would be glad to hear of any issues you have encountered on this path. I tried to make sure the 3.3 release was fully compatible with the libdevice implementation shipping with 5.5 (and as far as I know, it is). It's just not an officially supported configuration. Also, I've been meaning to address your -drvcuda issue. How would you feel about making that a part

[LLVMdev] Problem with PTX assembly printing (NVPTX backend)

2013 Feb 04

[LLVMdev] Problem with PTX assembly printing (NVPTX backend)

On Mon, Feb 4, 2013 at 1:09 PM, <nkavv at physics.auth.gr> wrote: > Hi Justin, > > > Has anyone had similar problems with the NVPTX backend? Shouldn't this >>> code be linked to the AsmPrinter library for NVPTX (already)? >>> >> >> What do you mean by "doesn't work"? The AsmPrinter library really houses >> the MCInst

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

2013 Jun 05

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

Dear all, FWIW, I've tested libdevice.compute_20.10.bc and libdevice.compute_30.10.bc from /cuda/nvvm/libdevice shipped with CUDA 5.5 preview. IR is compatible with LLVM 3.4 trunk that we use. Results are correct, performance - almost the same as what we had before with cicc-sniffed IR, or maybe <10% better. Will test libdevice.compute_35.10.bc once we will get K20 support. Thanks for

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

2013 Feb 17

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

The X86 back-end just calls into libm: // Always use a library call for pow. setOperationAction(ISD::FPOW , MVT::f32 , Expand); setOperationAction(ISD::FPOW , MVT::f64 , Expand); setOperationAction(ISD::FPOW , MVT::f80 , Expand); The issue is really that there is no standard math library for PTX. I agree that this is a pain for most users, but I

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

2013 Feb 17

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

> The issue is really that there is no standard math library for PTX. Well, formally, that could very well be true. Moreover, in some parts CPU math standard is impossible to accomplish on parallel architectures, consider, for example errno behavior. But here we are speaking more about practical side. And the practical side is: past 5 years CUDA claims to accelerate compute applications, and

[LLVMdev] [llvm-commits] [PATCH][RFC] NVPTX Backend

2012 Apr 25

[LLVMdev] [llvm-commits] [PATCH][RFC] NVPTX Backend

Hi Justin, Cool stuff, to be sure. Excited to see this. As a pre-cursor to more involved technical feedback, I suggest going through and fixing up the coding style and formatting issues. Just glancing through, I see lots of things like function names starting with capital letters, compound statements with the opening brace on the line following an if/for/while/etc., single-statements after an

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

2013 Feb 17

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

Hi Justin, I don't understand, why, for instance, X86 backend handles pow automatically, and NVPTX should be a PITA requiring user to bring his own pow implementation. Even at a very general level, this limits the interest of users to LLVM NVPTX backend. Could you please elaborate on the rationale behind your point? Why the accuracy modes I suggested are not sufficient, in your opinion? - D.

[LLVMdev] Problem with PTX assembly printing (NVPTX backend)

2013 Feb 04

[LLVMdev] Problem with PTX assembly printing (NVPTX backend)

Hi Justin, >> Has anyone had similar problems with the NVPTX backend? Shouldn't this >> code be linked to the AsmPrinter library for NVPTX (already)? > > What do you mean by "doesn't work"? The AsmPrinter library really houses > the MCInst printer, which isn't implemented for NVPTX yet. The older > assembly printer works just fine. This is

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

2013 Feb 08

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

Yes, it helps a lot and we are working on it. A few questions, 1) What will be your use model of this library? Will you run optimization phases after linking with the library? If so, what are they? 2) Do you care if the names of functions differ from those in libm? For example, it would be gpusin() instead of sin(). 3) Do you need a different library for different host

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

2013 Feb 17

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

I would be very hesitant to expose all math library functions as intrinsics. I believe linking with a target-specific math library is the correct approach, as it decouples the back end from the needs of the source program/language. Users should be free to use any math library implementation they choose. Intrinsics are meant for functions that compile down to specific isa features, like fused

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

2013 Feb 17

[LLVMdev] [NVPTX] We need an LLVM CUDA math library, after all

Dear Yuan, Sorry for delay with reply, Answers on your questions could be different, depending on the math library placement in the code generation pipeline. At KernelGen, we currently have a user-level CUDA math module, adopted from cicc internals [1]. It is intended to be linked with the user LLVM IR module, right before proceeding with the final optimization and backend. Last few months we

[LLVMdev] [NVPTX] launch_bounds support?

2013 Apr 02

[LLVMdev] [NVPTX] launch_bounds support?

Yes, this is supported through metadata. An example usage of these annotations is given in the test/CodeGen/NVPTX/annotations.ll unit test. I'll try to remember to add this to the NVPTX documentation I'm putting together at http://llvm.org/docs/NVPTXUsage.html. On Mon, Apr 1, 2013 at 8:06 AM, Dmitry Mikushin <dmitry at kernelgen.org>wrote: > Dear all, > > Is anybody

[LLVMdev] [NVPTX] Eliminate common sub-expressions in a group of similar GEPs

2014 Apr 21

[LLVMdev] [NVPTX] Eliminate common sub-expressions in a group of similar GEPs

Hi Hal, Thanks for your comments! I'm inlining my responses below. Jingyue On Sat, Apr 19, 2014 at 6:38 AM, Hal Finkel <hfinkel at anl.gov> wrote: > Jingyue, > > I can't speak for the NVPTX backend, but I think this looks useful as an > (optional) target-independent pass. A few thoughts: > > - Running GVN tends to be pretty expensive; have you tried EarlyCSE

[LLVMdev] [NVPTX] Backend failure in LegalizeDAG due to unimplemented expand in target lowering

2012 Jul 08

[LLVMdev] [NVPTX] Backend failure in LegalizeDAG due to unimplemented expand in target lowering

OK, thanks. For our project I implemented a similar workaround: extend each i1 memory item to i8 and load/store i1 to i8 with a type cast. Still, the issue in NVPTX remains. I don't know whether NVIDIA or community fellows have any reasonable priority to fix it (or at least put an NYI assertion!). It seems to be quite more complex, than implementing custom lowering handler, that's why

[LLVMdev] Problem with PTX assembly printing (NVPTX backend)

2013 Feb 04

[LLVMdev] Problem with PTX assembly printing (NVPTX backend)

Alright, couple of points here: 1. Address space 0 is invalid for global variables. This is causing a crash in llc where we use llvm_unreachable() on this case. This is most likely why you're seeing llc run forever. The fix for this is to use address space 1 for globals, which puts them into PTX global memory. On our side, we should provide a meaningful error message in this case. 2. The

[LLVMdev] [llvm-commits] [PATCH][RFC] NVPTX Backend

2012 May 01

[LLVMdev] [llvm-commits] [PATCH][RFC] NVPTX Backend

> -----Original Message----- > From: Dan Bailey [mailto:dan at dneg.com] > Sent: Sunday, April 29, 2012 8:46 AM > To: Justin Holewinski > Cc: Jim Grosbach; llvm-commits at cs.uiuc.edu; Vinod Grover; > llvmdev at cs.uiuc.edu > Subject: Re: [llvm-commits] [PATCH][RFC] NVPTX Backend > > Justin, > > Firstly, this is great! It seems to be so much further forward in

[LLVMdev] [PATCH][RFC] NVPTX Backend

2012 Apr 24

[LLVMdev] [PATCH][RFC] NVPTX Backend

Hi LLVMers, We at NVIDIA would like to contribute back to the LLVM open-source community by up-streaming the NVPTX back-end for LLVM. This back-end is based on the sources used by NVIDIA, and currently provides significantly more functionality than the current PTX back-end. Some functionality is currently disabled due to dependencies on LLVM core changes that we are also in the process of

similar to: [LLVMdev] Target/NVPTX Ownership