thr3ads.net - llvm dev - [llvm-dev] LLVM/CLANG: CUDA compilation fail for inline assembly code [Oct 2016]

If this information is useful, please help other people find it:
Share via:

Gurunath Kadam via llvm-dev

2016-Oct-14 12:53 UTC

[llvm-dev] LLVM/CLANG: CUDA compilation fail for inline assembly code

Hi,

I am sorry for sending this query again here, but maybe I sent it to wrong
list yesterday.

I am trying to compile LonestarGPU-rev2.0
<http://iss.ices.utexas.edu/?p=projects/galois/lonestargpu/download>
benchmark suite with LLVM/CLANG.

This suite has a following piece of code (more info here
<https://devtalk.nvidia.com/default/topic/481465/cuda-programming-and-performance/any-way-to-know-on-which-sm-a-thread-is-running-/2/?offset=21#4996171>
):

- static __device__ uint get_smid(void) {
- uint ret;
- asm("mov.u32 %0, %smid;" : "=r"(ret) );
- return ret;
- }
The original make file has nvcc compiler with a flag -Xptxas -v. It
compiles with nvcc.

LLVM has -Xcuda-ptxas <arg>, which I believe is the comparable command for
compiling PTX code. I get following error when I try compiling (clang 4.0).:


   1. ../../include/cutil_subset.h:23:25: error: invalid % escape in inline
   assembly string
   2. asm("mov.u32 %0, %smid;" : "=r"(ret) );


It points to %smid.

I have been trying to figure out what is this error is but NVIDIA PTX has
this <http://docs.nvidia.com/cuda/cuda-c-programming-guide/>.

Is this a bug or something?

Thanks.

-Guru
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20161014/fab3fa3e/attachment.html>

Gurunath Kadam via llvm-dev

2016-Oct-14 14:20 UTC

head link

[llvm-dev] LLVM/CLANG: CUDA compilation fail for inline assembly code

Okay, so as I understand, LLVM inline assembly style is *different than*
the GCC/NVPTX assembly style. So as per LLVM language reference manual
following  constraint codes are supported:


   - b: A 1-bit integer register.
   - c or h: A 16-bit integer register.
   - r: A 32-bit integer register.
   - l or N: A 64-bit integer register.
   - f: A 32-bit float register.
   - d: A 64-bit float register.

Now, I am just wondering if there are equivalent constraints for %smid and
%warpid? As per the NVIDIA documentation
<http://docs.nvidia.com/cuda/cuda-c-programming-guide/#sm-id-and-warp-id>
it
is unsafe to rely on this information provided by %smid and %warpid, but
for compiling an existing cuda code I will need to do that.

Any suggestions? Work in progress?

Thank you.

-Guru

On Fri, Oct 14, 2016 at 8:53 AM, Gurunath Kadam <gurunath.kadam at
gmail.com>
wrote:
> Hi,
>
> I am sorry for sending this query again here, but maybe I sent it to wrong
> list yesterday.
>
> I am trying to compile LonestarGPU-rev2.0
> <http://iss.ices.utexas.edu/?p=projects/galois/lonestargpu/download>
> benchmark suite with LLVM/CLANG.
>
> This suite has a following piece of code (more info here
>
<https://devtalk.nvidia.com/default/topic/481465/cuda-programming-and-performance/any-way-to-know-on-which-sm-a-thread-is-running-/2/?offset=21#4996171>
> ):
>
> - static __device__ uint get_smid(void) {
> - uint ret;
> - asm("mov.u32 %0, %smid;" : "=r"(ret) );
> - return ret;
> - }
> The original make file has nvcc compiler with a flag -Xptxas -v. It
> compiles with nvcc.
>
> LLVM has -Xcuda-ptxas <arg>, which I believe is the comparable
command for
> compiling PTX code. I get following error when I try compiling (clang
4.0).:
>
>
>    1. ../../include/cutil_subset.h:23:25: error: invalid % escape in
>    inline assembly string
>    2. asm("mov.u32 %0, %smid;" : "=r"(ret) );
>
>
> It points to %smid.
>
> I have been trying to figure out what is this error is but NVIDIA PTX has
> this <http://docs.nvidia.com/cuda/cuda-c-programming-guide/>.
>
> Is this a bug or something?
>
> Thanks.
>
> -Guru
>
>
>
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20161014/28d2b524/attachment.html>

Gurunath Kadam via llvm-dev

2016-Oct-15 23:12 UTC

head link

[llvm-dev] LLVM/CLANG: CUDA compilation fail for inline assembly code

Hi,

I am just following up on my question.

Meanwhile I looked up the 'include/llvm/IR/IntrinsicsNVVM.td' and found
there is a definition for smid (declare i32 @llvm.nvvm.read.ptx.sreg.smid).

At this page <http://llvm.org/docs/NVPTXUsage.html>, there is code for
kernel.ll. I am lost here because this looks like LLVM code completely and
not inline asm.

I also tried a cpp-only program with simple asm (so now this is gcc
compatible code) and found that LLVM can compile it (may be this was a too
simplified test program).

Can someone please point me in the right direction?

Thank you.

-Guru

On Fri, Oct 14, 2016 at 10:20 AM, Gurunath Kadam <gurunath.kadam at
gmail.com>
wrote:
> Okay, so as I understand, LLVM inline assembly style is *different than*
> the GCC/NVPTX assembly style. So as per LLVM language reference manual
> following  constraint codes are supported:
>
>
>    - b: A 1-bit integer register.
>    - c or h: A 16-bit integer register.
>    - r: A 32-bit integer register.
>    - l or N: A 64-bit integer register.
>    - f: A 32-bit float register.
>    - d: A 64-bit float register.
>
> Now, I am just wondering if there are equivalent constraints for %smid and
> %warpid? As per the NVIDIA documentation
>
<http://docs.nvidia.com/cuda/cuda-c-programming-guide/#sm-id-and-warp-id>
it
> is unsafe to rely on this information provided by %smid and %warpid, but
> for compiling an existing cuda code I will need to do that.
>
> Any suggestions? Work in progress?
>
> Thank you.
>
> -Guru
>
> On Fri, Oct 14, 2016 at 8:53 AM, Gurunath Kadam <gurunath.kadam at
gmail.com>
> wrote:
>
>> Hi,
>>
>> I am sorry for sending this query again here, but maybe I sent it to
>> wrong list yesterday.
>>
>> I am trying to compile LonestarGPU-rev2.0
>>
<http://iss.ices.utexas.edu/?p=projects/galois/lonestargpu/download>
>> benchmark suite with LLVM/CLANG.
>>
>> This suite has a following piece of code (more info here
>>
<https://devtalk.nvidia.com/default/topic/481465/cuda-programming-and-performance/any-way-to-know-on-which-sm-a-thread-is-running-/2/?offset=21#4996171>
>> ):
>>
>> - static __device__ uint get_smid(void) {
>> - uint ret;
>> - asm("mov.u32 %0, %smid;" : "=r"(ret) );
>> - return ret;
>> - }
>> The original make file has nvcc compiler with a flag -Xptxas -v. It
>> compiles with nvcc.
>>
>> LLVM has -Xcuda-ptxas <arg>, which I believe is the comparable
command
>> for compiling PTX code. I get following error when I try compiling
(clang
>> 4.0).:
>>
>>
>>    1. ../../include/cutil_subset.h:23:25: error: invalid % escape in
>>    inline assembly string
>>    2. asm("mov.u32 %0, %smid;" : "=r"(ret) );
>>
>>
>> It points to %smid.
>>
>> I have been trying to figure out what is this error is but NVIDIA PTX
has
>> this <http://docs.nvidia.com/cuda/cuda-c-programming-guide/>.
>>
>> Is this a bug or something?
>>
>> Thanks.
>>
>> -Guru
>>
>>
>>
>>
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20161015/eff1777d/attachment-0001.html>

llvm dev - Oct 2016 - LLVM/CLANG: CUDA compilation fail for inline assembly code

[llvm-dev] LLVM/CLANG: CUDA compilation fail for inline assembly code

[llvm-dev] LLVM/CLANG: CUDA compilation fail for inline assembly code

[llvm-dev] LLVM/CLANG: CUDA compilation fail for inline assembly code