thr3ads.net - llvm dev - [LLVMdev] Upstream PTX backend that uses target independent code generator if possible [Aug 2010]

Villmow, Micah

2010-Aug-10 19:25 UTC

[LLVMdev] Upstream PTX backend that uses target independent code generator if possible

> -----Original Message-----
> From: llvmdev-bounces at cs.uiuc.edu [mailto:llvmdev-bounces at cs.uiuc.edu]
> On Behalf Of David A. Greene
> Sent: Tuesday, August 10, 2010 12:02 PM
> To: Che-Liang Chiou
> Cc: llvmdev at cs.uiuc.edu
> Subject: Re: [LLVMdev] Upstream PTX backend that uses target
> independent code generator if possible
>
> Che-Liang Chiou <clchiou at gmail.com> writes:
>
> > I surfed their code, and it seems that they didn't use code generator.
> > That means their design should be similar to CBackend or CPPBackend.
> > So I guess it can't generate some machine instructions like MAD,
> > and there are some PTX instruction set features that are hard to
> > exploit if not using code generator.
> >
> > But I didn't study their code thoroughly, so I might be wrong about
> > this.
>
> I haven't had a chance to look at it yet either.
>
> >>> I have tested this backend to translate a work-efficient parallel
> >>> scan kernel (
> >>> http://http.developer.nvidia.com/GPUGems3/gpugems3_ch39.html
> >>> ) into PTX code.  The generated PTX code was then executed on real
> >>> hardware, and the result is correct.
> >>
> >> How much of the LLVM IR does this support?  What's missing?
> > Have to add some intrinsics, calling conventions, and address spaces.
> > I would say these are relatively small changes.
>
> Are you generating masks at all?  If so, how are you doing that?
> Similarly to how the ARM backend does predicates (handling all the
> representation, etc. in the target-specific codegen)?
>
> I have been wanting to see predicates (vector and scalar) in the
> LLVM IR for a long time.  Perhaps the PTX backend is an opportunity
> to explore that.

[Villmow, Micah] From looking at the llvmptxbackend, it does not fully support
vector types. In my view, this is one of the greatest benefits of the backend
code generator: automatic support, via vector splitting, for vector types in
LLVM IR that the target machine does not natively support.

>                            -Dave
> 
> _______________________________________________
> LLVM Developers mailing list
> LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
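
To illustrate the vector splitting Micah mentions, here is a minimal C++
sketch of the idea, not LLVM's actual implementation. All names in it
(kLegalLanes, nativeAdd, splitAdd, addVectors) are hypothetical, and the
"target" is assumed to handle at most four lanes natively. LLVM's legalizer
works on its own DAG representation rather than on raw arrays; the sketch
only shows the recursive halving of an operation that is too wide.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Assumption for this sketch: the widest vector the toy target supports.
constexpr std::size_t kLegalLanes = 4;

// Stand-in for one native vector add on at most kLegalLanes lanes.
static void nativeAdd(const float *a, const float *b, float *out,
                      std::size_t lanes) {
    assert(lanes <= kLegalLanes);
    for (std::size_t i = 0; i < lanes; ++i)
        out[i] = a[i] + b[i];
}

// Split an add that is too wide into two halves, recursively, until every
// piece fits the target. Returns the number of native adds that were issued.
static std::size_t splitAdd(const float *a, const float *b, float *out,
                            std::size_t lanes) {
    if (lanes <= kLegalLanes) {
        nativeAdd(a, b, out, lanes);
        return 1;
    }
    const std::size_t lo = lanes / 2;  // low half; the high half gets the rest
    return splitAdd(a, b, out, lo) +
           splitAdd(a + lo, b + lo, out + lo, lanes - lo);
}

// Front end of the sketch: adds two equally sized vectors of any width and
// optionally reports how many native operations the split produced.
std::vector<float> addVectors(const std::vector<float> &a,
                              const std::vector<float> &b,
                              std::size_t *nativeOps = nullptr) {
    assert(a.size() == b.size());
    std::vector<float> out(a.size());
    const std::size_t ops = splitAdd(a.data(), b.data(), out.data(), a.size());
    if (nativeOps)
        *nativeOps = ops;
    return out;
}
```

Under these assumptions, a 16-lane add is split into two 8-lane adds and then
into four 4-lane native adds. The code that requested the 16-lane add never
needs to know the target's real width, which is the benefit described above.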

Che-Liang Chiou

2010-Aug-11 03:29 UTC

[LLVMdev] Upstream PTX backend that uses target independent code generator if possible

My implementation of predicated instructions is similar to the ARM
backend's. I traced the ARM and PowerPC backends for reference.

David, if you were asking about an implementation of predication in the
LLVM IR, I didn't do that, partly because I was not (and still am not)
very familiar with LLVM's design, so I didn't know how to do it.

I agree with what Micah said; LLVM's code generator has vector splitting,
among many other reusable components.

Regards,
Che-Liang


David A. Greene

2010-Aug-11 21:51 UTC

[LLVMdev] Upstream PTX backend that uses target independent code generator if possible

"Villmow, Micah" <Micah.Villmow at amd.com> writes:

>> I have been wanting to see predicates (vector and scalar) in the
>> LLVM IR for a long time.  Perhaps the PTX backend is an opportunity
>> to explore that.
> [Villmow, Micah] From looking at the llvmptxbackend, it does not fully
> support vector types.  In my view, this is one of the greatest
> benefits of the backend code generator: automatic support, via vector
> splitting, for vector types in LLVM IR that the target machine does
> not natively support.

Actually, one doesn't really need vector types for PTX at the codegen
level unless one wants to make use of the 128-bit memory loads and
stores.  This is because the programming model at the asm level is
almost entirely scalar.  But we do certainly want masking capability on
scalar instructions.  If the backend doesn't have that it's going to
generate horribly performing code.

                              -Dave
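
Dave's point about masking on scalar instructions can be made concrete with a
small C++ sketch. It is a toy model, not the PTX backend's code and not
LLVM's API: the Inst, Machine, run, and maxViaPredication names are all
hypothetical. Each instruction carries an optional guarding predicate, in the
spirit of the ARM-style approach discussed in this thread, and an instruction
whose guard fails has no effect. The program computes a maximum without any
branch.

```cpp
#include <array>
#include <cassert>
#include <cstdint>
#include <vector>

enum class Op { SetpGt, Mov };

// Hypothetical instruction format: every instruction has a guard.
struct Inst {
    Op op;
    int dst;      // destination register (a predicate register for SetpGt)
    int a, b;     // source registers (b is ignored by Mov)
    int pred;     // guarding predicate register, or -1 for "always execute"
    bool negate;  // if true, execute only when the predicate is false
};

struct Machine {
    std::array<std::int32_t, 8> r{};  // general-purpose registers
    std::array<bool, 4> p{};          // predicate registers
};

// Straight-line execution: there is no branch instruction at all; a guard
// that fails simply turns the instruction into a no-op.
void run(Machine &m, const std::vector<Inst> &prog) {
    for (const Inst &i : prog) {
        if (i.pred >= 0 && m.p[i.pred] == i.negate)
            continue;  // guard not satisfied, skip this instruction
        switch (i.op) {
        case Op::SetpGt: m.p[i.dst] = m.r[i.a] > m.r[i.b]; break;
        case Op::Mov:    m.r[i.dst] = m.r[i.a];            break;
        }
    }
}

// r2 = max(r0, r1) using one compare and two guarded moves.
std::int32_t maxViaPredication(std::int32_t x, std::int32_t y) {
    Machine m;
    m.r[0] = x;
    m.r[1] = y;
    const std::vector<Inst> prog = {
        {Op::SetpGt, 0, 0, 1, -1, false},  //      p0 = r0 > r1
        {Op::Mov,    2, 0, 0,  0, false},  // @p0  r2 = r0
        {Op::Mov,    2, 1, 0,  0, true},   // @!p0 r2 = r1
    };
    run(m, prog);
    return m.r[2];
}
```

A backend without guarded instructions would have to express the same logic
as a compare, a branch, and separate blocks. On hardware where many threads
execute in lockstep that is typically more expensive, which is the
performance concern raised above.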

David A. Greene

2010-Aug-11 21:53 UTC

[LLVMdev] Upstream PTX backend that uses target independent code generator if possible

Che-Liang Chiou <clchiou at gmail.com> writes:

> My implementation of predicated instructions is similar to the ARM
> backend's. I traced the ARM and PowerPC backends for reference.

Cool.

> David, if you were asking about an implementation of predication in the
> LLVM IR, I didn't do that, partly because I was not (and still am not)
> very familiar with LLVM's design, so I didn't know how to do it.

No, I wouldn't have expected you to do that, but I think long-term we
will want to consider it.

                               -Dave
