Che-Liang Chiou
2011-Mar-10 03:03 UTC
[LLVMdev] [PTX] Should we keep backward-compatibility of PTX?
Hi Justin, There are some backward incompatible features of PTX; for example, special registers are redefined as v4i32 (they were v4i16) in PTX 2.0. And CUDA 4.0 was rolled out last week. I heard that some instructions are deprecated. I am not sure how stable (or unstable) PTX specification is. Do you have a rough assessment of its stability? If PTX specification is still fast evolving, I would suggest we keep up with latest specification, and consider backward compatibility later when it is stabilized. What do you think? Regards, Che-Liang
Justin Holewinski
2011-Mar-10 13:15 UTC
[LLVMdev] [PTX] Should we keep backward-compatibility of PTX?
On Wed, Mar 9, 2011 at 10:03 PM, Che-Liang Chiou <clchiou at gmail.com> wrote:> Hi Justin, > > There are some backward incompatible features of PTX; for example, > special registers are redefined as v4i32 (they were v4i16) in PTX 2.0. > And CUDA 4.0 was rolled out last week. I heard that some instructions > are deprecated. >I have not checked out CUDA 4.0 yet, but any deprecated instructions would need to be made so as part of a separate PTX version, either 2.2 or 3.0. My suggestion is to stay thorough with the intrinsics. Lets create i16 and i32 versions of both, and emit appropriate cvt instructions if necessary for the target PTX version. This should be easy enough using the existing PTXVersion field in PTXSubtarget.> > I am not sure how stable (or unstable) PTX specification is. Do you > have a rough assessment of its stability? >>From what I can tell, it is fairly stable. When things change, they seem toprimarily be additions to the ISA.> > If PTX specification is still fast evolving, I would suggest we keep > up with latest specification, and consider backward compatibility > later when it is stabilized. What do you think? >I'm fine with that, as long as later PTX versions do not require later shader models. I want to maintain compatibility as far back as shader model 1.0 for some older hardware I want to test with. Besides, I think most of the functionality in newer PTX versions can be easily predicated with sub-target flags.> > Regards, > Che-Liang >-- Thanks, Justin Holewinski -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20110310/03133d2c/attachment.html>
Possibly Parallel Threads
- [LLVMdev] [PATCH ]Add Subtarget ptx23
- [LLVMdev] [PATCH ]Add Subtarget ptx23
- [LLVMdev] [PATCH ]Add Subtarget ptx23
- [LLVMdev] GPU thread/block/grid size contraints in LLVM PTX backend
- [LLVMdev] Upstream PTX backend that uses target independent code generator if possible