Dear all,

Is anybody working on CUDA launch bounds support?

At the PTX level, __attribute__((launch_bounds(MAX_THREADS_PER_BLOCK, MIN_BLOCKS_PER_MP))) should be emitted as the .maxntid / .minnctapersm directives.

Thanks,
- D.
Yes, this is supported through metadata. An example of how these annotations are used is given in the test/CodeGen/NVPTX/annotations.ll unit test. I'll try to remember to add this to the NVPTX documentation I'm putting together at http://llvm.org/docs/NVPTXUsage.html.

On Mon, Apr 1, 2013 at 8:06 AM, Dmitry Mikushin <dmitry at kernelgen.org> wrote:
> Is anybody working on CUDA launch bounds support?

--
Thanks,
Justin Holewinski
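For readers who want to see the mapping from the question end to end, here is a minimal CUDA sketch. It is illustrative only: the kernel name `scale`, the limit values 256 and 2, and the use of nvcc's `__launch_bounds__` spelling (rather than the `__attribute__` form quoted above) are assumptions, not anything taken from the thread. The metadata key names mentioned in the comments are recalled from the NVPTX backend and should be checked against the annotations.ll test.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical limits, chosen for illustration only.
#define MAX_THREADS_PER_BLOCK 256
#define MIN_BLOCKS_PER_MP 2

// With these bounds, the PTX entry for this kernel is expected to carry
//   .maxntid 256, 1, 1
//   .minnctapersm 2
// In LLVM IR the same information travels as !nvvm.annotations metadata
// (assumed keys: "maxntidx" and "minctasm"; verify against annotations.ll).
__global__ void __launch_bounds__(MAX_THREADS_PER_BLOCK, MIN_BLOCKS_PER_MP)
scale(float *data, float factor, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        data[i] *= factor;
}

int main()
{
    const int n = 1024;
    float *d = nullptr;
    cudaMalloc(&d, n * sizeof(float));
    cudaMemset(d, 0, n * sizeof(float));

    // The block size must not exceed MAX_THREADS_PER_BLOCK,
    // otherwise the launch is rejected at run time.
    int blocks = (n + MAX_THREADS_PER_BLOCK - 1) / MAX_THREADS_PER_BLOCK;
    scale<<<blocks, MAX_THREADS_PER_BLOCK>>>(d, 2.0f, n);

    // Report whether the launch and execution succeeded.
    cudaError_t err = cudaDeviceSynchronize();
    printf("%s\n", cudaGetErrorString(err));

    cudaFree(d);
    return 0;
}
```

Generating PTX from this file (for example with `nvcc -ptx`) and searching for `.maxntid` is a quick way to confirm that the bounds reached the backend.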
Wow, great, thanks! I will give it a try.

- D.

2013/4/2 Justin Holewinski <justin.holewinski at gmail.com>:
> Yes, this is supported through metadata. An example usage of these
> annotations is given in the test/CodeGen/NVPTX/annotations.ll unit test.