Erkki Lindpere
2011-Jul-28 22:20 UTC
[LLVMdev] Is using lots of in-register values in IR bad?
Hi, I'm a newbie when it comes to compilers (and even close-to-machine coding), but recently started working on my own language and am using LLVM as the mid/backend. Currently I'm generating .ll files from a front-end written in Scala. The language is not really serious, just a way to learn more about compilers (and LLVM) and maybe serve as a base for further experiments. It's somewhat based on Kaleidoscope, but with more Scala-like syntax. I want to experiment with avoiding mutable state as far as I can. At the moment there are no mutable variables -- only immutable value types (numerics, bool, vectors, tuples) and I've been doing everything in LLVM registers. The compiler doesn't generate a single alloca, load or store at the moment. I wonder if it was maybe a bad idea to do it this way? Because a lot of stuff in LLVM seem to be only available through pointers. e.g. extractvalue takes only constant indices, but GEP can take variables. Some things seem to be possible only by bitcasting pointers, e.g. splitting a Vector into equal-sized parts to partially compute the sum of it's elements with SIMD instructions... And there may of course be some penalty for passing large(-ish) structures by-value. I haven't investigated at which sizes does that become worse than passing pointers. Maybe a better alternative would be to allocate memory for every local value, and let the mem2reg pass optimize? I hope these kind of questions are appropriate for this list. Regards, Erkki Lindpere -------------- next part -------------- An HTML attachment was scrubbed... URL: <lists.llvm.org/pipermail/llvm-dev/attachments/20110729/4ed5524d/attachment.html>
David A. Greene
2011-Jul-28 23:06 UTC
[LLVMdev] Is using lots of in-register values in IR bad?
Erkki Lindpere <villane at gmail.com> writes:> I want to experiment with avoiding mutable state as far as I can. At > the moment there are no mutable variables -- only immutable value > types (numerics, bool, vectors, tuples) and I've been doing everything > in LLVM registers. The compiler doesn't generate a single alloca, load > or store at the moment.Ok. Do you ever need to grab the address of something on the stack? If so you're going to need an alloca. AFAIK, it's the only way to generate an address for a local object. This is by design of the IR and it greatly simplifies analysis. How do you handle global data? That can only be accessed in LLVM IR via load/store. A GlobalValue is an address by definition.> I wonder if it was maybe a bad idea to do it this way? Because a lot > of stuff in LLVM seem to be only available through pointers. e.g. > extractvalue takes only constant indices, but GEP can take variables.Yeah, this is quite a limitation of the current IR. It is lacking a few fundamental operations that, for example, vector machines of the '60's and '70's implemented directly. Extract/insert from/to variable index being one of them. Extractvalue is a little more complicated, of course, but special cases of it are implemented on x86 (for example) and other "modernish" targets. For cases like these, it is best to create a target-specific intrinsic and use that to represent the operation. For operations not implemented directly by the target, an alloca+GEP may be necessary.> Some things seem to be possible only by bitcasting pointers, e.g. > splitting a Vector into equal-sized parts to partially compute the sum > of it's elements with SIMD instructions...That doesn't seem like the Right Way to do it. As in the extractvalue case, the IR has no direct support for vector reductions. If your target has these kinds of operations, you should probably use an intrinsic to implement them. Think of target intrinsics as a way to extend the IR for special operations. The analysis and transformation passes won't understand them but typically in these cases you "know" the right sequence to generate.> And there may of course be some penalty for passing large(-ish) > structures by-value. I haven't investigated at which sizes does that > become worse than passing pointers.It is highly target-dependent. But usually the target's ABI has already made that decision for you. In the case of pass-by-address you will need an alloca.> Maybe a better alternative would be to allocate memory for every local > value, and let the mem2reg pass optimize?That is often simpler. Then the translation of every object from your high-level language to LLVM IR looks the same. But it is not strictly necessary.> I hope these kind of questions are appropriate for this list.Absolutely. Welcome! -Dave
Erkki Lindpere
2011-Jul-29 08:58 UTC
[LLVMdev] Is using lots of in-register values in IR bad?
Thanks for the reply On Fri, Jul 29, 2011 at 2:06 AM, David A. Greene <greened at obbligato.org>wrote:> Erkki Lindpere <villane at gmail.com> writes: > Ok. Do you ever need to grab the address of something on the stack? If > so you're going to need an alloca. AFAIK, it's the only way to generate > an address for a local object. This is by design of the IR and it > greatly simplifies analysis. > > How do you handle global data? That can only be accessed in LLVM IR via > load/store. A GlobalValue is an address by definition. > > At the moment I do have global string constants (for passing to libcfunctions such as puts), but I'm adding other types of global values soon so I guess I'll need to start thinking about that.> I wonder if it was maybe a bad idea to do it this way? Because a lot > > of stuff in LLVM seem to be only available through pointers. e.g. > > extractvalue takes only constant indices, but GEP can take variables. > > For cases like these, it is best to create a target-specific intrinsic > and use that to represent the operation. For operations not implemented > directly by the target, an alloca+GEP may be necessary. >I want to be target-neutral actually (but generate well performing code for the x86 / x86-64)> Some things seem to be possible only by bitcasting pointers, e.g. > > splitting a Vector into equal-sized parts to partially compute the sum > > of it's elements with SIMD instructions... > > That doesn't seem like the Right Way to do it. As in the extractvalue > case, the IR has no direct support for vector reductions. If your > target has these kinds of operations, you should probably use an > intrinsic to implement them. >Ah, ok. I didn't notice before that x86 SSE actually has vector reductions, but I looked it up now and indeed it does. I guess I could introduce some target-specific codegen, then. Because at the moment I'm only using x86 anyway and in the future I may want to delay the code generation to install time. Thanks, Erkki -------------- next part -------------- An HTML attachment was scrubbed... URL: <lists.llvm.org/pipermail/llvm-dev/attachments/20110729/b1155218/attachment.html>
Duncan Sands
2011-Jul-29 09:24 UTC
[LLVMdev] Is using lots of in-register values in IR bad?
Hi Erkki,> I want to experiment with avoiding mutable state as far as I can. At the moment > there are no mutable variables -- only immutable value types (numerics, bool, > vectors, tuples) and I've been doing everything in LLVM registers. The compiler > doesn't generate a single alloca, load or store at the moment. > > I wonder if it was maybe a bad idea to do it this way? Because a lot of stuff in > LLVM seem to be only available through pointers. e.g. extractvalue takes only > constant indices, but GEP can take variables. Some things seem to be possible > only by bitcasting pointers, e.g. splitting a Vector into equal-sized parts to > partially compute the sum of it's elements with SIMD instructions...splitting a vector can (and should) be done using the shufflevector instruction.> And there may of course be some penalty for passing large(-ish) structures > by-value.In-register structs and arrays are not intended to be used for large structs and arrays. They are intended be used for small objects like complex numbers (two elements), (pointer,size) pairs and so on. Use memory (pointers) for anything larger.> Maybe a better alternative would be to allocate memory for every local value, > and let the mem2reg pass optimize?Most front-ends do that. The dragonegg front-end is a bit different: it uses registers directly for scalars and complex numbers, and memory for everything else. Ciao, Duncan.
Carlo Alberto Ferraris
2011-Jul-29 13:44 UTC
[LLVMdev] Is using lots of in-register values in IR bad?
Nella citazione venerdì 29 luglio 2011 11:24:47, Duncan Sands ha scritto:> In-register structs and arrays are not intended to be used for large structs and > arrays.Out of curiosity, why would that be the case? -- Carlo Alberto Ferraris <cafxx at strayorange.com <mailto:cafxx at strayorange.com>> website/blog <cafxx.strayorange.com> - +39 333 7643 235 -------------- next part -------------- A non-text attachment was scrubbed... Name: cafxx.vcf Type: text/x-vcard Size: 233 bytes Desc: not available URL: <lists.llvm.org/pipermail/llvm-dev/attachments/20110729/cb98951b/attachment.vcf>