Jonas Maebe
2014-Aug-02  14:51 UTC
[LLVMdev] Create "appending" section that can be partially dead stripped
On 01/08/14 19:37, Reid Kleckner wrote:> What happens if you drop appending linkage? I think it will just work, > since you are already using a custom section, which will ensure that all > the data appears contiguously in memory.Thanks for the suggestion, but it still puts everything in a single .section statement.> Although, I do worry about what LLVM's alias analysis will say about > this... I don't think LLVM allows GEPing from one global to another, and > at the end of the day, you'll GEP from an external global representing > the section start through to the elements of the array to section end.That's true and an interesting point. However, wouldn't that mean that "appending" linkage and "section" globals in general are completely unusable from llvm IR and would only be safely usable from inline assembler or code not compiled by llvm? (unless you don't access such data as a contiguous block, but I guess that doesn't happen very often) Jonas
Reid Kleckner
2014-Aug-04  07:27 UTC
[LLVMdev] Create "appending" section that can be partially dead stripped
On Sat, Aug 2, 2014 at 7:51 AM, Jonas Maebe <jonas.maebe at elis.ugent.be> wrote:> On 01/08/14 19:37, Reid Kleckner wrote: > >> What happens if you drop appending linkage? I think it will just work, >> since you are already using a custom section, which will ensure that all >> the data appears contiguously in memory. >> > > Thanks for the suggestion, but it still puts everything in a single > .section statement.Try giving the globals linkonce_odr linkage instead of external linkage manually? This is essentially the effect of -fdata-sections, except it happens later during codegen. -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20140804/7579e20d/attachment.html>
Jonas Maebe
2014-Aug-05  14:34 UTC
[LLVMdev] Create "appending" section that can be partially dead stripped
On 04 Aug 2014, at 09:27, Reid Kleckner wrote:> On Sat, Aug 2, 2014 at 7:51 AM, Jonas Maebe > <jonas.maebe at elis.ugent.be> > wrote: > >> On 01/08/14 19:37, Reid Kleckner wrote: >> >>> What happens if you drop appending linkage? I think it will just >>> work, >>> since you are already using a custom section, which will ensure >>> that all >>> the data appears contiguously in memory. >> >> Thanks for the suggestion, but it still puts everything in a single >> .section statement. > > Try giving the globals linkonce_odr linkage instead of external > linkage > manually? This is essentially the effect of -fdata-sections, except > it > happens later during codegen.That one indeed works, thanks! (provided I process the .ll file with optimisations enabled). Regarding the question whether the optimisers and analyses will deal correctly with me iterating over all of the elements in such a section, at least at first sight it seems that the test below looks fine when processed with full optimisations. That's of course no proof that there won't be any problems in any case. However, I'm not sure how to add a symbol without data at the start and end of the section. Adding a [i32 x 0]-typed symbol still inserts a byte (see the @arrstart/@arrstop). I can't work with alias declarations to the first and last element, since then the first and last element will always be considered live. Jonas target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32- i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64- f80:128:128-n8:16:32:64-S128" target triple = "x86_64-pc-linux-gnu" define i32 @main() { Entry: %sumvar = alloca i32 ; make the second and third element live by directly referring them %eleptr1 = getelementptr [2 x i32]* @arr2, i64 0, i32 0 %eleptr2 = getelementptr [2 x i32]* @arr3, i64 0, i32 0 %ele1 = load i32* %eleptr1 %ele2 = load i32* %eleptr2 %sum1 = add i32 %ele1, %ele2 store i32 %sum1, i32* %sumvar ; now loop over the entire array by using @arrstart and @arrstop %loopstop = ptrtoint [0 x i32]* @arrstop to i64 %loopstart = ptrtoint [0 x i32]* @arrstart to i64 %loopcount = sub i64 %loopstop, %loopstart %looparrinit = bitcast [0 x i32]* @arrstart to i32* br label %LoopStart ; sum all elements in the array LoopStart: %looparr = phi i32* [%looparrinit, %Entry], [%looparrnext, %LoopBody] %loopcond = icmp eq i64 %loopcount, 0 br i1 %loopcond, label %LoopEnd, label %LoopBody LoopBody: %val = load i32* %looparr %sum2 = load i32* %sumvar %sum3 = add i32 %val, %sum2 store i32 %sum3, i32* %sumvar %looparrnext = getelementptr i32* %looparr, i64 1 br label %LoopStart ; return the sum LoopEnd: %retval = load i32* %sumvar ret i32 %retval } ; this declaration inserts a 0 byte @arrstart = global [0 x i32] [ ], section "mytest" @arr1 = linkonce_odr global [2 x i32] [ i32 1, i32 2 ], section "mytest" @arr2 = linkonce_odr global [2 x i32] [ i32 3, i32 4 ], section "mytest" @arr3 = linkonce_odr global [2 x i32] [ i32 5, i32 6 ], section "mytest" ; this declaration inserts a 0 byte @arrstop = global [0 x i32] [ ], section "mytest"