palpar via llvm-dev
2018-Sep-14  15:16 UTC
[llvm-dev] Function calls keep increasing the stack usage
Hi everyone,
I found that LLVM generates redundant code when calling functions with
constant parameters, with optimizations disabled.
Consider the following C code snippet:
int foo(int x, int y);
void bar()
{
foo(1, 2);
foo(3, 4);
}
Clang/LLVM 6.0 generates the following assembly code:
_bar:
subl $32, %esp
movl $1, %eax
movl $2, %ecx
movl $1, (%esp)
movl $2, 4(%esp)
movl %eax, 28(%esp)
movl %ecx, 24(%esp)
calll _foo
movl $3, %ecx
movl $4, %edx
movl $3, (%esp)
movl $4, 4(%esp)
movl %eax, 20(%esp)
movl %ecx, 16(%esp)
movl %edx, 12(%esp)
calll _foo
movl %eax, 8(%esp)
addl $32, %esp
retl
Note how the constants are stored in registers but when saving the
parameters on the stack for the call the immediate values are used. The
registers are still stored on the stack probably because it's the
caller's
responsibility once they were used (which seems expected).
I think the problem comes from the fact that LLVM unconditionally allocates
a register for each parameter value regardless if it's used later or not.
If the stack space of the program is sufficiently large this is probably
not a problem, but otherwise if there is a large number of such calls,
despite not recursive, it can lead to stack overflow. Do you think I should
create a bug report for this?
(Similarly, the return value of the function could be not saved but the
LLVM IR code that Clang generates has the call with assignment so at this
point LLVM couldn't possibly know.
define void @bar() #0 {
  %call = call i32 @foo(i32 1, i32 2)
  %call1 = call i32 @foo(i32 3, i32 4)
  ret void
}
)
Thanks,
Alpar
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20180914/25ba1341/attachment.html>
David Blaikie via llvm-dev
2018-Sep-14  18:15 UTC
[llvm-dev] Function calls keep increasing the stack usage
Can't say I've observed that behavior (though I'm just building from
top-of-tree rather than 6.0, compiling for x86-64 on linux), perhaps you
could provide more detail (what target are you compiling for - possibly
provide the -cc1 command line, etc).
bar:                                    # @bar
        .cfi_startproc
# %bb.0:                                # %entry
        pushq   %rbp
        .cfi_def_cfa_offset 16
        .cfi_offset %rbp, -16
        movq    %rsp, %rbp
        .cfi_def_cfa_register %rbp
        subq    $16, %rsp
        movl    $1, %edi
        movl    $2, %esi
        callq   foo
        movl    $3, %edi
        movl    $4, %esi
        movl    %eax, -4(%rbp)          # 4-byte Spill
        callq   foo
        movl    %eax, -8(%rbp)          # 4-byte Spill
        addq    $16, %rsp
        popq    %rbp
        .cfi_def_cfa %rsp, 8
        retq
Or on 32-bit X86:
bar:                                    # @bar
        .cfi_startproc
# %bb.0:                                # %entry
        pushq   %rbp
        .cfi_def_cfa_offset 16
        .cfi_offset %rbp, -16
        movq    %rsp, %rbp
        .cfi_def_cfa_register %rbp
        subq    $16, %rsp
        movl    $1, %edi
        movl    $2, %esi
        callq   foo
        movl    $3, %edi
        movl    $4, %esi
        movl    %eax, -4(%rbp)          # 4-byte Spill
        callq   foo
        movl    %eax, -8(%rbp)          # 4-byte Spill
        addq    $16, %rsp
        popq    %rbp
        .cfi_def_cfa %rsp, 8
        retq
On Fri, Sep 14, 2018 at 8:16 AM palpar via llvm-dev <llvm-dev at
lists.llvm.org>
wrote:
> Hi everyone,
>
> I found that LLVM generates redundant code when calling functions with
> constant parameters, with optimizations disabled.
>
> Consider the following C code snippet:
>
> int foo(int x, int y);
>
> void bar()
> {
> foo(1, 2);
> foo(3, 4);
> }
>
> Clang/LLVM 6.0 generates the following assembly code:
> _bar:
> subl $32, %esp
> movl $1, %eax
> movl $2, %ecx
> movl $1, (%esp)
> movl $2, 4(%esp)
> movl %eax, 28(%esp)
> movl %ecx, 24(%esp)
> calll _foo
> movl $3, %ecx
> movl $4, %edx
> movl $3, (%esp)
> movl $4, 4(%esp)
> movl %eax, 20(%esp)
> movl %ecx, 16(%esp)
> movl %edx, 12(%esp)
> calll _foo
> movl %eax, 8(%esp)
> addl $32, %esp
> retl
>
> Note how the constants are stored in registers but when saving the
> parameters on the stack for the call the immediate values are used. The
> registers are still stored on the stack probably because it's the
caller's
> responsibility once they were used (which seems expected).
> I think the problem comes from the fact that LLVM unconditionally
> allocates a register for each parameter value regardless if it's used
later
> or not.
> If the stack space of the program is sufficiently large this is probably
> not a problem, but otherwise if there is a large number of such calls,
> despite not recursive, it can lead to stack overflow. Do you think I should
> create a bug report for this?
>
> (Similarly, the return value of the function could be not saved but the
> LLVM IR code that Clang generates has the call with assignment so at this
> point LLVM couldn't possibly know.
> define void @bar() #0 {
>   %call = call i32 @foo(i32 1, i32 2)
>   %call1 = call i32 @foo(i32 3, i32 4)
>   ret void
> }
> )
>
> Thanks,
> Alpar
> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org
> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20180914/bb2265cd/attachment.html>
Reid Kleckner via llvm-dev
2018-Sep-14  18:26 UTC
[llvm-dev] Function calls keep increasing the stack usage
I think this is a bug in fastisel. It sometimes generates unnecessary materializations like this. It's unclear why they end up getting spilled, though. On Fri, Sep 14, 2018 at 8:16 AM palpar via llvm-dev <llvm-dev at lists.llvm.org> wrote:> Hi everyone, > > I found that LLVM generates redundant code when calling functions with > constant parameters, with optimizations disabled. > > Consider the following C code snippet: > > int foo(int x, int y); > > void bar() > { > foo(1, 2); > foo(3, 4); > } > > Clang/LLVM 6.0 generates the following assembly code: > _bar: > subl $32, %esp > movl $1, %eax > movl $2, %ecx > movl $1, (%esp) > movl $2, 4(%esp) > movl %eax, 28(%esp) > movl %ecx, 24(%esp) > calll _foo > movl $3, %ecx > movl $4, %edx > movl $3, (%esp) > movl $4, 4(%esp) > movl %eax, 20(%esp) > movl %ecx, 16(%esp) > movl %edx, 12(%esp) > calll _foo > movl %eax, 8(%esp) > addl $32, %esp > retl > > Note how the constants are stored in registers but when saving the > parameters on the stack for the call the immediate values are used. The > registers are still stored on the stack probably because it's the caller's > responsibility once they were used (which seems expected). > I think the problem comes from the fact that LLVM unconditionally > allocates a register for each parameter value regardless if it's used later > or not. > If the stack space of the program is sufficiently large this is probably > not a problem, but otherwise if there is a large number of such calls, > despite not recursive, it can lead to stack overflow. Do you think I should > create a bug report for this? > > (Similarly, the return value of the function could be not saved but the > LLVM IR code that Clang generates has the call with assignment so at this > point LLVM couldn't possibly know. > define void @bar() #0 { > %call = call i32 @foo(i32 1, i32 2) > %call1 = call i32 @foo(i32 3, i32 4) > ret void > } > ) > > Thanks, > Alpar > _______________________________________________ > LLVM Developers mailing list > llvm-dev at lists.llvm.org > http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20180914/0c93586e/attachment.html>
palpar via llvm-dev
2018-Sep-14  18:57 UTC
[llvm-dev] Function calls keep increasing the stack usage
Sorry I missed that important detail. The relevant part of the command line is: -cc1 -S -triple i386-pc-win32 I don't expect it matters if it's for Windows or Linux in this case. On Fri, Sep 14, 2018 at 9:16 PM David Blaikie <dblaikie at gmail.com> wrote:> Can't say I've observed that behavior (though I'm just building from > top-of-tree rather than 6.0, compiling for x86-64 on linux), perhaps you > could provide more detail (what target are you compiling for - possibly > provide the -cc1 command line, etc). > > bar: # @bar > .cfi_startproc > # %bb.0: # %entry > pushq %rbp > .cfi_def_cfa_offset 16 > .cfi_offset %rbp, -16 > movq %rsp, %rbp > .cfi_def_cfa_register %rbp > subq $16, %rsp > movl $1, %edi > movl $2, %esi > callq foo > movl $3, %edi > movl $4, %esi > movl %eax, -4(%rbp) # 4-byte Spill > callq foo > movl %eax, -8(%rbp) # 4-byte Spill > addq $16, %rsp > popq %rbp > .cfi_def_cfa %rsp, 8 > retq > > > Or on 32-bit X86: > > bar: # @bar > .cfi_startproc > # %bb.0: # %entry > pushq %rbp > .cfi_def_cfa_offset 16 > .cfi_offset %rbp, -16 > movq %rsp, %rbp > .cfi_def_cfa_register %rbp > subq $16, %rsp > movl $1, %edi > movl $2, %esi > callq foo > movl $3, %edi > movl $4, %esi > movl %eax, -4(%rbp) # 4-byte Spill > callq foo > movl %eax, -8(%rbp) # 4-byte Spill > addq $16, %rsp > popq %rbp > .cfi_def_cfa %rsp, 8 > retq > > > On Fri, Sep 14, 2018 at 8:16 AM palpar via llvm-dev < > llvm-dev at lists.llvm.org> wrote: > >> Hi everyone, >> >> I found that LLVM generates redundant code when calling functions with >> constant parameters, with optimizations disabled. >> >> Consider the following C code snippet: >> >> int foo(int x, int y); >> >> void bar() >> { >> foo(1, 2); >> foo(3, 4); >> } >> >> Clang/LLVM 6.0 generates the following assembly code: >> _bar: >> subl $32, %esp >> movl $1, %eax >> movl $2, %ecx >> movl $1, (%esp) >> movl $2, 4(%esp) >> movl %eax, 28(%esp) >> movl %ecx, 24(%esp) >> calll _foo >> movl $3, %ecx >> movl $4, %edx >> movl $3, (%esp) >> movl $4, 4(%esp) >> movl %eax, 20(%esp) >> movl %ecx, 16(%esp) >> movl %edx, 12(%esp) >> calll _foo >> movl %eax, 8(%esp) >> addl $32, %esp >> retl >> >> Note how the constants are stored in registers but when saving the >> parameters on the stack for the call the immediate values are used. The >> registers are still stored on the stack probably because it's the caller's >> responsibility once they were used (which seems expected). >> I think the problem comes from the fact that LLVM unconditionally >> allocates a register for each parameter value regardless if it's used later >> or not. >> If the stack space of the program is sufficiently large this is probably >> not a problem, but otherwise if there is a large number of such calls, >> despite not recursive, it can lead to stack overflow. Do you think I should >> create a bug report for this? >> >> (Similarly, the return value of the function could be not saved but the >> LLVM IR code that Clang generates has the call with assignment so at this >> point LLVM couldn't possibly know. >> define void @bar() #0 { >> %call = call i32 @foo(i32 1, i32 2) >> %call1 = call i32 @foo(i32 3, i32 4) >> ret void >> } >> ) >> >> Thanks, >> Alpar >> _______________________________________________ >> LLVM Developers mailing list >> llvm-dev at lists.llvm.org >> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev >> >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20180914/c11ad652/attachment.html>
palpar via llvm-dev
2018-Sep-14  19:09 UTC
[llvm-dev] Function calls keep increasing the stack usage
Thanks for confirming that it seems to be a bug. I was studying the code you mentioned and X86FastISel::fastLowerCall() calls getRegForValue() for each argument. Then the second iteration over the arguments (after it called CCInfo.AnalyzeCallOperands()) may decide to call X86FastEmitStore() if not VA.isRegLoc() and ArgVal is a constant (ignoring the register, but that was already marked used I guess by getRegForValue). http://llvm.org/viewvc/llvm-project/llvm/trunk/lib/Target/X86/X86FastISel.cpp?view=markup Could we maybe delay the call to getRegForValue() after the analyze so that we don't call it when not needed? On Fri, Sep 14, 2018 at 9:26 PM Reid Kleckner <rnk at google.com> wrote:> I think this is a bug in fastisel. It sometimes generates unnecessary > materializations like this. It's unclear why they end up getting spilled, > though. > > On Fri, Sep 14, 2018 at 8:16 AM palpar via llvm-dev < > llvm-dev at lists.llvm.org> wrote: > >> Hi everyone, >> >> I found that LLVM generates redundant code when calling functions with >> constant parameters, with optimizations disabled. >> >> Consider the following C code snippet: >> >> int foo(int x, int y); >> >> void bar() >> { >> foo(1, 2); >> foo(3, 4); >> } >> >> Clang/LLVM 6.0 generates the following assembly code: >> _bar: >> subl $32, %esp >> movl $1, %eax >> movl $2, %ecx >> movl $1, (%esp) >> movl $2, 4(%esp) >> movl %eax, 28(%esp) >> movl %ecx, 24(%esp) >> calll _foo >> movl $3, %ecx >> movl $4, %edx >> movl $3, (%esp) >> movl $4, 4(%esp) >> movl %eax, 20(%esp) >> movl %ecx, 16(%esp) >> movl %edx, 12(%esp) >> calll _foo >> movl %eax, 8(%esp) >> addl $32, %esp >> retl >> >> Note how the constants are stored in registers but when saving the >> parameters on the stack for the call the immediate values are used. The >> registers are still stored on the stack probably because it's the caller's >> responsibility once they were used (which seems expected). >> I think the problem comes from the fact that LLVM unconditionally >> allocates a register for each parameter value regardless if it's used later >> or not. >> If the stack space of the program is sufficiently large this is probably >> not a problem, but otherwise if there is a large number of such calls, >> despite not recursive, it can lead to stack overflow. Do you think I should >> create a bug report for this? >> >> (Similarly, the return value of the function could be not saved but the >> LLVM IR code that Clang generates has the call with assignment so at this >> point LLVM couldn't possibly know. >> define void @bar() #0 { >> %call = call i32 @foo(i32 1, i32 2) >> %call1 = call i32 @foo(i32 3, i32 4) >> ret void >> } >> ) >> >> Thanks, >> Alpar >> _______________________________________________ >> LLVM Developers mailing list >> llvm-dev at lists.llvm.org >> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev >> >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20180914/1038d021/attachment.html>
Matthias Braun via llvm-dev
2018-Sep-17  20:27 UTC
[llvm-dev] Function calls keep increasing the stack usage
- Questions like this are better suited for bugs.llvm.org <http://bugs.llvm.org/> - Please provide the full clang invocation. In this case I suspect your problem will go away if you use `-O2` or similar. - Matthias> On Sep 14, 2018, at 8:16 AM, palpar via llvm-dev <llvm-dev at lists.llvm.org> wrote: > > Hi everyone, > > I found that LLVM generates redundant code when calling functions with constant parameters, with optimizations disabled. > > Consider the following C code snippet: > > int foo(int x, int y); > > void bar() > { > foo(1, 2); > foo(3, 4); > } > > Clang/LLVM 6.0 generates the following assembly code: > _bar: > subl $32, %esp > movl $1, %eax > movl $2, %ecx > movl $1, (%esp) > movl $2, 4(%esp) > movl %eax, 28(%esp) > movl %ecx, 24(%esp) > calll _foo > movl $3, %ecx > movl $4, %edx > movl $3, (%esp) > movl $4, 4(%esp) > movl %eax, 20(%esp) > movl %ecx, 16(%esp) > movl %edx, 12(%esp) > calll _foo > movl %eax, 8(%esp) > addl $32, %esp > retl > > Note how the constants are stored in registers but when saving the parameters on the stack for the call the immediate values are used. The registers are still stored on the stack probably because it's the caller's responsibility once they were used (which seems expected). > I think the problem comes from the fact that LLVM unconditionally allocates a register for each parameter value regardless if it's used later or not. > If the stack space of the program is sufficiently large this is probably not a problem, but otherwise if there is a large number of such calls, despite not recursive, it can lead to stack overflow. Do you think I should create a bug report for this? > > (Similarly, the return value of the function could be not saved but the LLVM IR code that Clang generates has the call with assignment so at this point LLVM couldn't possibly know. > define void @bar() #0 { > %call = call i32 @foo(i32 1, i32 2) > %call1 = call i32 @foo(i32 3, i32 4) > ret void > } > ) > > Thanks, > Alpar > _______________________________________________ > LLVM Developers mailing list > llvm-dev at lists.llvm.org > http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20180917/20a83300/attachment.html>
David Blaikie via llvm-dev
2018-Sep-17  20:30 UTC
[llvm-dev] Function calls keep increasing the stack usage
On Mon, Sep 17, 2018 at 1:27 PM Matthias Braun via llvm-dev < llvm-dev at lists.llvm.org> wrote:> - Questions like this are better suited for bugs.llvm.org >Not sure - I think it's pretty fine for discussion here, when someone's not sure it's a bug/intended, etc.> - Please provide the full clang invocation. In this case I suspect your > problem will go away if you use `-O2` or similar. >Right, I think the original post calls this out as specifically being about -O0. - Dave> > - Matthias > > > On Sep 14, 2018, at 8:16 AM, palpar via llvm-dev <llvm-dev at lists.llvm.org> > wrote: > > Hi everyone, > > I found that LLVM generates redundant code when calling functions with > constant parameters, with optimizations disabled. > > Consider the following C code snippet: > > int foo(int x, int y); > > void bar() > { > foo(1, 2); > foo(3, 4); > } > > Clang/LLVM 6.0 generates the following assembly code: > _bar: > subl $32, %esp > movl $1, %eax > movl $2, %ecx > movl $1, (%esp) > movl $2, 4(%esp) > movl %eax, 28(%esp) > movl %ecx, 24(%esp) > calll _foo > movl $3, %ecx > movl $4, %edx > movl $3, (%esp) > movl $4, 4(%esp) > movl %eax, 20(%esp) > movl %ecx, 16(%esp) > movl %edx, 12(%esp) > calll _foo > movl %eax, 8(%esp) > addl $32, %esp > retl > > Note how the constants are stored in registers but when saving the > parameters on the stack for the call the immediate values are used. The > registers are still stored on the stack probably because it's the caller's > responsibility once they were used (which seems expected). > I think the problem comes from the fact that LLVM unconditionally > allocates a register for each parameter value regardless if it's used later > or not. > If the stack space of the program is sufficiently large this is probably > not a problem, but otherwise if there is a large number of such calls, > despite not recursive, it can lead to stack overflow. Do you think I should > create a bug report for this? > > (Similarly, the return value of the function could be not saved but the > LLVM IR code that Clang generates has the call with assignment so at this > point LLVM couldn't possibly know. > define void @bar() #0 { > %call = call i32 @foo(i32 1, i32 2) > %call1 = call i32 @foo(i32 3, i32 4) > ret void > } > ) > > Thanks, > Alpar > _______________________________________________ > LLVM Developers mailing list > llvm-dev at lists.llvm.org > http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev > > > _______________________________________________ > LLVM Developers mailing list > llvm-dev at lists.llvm.org > http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20180917/c2f30a0a/attachment.html>
Possibly Parallel Threads
- Function calls keep increasing the stack usage
- [newbie] trouble with global variables and CreateLoad/Store in JIT
- [newbie] trouble with global variables and CreateLoad/Store in JIT
- [newbie] trouble with global variables and CreateLoad/Store in JIT
- [LLVMdev] [RFC] [X86] Mov to push transformation in x86-32 call sequences