thr3ads.net - llvm dev - [llvm-dev] Supporting Regular and Thin LTO with a Single LTO Bitcode Format [Apr 2021]

If this information is useful, please help other people find it:
Share via:

via llvm-dev

2021-Apr-13 21:46 UTC

[llvm-dev] Supporting Regular and Thin LTO with a Single LTO Bitcode Format

Hi Petr,

This does sound like a good use case for our pipeline. We’ve seen good runtime
performance overall, as we stated in the talk. I’ve been working on upstreaming
our patches off and on for a couple months now. Our pipeline needs to be ported
to the NPM, but once that work is done, the patch is ready for review. I should
be able to finish that work within the next month or two and would love to get
some feedback on our approach.

Thanks,
Matthew


From: Steven Wu <stevenwu at apple.com>
Sent: Tuesday, April 13, 2021 2:39 PM
To: David Blaikie <dblaikie at gmail.com>
Cc: Petr Hosek <phosek at google.com>; Voss, Matthew <Matthew.Voss at
sony.com>; Teresa Johnson <tejohnson at google.com>; llvm-dev
<llvm-dev at lists.llvm.org>
Subject: Re: [llvm-dev] Supporting Regular and Thin LTO with a Single LTO
Bitcode Format

This is a really good thread to read:
https://lists.llvm.org/pipermail/llvm-dev/2018-April/122469.html

There is no fundamental technical reasons why this cannot happen but it requires
lots of work to fine tuning the pipeline (yes, fullLTO and thinLTO uses
different pipeline) so that it reaches a good balance of performance/build
overhead for general users.

Steven


On Apr 13, 2021, at 2:23 PM, David Blaikie via llvm-dev <llvm-dev at
lists.llvm.org<mailto:llvm-dev at lists.llvm.org>> wrote:

+Matthew and Teresa for any context they might have

High level sounds like a reasonable thing to me, for what it's worth.

On Tue, Apr 13, 2021 at 2:19 PM Petr Hosek via llvm-dev
<llvm-dev at lists.llvm.org<mailto:llvm-dev at lists.llvm.org>>
wrote:


We're using regular LTO for our Clang toolchain build because we don't
mind spending more resources to squeeze out as much performance as possible.
However, when looking into our build performance, I've noticed that we only
spent about 1/3 of the total build time in building distribution components, the
rest is spent on building unit tests and tools that are only used by lit tests.
For the latter, we don't care about the performance, so it'd be nice to
avoid doing regular LTO to speed up the build.

The idea I had would be to use a single LTO bitcode format for all translation
units, and then decide only at link time whether to use regular LTO for
distribution components or ThinLTO for everything else.

After doing some research, I found the "Supporting Regular and Thin LTO
with a Single LTO Bitcode Format" talk presented by Matthew Voss at LLVM
Developers’ Meeting 2019 which does exactly what I described, but it seems like
this was only implemented downstream.

Has there been any progress on upstreaming the implementation? Is there any way
to do what I described using the in-tree LTO implementation?
_______________________________________________
LLVM Developers mailing list
llvm-dev at lists.llvm.org<mailto:llvm-dev at lists.llvm.org>
https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
_______________________________________________
LLVM Developers mailing list
llvm-dev at lists.llvm.org<mailto:llvm-dev at lists.llvm.org>
https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev

-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20210413/863ae1b6/attachment.html>

Mehdi AMINI via llvm-dev

2021-Apr-13 22:01 UTC

head link

[llvm-dev] Supporting Regular and Thin LTO with a Single LTO Bitcode Format

Hi,

I actually just watched the presentation, but it is a bit too high level
for me to really understand what is this new "format" in practice.
As far as I remember ThinLTO bitcode should be a super-set of information
compared to FullLTO. It isn't clear to me what prevents a linker
implementation to take the ThinLTO bitcode and perform FullLTO on these,
this was designed to allow this originally.

The optimization pipelines are set up differently, but that does not really
affect the format I believe. A good FullLTO link implementation based on
the ThinLTO bitcode would likely involve running a heavier pipeline during
LTO, this is something I tried to do at the time to align the pipeline and
ensure there is only one, however it was making the LTO link slower and we
considered it wasn't a good tradeoff.
We could also run part of the pipeline on individual modules (in parallel)
before linking them together during FullLTO, which would address the
performance question. It wouldn't however help when the same modules are
linked into different binaries, like it is the case for LLVM test binaries
which are linking over and over the same set of libraries. For this setup
(which may not be common though) it is better to do as much work as
possible in the first compilation phase and as little as possible during
the link phase.
> the rest is spent on building unit tests and tools that are only used bylit tests. For the latter, we don't care about the performance, so it'd
be
nice to avoid doing regular LTO to speed up the build.

For such a case, the "ideal" setup would be to link these in a ThinLTO
mode
where we disable cross-module importing and just optimize/codegen in
parallel. We could guarantee this way that each file is codegened once and
cached and every test binary would have a cache hit on the files. This
makes it "almost zero" cost over non-LTO I think.

Best,

-- 
Mehdi

On Tue, Apr 13, 2021 at 2:46 PM via llvm-dev <llvm-dev at lists.llvm.org>
wrote:
> Hi Petr,
>
>
>
> This does sound like a good use case for our pipeline. We’ve seen good
> runtime performance overall, as we stated in the talk. I’ve been working on
> upstreaming our patches off and on for a couple months now. Our pipeline
> needs to be ported to the NPM, but once that work is done, the patch is
> ready for review. I should be able to finish that work within the next
> month or two and would love to get some feedback on our approach.
>
>
>
> Thanks,
>
> Matthew
>
>
>
>
>
> *From:* Steven Wu <stevenwu at apple.com>
> *Sent:* Tuesday, April 13, 2021 2:39 PM
> *To:* David Blaikie <dblaikie at gmail.com>
> *Cc:* Petr Hosek <phosek at google.com>; Voss, Matthew
<Matthew.Voss at sony.com>;
> Teresa Johnson <tejohnson at google.com>; llvm-dev <llvm-dev at
lists.llvm.org>
> *Subject:* Re: [llvm-dev] Supporting Regular and Thin LTO with a Single
> LTO Bitcode Format
>
>
>
> This is a really good thread to read:
> https://lists.llvm.org/pipermail/llvm-dev/2018-April/122469.html
>
>
>
> There is no fundamental technical reasons why this cannot happen but it
> requires lots of work to fine tuning the pipeline (yes, fullLTO and thinLTO
> uses different pipeline) so that it reaches a good balance of
> performance/build overhead for general users.
>
>
>
> Steven
>
>
>
> On Apr 13, 2021, at 2:23 PM, David Blaikie via llvm-dev <
> llvm-dev at lists.llvm.org> wrote:
>
>
>
> +Matthew and Teresa for any context they might have
>
> High level sounds like a reasonable thing to me, for what it's worth.
>
> On Tue, Apr 13, 2021 at 2:19 PM Petr Hosek via llvm-dev
> <llvm-dev at lists.llvm.org> wrote:
>
>
> We're using regular LTO for our Clang toolchain build because we
don't
> mind spending more resources to squeeze out as much performance as
> possible. However, when looking into our build performance, I've
noticed
> that we only spent about 1/3 of the total build time in building
> distribution components, the rest is spent on building unit tests and tools
> that are only used by lit tests. For the latter, we don't care about
the
> performance, so it'd be nice to avoid doing regular LTO to speed up the
> build.
>
> The idea I had would be to use a single LTO bitcode format for all
> translation units, and then decide only at link time whether to use regular
> LTO for distribution components or ThinLTO for everything else.
>
> After doing some research, I found the "Supporting Regular and Thin
LTO
> with a Single LTO Bitcode Format" talk presented by Matthew Voss at
LLVM
> Developers’ Meeting 2019 which does exactly what I described, but it seems
> like this was only implemented downstream.
>
> Has there been any progress on upstreaming the implementation? Is there
> any way to do what I described using the in-tree LTO implementation?
> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org
> https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>
> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org
> https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>
>
> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org
> https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20210413/949c4e17/attachment.html>

Teresa Johnson via llvm-dev

2021-Apr-13 22:04 UTC

head link

[llvm-dev] Supporting Regular and Thin LTO with a Single LTO Bitcode Format

Since we do get a module summary even for regular LTO (with metadata added
to indicate whether the LTO link should do regular or ThinLTO), I think the
issue is just the divergent pass managers, which is also discussed in the
thread Steven pointed to. Matthew - can you remind me of the different
summary issue?

Petr, in your case where you don't care about the performance (so much)
when running in a ThinLTO mode since it is just tests, presumably the
different pipeline issue isn't a big deal.

Teresa

On Tue, Apr 13, 2021 at 2:46 PM <Matthew.Voss at sony.com> wrote:
> Hi Petr,
>
>
>
> This does sound like a good use case for our pipeline. We’ve seen good
> runtime performance overall, as we stated in the talk. I’ve been working on
> upstreaming our patches off and on for a couple months now. Our pipeline
> needs to be ported to the NPM, but once that work is done, the patch is
> ready for review. I should be able to finish that work within the next
> month or two and would love to get some feedback on our approach.
>
>
>
> Thanks,
>
> Matthew
>
>
>
>
>
> *From:* Steven Wu <stevenwu at apple.com>
> *Sent:* Tuesday, April 13, 2021 2:39 PM
> *To:* David Blaikie <dblaikie at gmail.com>
> *Cc:* Petr Hosek <phosek at google.com>; Voss, Matthew
<Matthew.Voss at sony.com>;
> Teresa Johnson <tejohnson at google.com>; llvm-dev <llvm-dev at
lists.llvm.org>
> *Subject:* Re: [llvm-dev] Supporting Regular and Thin LTO with a Single
> LTO Bitcode Format
>
>
>
> This is a really good thread to read:
> https://lists.llvm.org/pipermail/llvm-dev/2018-April/122469.html
>
>
>
> There is no fundamental technical reasons why this cannot happen but it
> requires lots of work to fine tuning the pipeline (yes, fullLTO and thinLTO
> uses different pipeline) so that it reaches a good balance of
> performance/build overhead for general users.
>
>
>
> Steven
>
>
>
> On Apr 13, 2021, at 2:23 PM, David Blaikie via llvm-dev <
> llvm-dev at lists.llvm.org> wrote:
>
>
>
> +Matthew and Teresa for any context they might have
>
> High level sounds like a reasonable thing to me, for what it's worth.
>
> On Tue, Apr 13, 2021 at 2:19 PM Petr Hosek via llvm-dev
> <llvm-dev at lists.llvm.org> wrote:
>
>
> We're using regular LTO for our Clang toolchain build because we
don't
> mind spending more resources to squeeze out as much performance as
> possible. However, when looking into our build performance, I've
noticed
> that we only spent about 1/3 of the total build time in building
> distribution components, the rest is spent on building unit tests and tools
> that are only used by lit tests. For the latter, we don't care about
the
> performance, so it'd be nice to avoid doing regular LTO to speed up the
> build.
>
> The idea I had would be to use a single LTO bitcode format for all
> translation units, and then decide only at link time whether to use regular
> LTO for distribution components or ThinLTO for everything else.
>
> After doing some research, I found the "Supporting Regular and Thin
LTO
> with a Single LTO Bitcode Format" talk presented by Matthew Voss at
LLVM
> Developers’ Meeting 2019 which does exactly what I described, but it seems
> like this was only implemented downstream.
>
> Has there been any progress on upstreaming the implementation? Is there
> any way to do what I described using the in-tree LTO implementation?
> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org
> https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>
> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org
> https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>
>
>

-- 
Teresa Johnson |  Software Engineer |  tejohnson at google.com |
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20210413/43deba11/attachment.html>

llvm dev - Apr 2021 - Supporting Regular and Thin LTO with a Single LTO Bitcode Format

[llvm-dev] Supporting Regular and Thin LTO with a Single LTO Bitcode Format

[llvm-dev] Supporting Regular and Thin LTO with a Single LTO Bitcode Format

[llvm-dev] Supporting Regular and Thin LTO with a Single LTO Bitcode Format