I think that we should think about the relationship between this
proposed mechanism and the existing mechanism that we have for emitting
and capturing optimization remarks. In some sense, I feel like we
already have a lot of this capability (e.g., llc has -remarks-section).

 -Hal

On 8/5/20 5:51 PM, Johannes Doerfert via llvm-dev wrote:
> I like the ability, not sure about the proposed implementation though.
>
> Did you consider a flag that redirects `llvm::outs()` and `llvm::errs()`
> into sections of the object file instead? So, you'd say:
>
> `clang ... -mllvm -debug-only=inline ... -mllvm -dump-section=.dump`
>
> and you'd get the regular debug output nicely ordered in the `.dump`
> section.
>
> I mainly want to avoid even more output code in the passes but also be
> able to collect at least that information. That doesn't mean we
> couldn't add another output stream that would always/only redirect
> into the sections.
>
> ~ Johannes
>
> On 8/5/20 5:36 PM, Kazu Hirata via llvm-dev wrote:
>> Introduction
>> ============
>>
>> This RFC proposes a mechanism to dump arbitrary messages into object
>> files during compilation and retrieve them from the final executable.
>>
>> Background
>> ==========
>>
>> We often need to collect information from all object files of
>> applications. For example:
>>
>> - Mircea Trofin needs to collect information from the function
>>   inlining pass so that he can train the machine learning model with
>>   the information.
>>
>> - I sometimes need to dump messages from optimization passes to see
>>   where and how they trigger.
>>
>> Now, this process becomes challenging when we build large applications
>> with a build system that caches and distributes compilation jobs. If
>> we were to dump messages to stderr, we would have to be careful not to
>> interleave messages from multiple object files. If we were to modify
>> a source file, we would have to flush the cache and rebuild the entire
>> application to collect dump messages from all relevant object files.
>>
>> High Level Design
>> =================
>>
>> - LLVM: We provide machinery for individual passes to dump arbitrary
>>   messages into a special ELF section in a compressed manner.
>>
>> - Linker: We simply concatenate the contents of the special ELF
>>   section. No change is needed.
>>
>> - llvm-readobj: We add an option to retrieve the contents of the
>>   special ELF section.
>>
>> Detailed Design
>> ===============
>>
>> DumpAccumulator analysis pass
>> -----------------------------
>>
>> We create a new analysis pass called DumpAccumulator. We add the
>> analysis pass right at the beginning of the pass pipeline. The new
>> analysis pass holds the dump messages throughout the pass pipeline.
>>
>> If you would like to dump messages from some pass, you would obtain
>> the result of DumpAccumulator in the pass:
>>
>>   DumpAccumulator::Result *DAR =
>>       MAMProxy.getCachedResult<DumpAccumulator>(M);
>>
>> Then dump messages:
>>
>>   if (DAR) {
>>     DAR->Message += "Processing ";
>>     DAR->Message += F.getName();
>>     DAR->Message += "\n";
>>   }
>>
>> AsmPrinter
>> ----------
>>
>> We dump the messages from DumpAccumulator into a section called
>> ".llvm_dump" in a compressed manner. Specifically, the section
>> contains:
>>
>> - LEB128 encoding of the original size in bytes
>> - LEB128 encoding of the compressed size in bytes
>> - the message compressed by zlib::compress
>>
>> in that order.
>>
>> llvm-readobj
>> ------------
>>
>> We read the .llvm_dump section. We dump each chunk of compressed data
>> one after another.
>>
>> Existing Implementation
>> =======================
>>
>> https://reviews.llvm.org/D84473
>>
>> Future Directions
>> =================
>>
>> The proposal above does not support the ThinLTO build flow. To
>> support that, I am thinking about putting the message as metadata in
>> the IR at the prelink stage.
>>
>> Thoughts?
>>
>> _______________________________________________
>> LLVM Developers mailing list
>> llvm-dev at lists.llvm.org
>> https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev

--
Hal Finkel
Lead, Compiler Technology and Programming Languages
Leadership Computing Facility
Argonne National Laboratory
Does this also support having relocations in the .llvm_dump section,
like a .rela.llvm_dump? Because we (Rahman) are now prototyping dumping
function and basic block symbols in a designated section, which seems
similar to what you propose, except that we need to embed relocations
in the section and let the linker do the relocation resolving.

Thanks,
Han

On Wed, Aug 5, 2020 at 4:23 PM Hal Finkel via llvm-dev
<llvm-dev at lists.llvm.org> wrote:
> I think that we should think about the relationship between this
> proposed mechanism and the existing mechanism that we have for emitting
> and capturing optimization remarks. In some sense, I feel like we
> already have a lot of this capability (e.g., llc has -remarks-section).
>
> -Hal

--
Han Shen | Software Engineer | shenhan at google.com | +1-650-440-3330
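On the reading side, the RFC's llvm-readobj behavior — walking the concatenated `.llvm_dump` blob chunk by chunk — could look roughly like the following standalone sketch. This is not LLVM's actual code: the `Chunk` struct and `readDumpSection` name are invented for illustration, and decompression is omitted (each element carries the raw payload plus its advertised original size).

```cpp
#include <cstdint>
#include <stdexcept>
#include <string>
#include <vector>

// Decode one ULEB128 value starting at Pos; advances Pos past it.
static uint64_t decodeULEB128(const std::string &Data, size_t &Pos) {
  uint64_t Value = 0;
  unsigned Shift = 0;
  while (true) {
    if (Pos >= Data.size())
      throw std::runtime_error("truncated ULEB128");
    uint8_t Byte = static_cast<uint8_t>(Data[Pos++]);
    Value |= static_cast<uint64_t>(Byte & 0x7f) << Shift;
    if ((Byte & 0x80) == 0)
      break;
    Shift += 7;
  }
  return Value;
}

// One decoded chunk: the advertised uncompressed size and the raw
// (still-compressed) payload bytes.
struct Chunk {
  uint64_t OriginalSize;
  std::string Payload;
};

// Split a concatenated .llvm_dump blob into its per-object chunks.
std::vector<Chunk> readDumpSection(const std::string &Blob) {
  std::vector<Chunk> Chunks;
  size_t Pos = 0;
  while (Pos < Blob.size()) {
    uint64_t Orig = decodeULEB128(Blob, Pos); // original size
    uint64_t Comp = decodeULEB128(Blob, Pos); // compressed size
    if (Pos + Comp > Blob.size())
      throw std::runtime_error("truncated chunk payload");
    Chunks.push_back({Orig, Blob.substr(Pos, Comp)});
    Pos += Comp;
  }
  return Chunks;
}
```

Note that this loop only works because every chunk records its own compressed length up front; the reader never needs per-object-file boundaries, which the linker discards when it concatenates sections.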
Mircea Trofin via llvm-dev
2020-Aug-06 15:55 UTC
[llvm-dev] [RFC] Introduce Dump Accumulator
Thanks - it's an llvm-wide feature, not just llc-only, from what I can
tell - which is good (llc-only wouldn't have helped us much, we don't
build with llc :) ).

We should take a look, indeed.

I was scanning the documentation, maybe I missed it - so that we better
understand the feature, do you happen to know what the motivating
scenarios were for this, and how one would read the section in the
resulting binary (i.e., a custom binary reader, or llvm-readobj)?

Thanks!

On Wed, Aug 5, 2020 at 4:23 PM Hal Finkel <hfinkel at anl.gov> wrote:
> I think that we should think about the relationship between this
> proposed mechanism and the existing mechanism that we have for emitting
> and capturing optimization remarks. In some sense, I feel like we
> already have a lot of this capability (e.g., llc has -remarks-section).
>
> -Hal
On 8/6/20 10:55 AM, Mircea Trofin wrote:
> Thanks - it's an llvm-wide feature, not just llc-only, from what I can
> tell - which is good (llc-only wouldn't have helped us much, we don't
> build with llc :) ).

Exactly. The optimization-remark infrastructure was designed to be
flexible in this regard. Remarks are classes that can be directly fed
to a frontend handler (for direct interpretation and/or for display to
the user), can be serialized in YAML and read later, etc.

> We should take a look, indeed.
>
> I was scanning the documentation, maybe I missed it - so that we better
> understand the feature, do you happen to know what the motivating
> scenarios were for this, and how one would read the section in the
> resulting binary (i.e., a custom binary reader, or llvm-readobj)?

My recollection is that the motivation is so that optimization remarks,
in their machine-readable/serialized form, can be collected, stored in
the binary, and then displayed, along with profiling information, in
tools that help with performance profiling and analysis.

 -Hal

--
Hal Finkel
Lead, Compiler Technology and Programming Languages
Leadership Computing Facility
Argonne National Laboratory