thr3ads.net - llvm dev - [llvm-dev] Next steps for optimization remarks? [Jul 2017]

If this information is useful, please help other people find it:
Share via:

Adam Nemet via llvm-dev

2017-Jul-14 17:10 UTC

[llvm-dev] Next steps for optimization remarks?

> On Jul 14, 2017, at 8:21 AM, Davide Italiano via llvm-dev <llvm-dev at
lists.llvm.org> wrote:
> 
> On Mon, Jun 19, 2017 at 4:13 PM, Brian Gesiak via llvm-dev
> <llvm-dev at lists.llvm.org <mailto:llvm-dev at
lists.llvm.org>> wrote:
>> Hello all,
>> 
>> In https://www.youtube.com/watch?v=qq0q1hfzidg, Adam Nemet (cc'ed)
describes
>> optimization remarks and some future plans for the project. I had a few
>> follow-up questions:
>> 
>> 1. As an example of future work to be done, the talk mentions expanding
the
>> set of optimization passes that emit remarks. However, the Clang User
Manual
>> mentions that "optimization remarks do not really make sense
outside of the
>> major transformations (e.g.: inlining, vectorization, loop
optimizations)."
>> [1] I am wondering: which passes exist today that are most in need of
>> supporting optimization remarks? Should all passes emit optimization
>> remarks, or are there indeed passes for which optimization remarks
"do not
>> make sense"?
>> 
>> 2. I tried running llvm/utils/opt-viewer/opt-viewer.py to produce an
HTML
>> dashboard for the optimization remark YAML generated from a large C++
>> program. Unfortunately, the Python script does not finish, even after
over
>> an hour of processing. It appears performance has been brought up
before by
>> Bob Haarman (cc'ed), and some optimizations have been made since.
[2] I
>> wonder if I'm passing in bad input (6,000+ YAML files -- too
many?), or if
>> there's still some room to speed up the opt-viewer.py script? I
tried the
>> C++ implementation as well, but that never completed either. [3]
>> 
>> Overall I'm excited to make greater use of optimization remarks,
and to
>> contribute in any way I can. Please let me know if you have any
thoughts on
>> my questions above!
>> 
> 
> Hi,
> I've been asked at $WORK to take a look at `-opt-remarks` , so here
> are a couple of thoughts.
> 
> 1) When LTO is on, the output isn't particularly easy to read. I guess
> this can be mitigated with some filtering approach, I and Simon
> discussed it offline.
Can you please elaborate?
> 
> 2) Yes, indeed `opt-viewer` takes forever for large testcases to
> process. I think that it could lead to exploring a better
> representation than YAML which is, indeed, a little slow to parse. To
> be honest, I'm torn about this.
> YAML is definitely really convenient as we already use it somewhere in
> tree, and it has an easy textual repr. OTOH, it doesn't seem to scale
> that nicely.
Agreed.  We now have a mitigation strategy with -pass-remarks-hotness-threshold
but this is something that we may have to solve in the long run.
> 
> 3) There are lots of optimizations which are still missing from the
> output, in particular PGO remarks (including, e.g. branch info
> probabilities which still use the old API as far as I can tell
> [PGOInstrumentation.cpp])
Yes, how about we file bugs for each pass that still uses the old API (I am
looking at ICP today) and then we can split up the work and then finally remove
the old API?

Also on exposing PGO info, I have a patch that adds a pass I call
HotnessDecorator.  The pass emits a remark for each basic block.  Then
opt-viewer is made aware of these and the remarks are special-cased to show
hotness for a line unless there is already a remark on the line.  The idea is
that since we only show hotness as part of the remark if a block does not
contain a remark we don’t see its hotness.  E.g.:


> 
> 4) `opt-remarks` heavily relies on the fidelity of the DebugLoc
> attached to instructions. Things get a little hairy at -O3 (or with
> -flto) because there are optimizations bugs so transformations don't
> preserve debuginfo. This is not entirely orthogonal but something can
> be worked on in parallel (bonus point, this would also help SamplePGO
> & debuginfo experience). With `-flto` the problem gets amplified more,
> as expected.
> 
> 5) I found a couple of issue when trying the support, but I'm actively
> working on them.
> https://bugs.llvm.org/show_bug.cgi?id=33773
<https://bugs.llvm.org/show_bug.cgi?id=33773>
> https://bugs.llvm.org/show_bug.cgi?id=33776
<https://bugs.llvm.org/show_bug.cgi?id=33776>
> 
> That said, I think optimization remarks support is coming along nicely.
Yes, I’ve been really happy with the progress.  Thanks for all the help from
everybody!

Adam
> 
> --
> Davide
> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org <mailto:llvm-dev at lists.llvm.org>
> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
<http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20170714/4cb96531/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: PastedGraphic-1.tiff
Type: image/tiff
Size: 58614 bytes
Desc: not available
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20170714/4cb96531/attachment-0001.tiff>

Davide Italiano via llvm-dev

2017-Jul-14 17:22 UTC

head link

[llvm-dev] Next steps for optimization remarks?

On Fri, Jul 14, 2017 at 10:10 AM, Adam Nemet <anemet at apple.com>
wrote:>
>
> On Jul 14, 2017, at 8:21 AM, Davide Italiano via llvm-dev <llvm-dev at
lists.llvm.org> wrote:
>
> On Mon, Jun 19, 2017 at 4:13 PM, Brian Gesiak via llvm-dev
> <llvm-dev at lists.llvm.org> wrote:
>
> Hello all,
>
> In https://www.youtube.com/watch?v=qq0q1hfzidg, Adam Nemet (cc'ed)
describes
> optimization remarks and some future plans for the project. I had a few
> follow-up questions:
>
> 1. As an example of future work to be done, the talk mentions expanding the
> set of optimization passes that emit remarks. However, the Clang User
Manual
> mentions that "optimization remarks do not really make sense outside
of the
> major transformations (e.g.: inlining, vectorization, loop
optimizations)."
> [1] I am wondering: which passes exist today that are most in need of
> supporting optimization remarks? Should all passes emit optimization
> remarks, or are there indeed passes for which optimization remarks "do
not
> make sense"?
>
> 2. I tried running llvm/utils/opt-viewer/opt-viewer.py to produce an HTML
> dashboard for the optimization remark YAML generated from a large C++
> program. Unfortunately, the Python script does not finish, even after over
> an hour of processing. It appears performance has been brought up before by
> Bob Haarman (cc'ed), and some optimizations have been made since. [2] I
> wonder if I'm passing in bad input (6,000+ YAML files -- too many?), or
if
> there's still some room to speed up the opt-viewer.py script? I tried
the
> C++ implementation as well, but that never completed either. [3]
>
> Overall I'm excited to make greater use of optimization remarks, and to
> contribute in any way I can. Please let me know if you have any thoughts on
> my questions above!
>
>
> Hi,
> I've been asked at $WORK to take a look at `-opt-remarks` , so here
> are a couple of thoughts.
>
> 1) When LTO is on, the output isn't particularly easy to read. I guess
> this can be mitigated with some filtering approach, I and Simon
> discussed it offline.
>
>
> Can you please elaborate?
>
The issue is twofold:
1) With LTO, the number of remarks generated skyrockets because whole
module visibility makes IPO more effective (i.e. you end up inlining
much more etc..). As a side effect, more aggressive inlining/IPCP
expose more intraprocedural optimizations which in turn generates more
remarks.
2) As pointed out earlier, DI is not always reliable.
>
>
> 2) Yes, indeed `opt-viewer` takes forever for large testcases to
> process. I think that it could lead to exploring a better
> representation than YAML which is, indeed, a little slow to parse. To
> be honest, I'm torn about this.
> YAML is definitely really convenient as we already use it somewhere in
> tree, and it has an easy textual repr. OTOH, it doesn't seem to scale
> that nicely.
>
>
> Agreed.  We now have a mitigation strategy with
-pass-remarks-hotness-threshold but this is something that we may have to solve
in the long run.
>
At some point, I guess we might just slowly moving away
from>
>
> 3) There are lots of optimizations which are still missing from the
> output, in particular PGO remarks (including, e.g. branch info
> probabilities which still use the old API as far as I can tell
> [PGOInstrumentation.cpp])
>
>
> Yes, how about we file bugs for each pass that still uses the old API (I am
looking at ICP today) and then we can split up the work and then finally remove
the old API?
>
That sounds like a plan.
> Also on exposing PGO info, I have a patch that adds a pass I call
HotnessDecorator.  The pass emits a remark for each basic block.  Then
opt-viewer is made aware of these and the remarks are special-cased to show
hotness for a line unless there is already a remark on the line.  The idea is
that since we only show hotness as part of the remark if a block does not
contain a remark we don’t see its hotness.  E.g.:
>
>
Yes, feel free to post for review once you have it
ready.>
>
> 4) `opt-remarks` heavily relies on the fidelity of the DebugLoc
> attached to instructions. Things get a little hairy at -O3 (or with
> -flto) because there are optimizations bugs so transformations don't
> preserve debuginfo. This is not entirely orthogonal but something can
> be worked on in parallel (bonus point, this would also help SamplePGO
> & debuginfo experience). With `-flto` the problem gets amplified more,
> as expected.
>
> 5) I found a couple of issue when trying the support, but I'm actively
> working on them.
> https://bugs.llvm.org/show_bug.cgi?id=33773
> https://bugs.llvm.org/show_bug.cgi?id=33776
>
> That said, I think optimization remarks support is coming along nicely.
>
>
> Yes, I’ve been really happy with the progress.  Thanks for all the help
from everybody!
At some point, I guess we might just consider the HTML generated
report as a fallback and having the opt-remarks more integrated in the
developer's workflow.
I personally use Visual studio daily to compile clang and it would be
nice to have remarks there as a plugin. I can imagine something
similar happening for XCode/CLion/Emacs etc..

Thanks,

--
Davide

Adam Nemet via llvm-dev

2017-Jul-14 17:32 UTC

head link

[llvm-dev] Next steps for optimization remarks?

> On Jul 14, 2017, at 10:22 AM, Davide Italiano <davide at freebsd.org>
wrote:
> 
> On Fri, Jul 14, 2017 at 10:10 AM, Adam Nemet <anemet at apple.com
<mailto:anemet at apple.com>> wrote:
>> 
>> 
>> On Jul 14, 2017, at 8:21 AM, Davide Italiano via llvm-dev <llvm-dev
at lists.llvm.org> wrote:
>> 
>> On Mon, Jun 19, 2017 at 4:13 PM, Brian Gesiak via llvm-dev
>> <llvm-dev at lists.llvm.org> wrote:
>> 
>> Hello all,
>> 
>> In https://www.youtube.com/watch?v=qq0q1hfzidg, Adam Nemet (cc'ed)
describes
>> optimization remarks and some future plans for the project. I had a few
>> follow-up questions:
>> 
>> 1. As an example of future work to be done, the talk mentions expanding
the
>> set of optimization passes that emit remarks. However, the Clang User
Manual
>> mentions that "optimization remarks do not really make sense
outside of the
>> major transformations (e.g.: inlining, vectorization, loop
optimizations)."
>> [1] I am wondering: which passes exist today that are most in need of
>> supporting optimization remarks? Should all passes emit optimization
>> remarks, or are there indeed passes for which optimization remarks
"do not
>> make sense"?
>> 
>> 2. I tried running llvm/utils/opt-viewer/opt-viewer.py to produce an
HTML
>> dashboard for the optimization remark YAML generated from a large C++
>> program. Unfortunately, the Python script does not finish, even after
over
>> an hour of processing. It appears performance has been brought up
before by
>> Bob Haarman (cc'ed), and some optimizations have been made since.
[2] I
>> wonder if I'm passing in bad input (6,000+ YAML files -- too
many?), or if
>> there's still some room to speed up the opt-viewer.py script? I
tried the
>> C++ implementation as well, but that never completed either. [3]
>> 
>> Overall I'm excited to make greater use of optimization remarks,
and to
>> contribute in any way I can. Please let me know if you have any
thoughts on
>> my questions above!
>> 
>> 
>> Hi,
>> I've been asked at $WORK to take a look at `-opt-remarks` , so here
>> are a couple of thoughts.
>> 
>> 1) When LTO is on, the output isn't particularly easy to read. I
guess
>> this can be mitigated with some filtering approach, I and Simon
>> discussed it offline.
>> 
>> 
>> Can you please elaborate?
>> 
> 
> The issue is twofold:
> 1) With LTO, the number of remarks generated skyrockets because whole
> module visibility makes IPO more effective (i.e. you end up inlining
> much more etc..). As a side effect, more aggressive inlining/IPCP
> expose more intraprocedural optimizations which in turn generates more
> remarks.
Ah ok, you meant increased quantity.  Sure.  On inlining there is actually a
low-hanging fruit: https://bugs.llvm.org/show_bug.cgi?id=33786
<https://bugs.llvm.org/show_bug.cgi?id=33786>

> 2) As pointed out earlier, DI is not always reliable.
> 
>> 
>> 
>> 2) Yes, indeed `opt-viewer` takes forever for large testcases to
>> process. I think that it could lead to exploring a better
>> representation than YAML which is, indeed, a little slow to parse. To
>> be honest, I'm torn about this.
>> YAML is definitely really convenient as we already use it somewhere in
>> tree, and it has an easy textual repr. OTOH, it doesn't seem to
scale
>> that nicely.
>> 
>> 
>> Agreed.  We now have a mitigation strategy with
-pass-remarks-hotness-threshold but this is something that we may have to solve
in the long run.
>> 
> 
> At some point, I guess we might just slowly moving away from
>> 
>> 
>> 3) There are lots of optimizations which are still missing from the
>> output, in particular PGO remarks (including, e.g. branch info
>> probabilities which still use the old API as far as I can tell
>> [PGOInstrumentation.cpp])
>> 
>> 
>> Yes, how about we file bugs for each pass that still uses the old API
(I am looking at ICP today) and then we can split up the work and then finally
remove the old API?
>> 
> 
> That sounds like a plan.
> 
>> Also on exposing PGO info, I have a patch that adds a pass I call
HotnessDecorator.  The pass emits a remark for each basic block.  Then
opt-viewer is made aware of these and the remarks are special-cased to show
hotness for a line unless there is already a remark on the line.  The idea is
that since we only show hotness as part of the remark if a block does not
contain a remark we don’t see its hotness.  E.g.:
>> 
>> 
> 
> Yes, feel free to post for review once you have it ready.
Will do.
>> 
>> 
>> 4) `opt-remarks` heavily relies on the fidelity of the DebugLoc
>> attached to instructions. Things get a little hairy at -O3 (or with
>> -flto) because there are optimizations bugs so transformations
don't
>> preserve debuginfo. This is not entirely orthogonal but something can
>> be worked on in parallel (bonus point, this would also help SamplePGO
>> & debuginfo experience). With `-flto` the problem gets amplified
more,
>> as expected.
>> 
>> 5) I found a couple of issue when trying the support, but I'm
actively
>> working on them.
>> https://bugs.llvm.org/show_bug.cgi?id=33773
>> https://bugs.llvm.org/show_bug.cgi?id=33776
>> 
>> That said, I think optimization remarks support is coming along nicely.
>> 
>> 
>> Yes, I’ve been really happy with the progress.  Thanks for all the help
from everybody!
> 
> At some point, I guess we might just consider the HTML generated
> report as a fallback and having the opt-remarks more integrated in the
> developer's workflow.
> I personally use Visual studio daily to compile clang and it would be
> nice to have remarks there as a plugin. I can imagine something
> similar happening for XCode/CLion/Emacs etc..
Exactly.

Adam
> 
> Thanks,
> 
> --
> Davide
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20170714/fb0f8a94/attachment.html>

Adam Nemet via llvm-dev

2017-Jul-14 17:52 UTC

head link

[llvm-dev] Next steps for optimization remarks?

> On Jul 14, 2017, at 10:22 AM, Davide Italiano <davide at freebsd.org>
wrote:
> 
> On Fri, Jul 14, 2017 at 10:10 AM, Adam Nemet <anemet at apple.com
<mailto:anemet at apple.com>> wrote:
>> 
>> 
>> On Jul 14, 2017, at 8:21 AM, Davide Italiano via llvm-dev <llvm-dev
at lists.llvm.org> wrote:
>> 
>> On Mon, Jun 19, 2017 at 4:13 PM, Brian Gesiak via llvm-dev
>> <llvm-dev at lists.llvm.org> wrote:
>> 
>> Hello all,
>> 
>> In https://www.youtube.com/watch?v=qq0q1hfzidg, Adam Nemet (cc'ed)
describes
>> optimization remarks and some future plans for the project. I had a few
>> follow-up questions:
>> 
>> 1. As an example of future work to be done, the talk mentions expanding
the
>> set of optimization passes that emit remarks. However, the Clang User
Manual
>> mentions that "optimization remarks do not really make sense
outside of the
>> major transformations (e.g.: inlining, vectorization, loop
optimizations)."
>> [1] I am wondering: which passes exist today that are most in need of
>> supporting optimization remarks? Should all passes emit optimization
>> remarks, or are there indeed passes for which optimization remarks
"do not
>> make sense"?
>> 
>> 2. I tried running llvm/utils/opt-viewer/opt-viewer.py to produce an
HTML
>> dashboard for the optimization remark YAML generated from a large C++
>> program. Unfortunately, the Python script does not finish, even after
over
>> an hour of processing. It appears performance has been brought up
before by
>> Bob Haarman (cc'ed), and some optimizations have been made since.
[2] I
>> wonder if I'm passing in bad input (6,000+ YAML files -- too
many?), or if
>> there's still some room to speed up the opt-viewer.py script? I
tried the
>> C++ implementation as well, but that never completed either. [3]
>> 
>> Overall I'm excited to make greater use of optimization remarks,
and to
>> contribute in any way I can. Please let me know if you have any
thoughts on
>> my questions above!
>> 
>> 
>> Hi,
>> I've been asked at $WORK to take a look at `-opt-remarks` , so here
>> are a couple of thoughts.
>> 
>> 1) When LTO is on, the output isn't particularly easy to read. I
guess
>> this can be mitigated with some filtering approach, I and Simon
>> discussed it offline.
>> 
>> 
>> Can you please elaborate?
>> 
> 
> The issue is twofold:
> 1) With LTO, the number of remarks generated skyrockets because whole
> module visibility makes IPO more effective (i.e. you end up inlining
> much more etc..). As a side effect, more aggressive inlining/IPCP
> expose more intraprocedural optimizations which in turn generates more
> remarks.
> 2) As pointed out earlier, DI is not always reliable.
> 
>> 
>> 
>> 2) Yes, indeed `opt-viewer` takes forever for large testcases to
>> process. I think that it could lead to exploring a better
>> representation than YAML which is, indeed, a little slow to parse. To
>> be honest, I'm torn about this.
>> YAML is definitely really convenient as we already use it somewhere in
>> tree, and it has an easy textual repr. OTOH, it doesn't seem to
scale
>> that nicely.
>> 
>> 
>> Agreed.  We now have a mitigation strategy with
-pass-remarks-hotness-threshold but this is something that we may have to solve
in the long run.
>> 
> 
> At some point, I guess we might just slowly moving away from
>> 
>> 
>> 3) There are lots of optimizations which are still missing from the
>> output, in particular PGO remarks (including, e.g. branch info
>> probabilities which still use the old API as far as I can tell
>> [PGOInstrumentation.cpp])
>> 
>> 
>> Yes, how about we file bugs for each pass that still uses the old API
(I am looking at ICP today) and then we can split up the work and then finally
remove the old API?
>> 
> 
> That sounds like a plan.
Filed https://bugs.llvm.org/show_bug.cgi?id=33789
<https://bugs.llvm.org/show_bug.cgi?id=33789> to remove the old API and
blockers for the 7 passes that need to be migrated.  Anybody wanting to help
with this, please feel free to grab any of the bugs.

Thanks!
Adam
> 
>> Also on exposing PGO info, I have a patch that adds a pass I call
HotnessDecorator.  The pass emits a remark for each basic block.  Then
opt-viewer is made aware of these and the remarks are special-cased to show
hotness for a line unless there is already a remark on the line.  The idea is
that since we only show hotness as part of the remark if a block does not
contain a remark we don’t see its hotness.  E.g.:
>> 
>> 
> 
> Yes, feel free to post for review once you have it ready.
>> 
>> 
>> 4) `opt-remarks` heavily relies on the fidelity of the DebugLoc
>> attached to instructions. Things get a little hairy at -O3 (or with
>> -flto) because there are optimizations bugs so transformations
don't
>> preserve debuginfo. This is not entirely orthogonal but something can
>> be worked on in parallel (bonus point, this would also help SamplePGO
>> & debuginfo experience). With `-flto` the problem gets amplified
more,
>> as expected.
>> 
>> 5) I found a couple of issue when trying the support, but I'm
actively
>> working on them.
>> https://bugs.llvm.org/show_bug.cgi?id=33773
>> https://bugs.llvm.org/show_bug.cgi?id=33776
>> 
>> That said, I think optimization remarks support is coming along nicely.
>> 
>> 
>> Yes, I’ve been really happy with the progress.  Thanks for all the help
from everybody!
> 
> At some point, I guess we might just consider the HTML generated
> report as a fallback and having the opt-remarks more integrated in the
> developer's workflow.
> I personally use Visual studio daily to compile clang and it would be
> nice to have remarks there as a plugin. I can imagine something
> similar happening for XCode/CLion/Emacs etc..
> 
> Thanks,
> 
> --
> Davide
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20170714/ba46ae50/attachment-0001.html>

llvm dev - Jul 2017 - Next steps for optimization remarks?

[llvm-dev] Next steps for optimization remarks?

[llvm-dev] Next steps for optimization remarks?

[llvm-dev] Next steps for optimization remarks?

[llvm-dev] Next steps for optimization remarks?