Vedant Kumar via llvm-dev
2018-Jun-20  00:46 UTC
[llvm-dev] [RFC] Removing debug locations from ConstantSDNodes
Some time ago I saw a report of strange single-stepping behavior given Swift
code that looks like this:
1|  var input = [16, 14, 10, 9, 8, 7, 4, 3, 2, 1] //< Only number 1 is
relevant.
2|  print("start")
TL;DR: The debugger steps from line 2 -> 1 -> 2 in this code example. One
clean way to fix this bug is to remove debug locations from ConstantSDNodes, and
I'm asking if there are any objections to doing that.
In more detail:
The problem appears to be that we assign a debug location to the ConstantSDNode
for "1". When this constant is used again (as it happens to be in the
lowered version of the call to print()) the debugger steps back to the first use
of the constant. Here's a snippet of MIR output that illustrates the
problem:
	%19:gr64 = ADD64ri8 %18, 8, implicit-def dead %eflags; GR64:%19,%18
dbg:destroy-after-foreach.swift:2:7
	%20:gr32 = MOV32ri64 1; GR32:%20 dbg:destroy-after-foreach.swift:1:44
	%21:gr64 = SUBREG_TO_REG 0, killed %20, sub_32bit; GR64:%21 GR32:%20
dbg:destroy-after-foreach.swift:1:44
	%rdi = COPY %21; GR64:%21 dbg:destroy-after-foreach.swift:2:7
The out-of-order stepping behavior is confusing and unexpected to users. ISTM
that the simplest way to fix this bug is to not attach debug locations to
constant SDNodes. This matches the model used by IR (Constants don't have
DebugLocs). As an added benefit, it would let us delete a fair amount of code
which traffics in SDLocs. I'm not sure what (if any) impact this would have
on sampling-based profilers. SamplePGO, at least, has smoothing methods
specifically designed to paper over this sort of "out-of-place" debug
location issue (see
https://storage.googleapis.com/pub-tools-public-publication-data/pdf/45290.pdf
<https://storage.googleapis.com/pub-tools-public-publication-data/pdf/45290.pdf>,
Section 4.1.4, "Sources of Profile Inaccuracies").
I brought all of this up a while ago on IRC: some feedback was positive, but I
didn't get a clear +1 or -1 to proceed.
Please let me know what you think, if there are cleaner alternatives to
consider, etc. Any comments/feedback would be appreciated.
Someone (Reid?) mentioned that we could try sinking constants to their point of
first use as an alternative, and (IIUC) create new nodes with distinct DebugLocs
for each use of a constant. I don't recall the details of that alternative
clearly. Based on my (likely incorrect) understanding of it, dropping locations
from constants outright might be simpler.
thanks,
vedant
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20180619/edf6fd5f/attachment.html>
Reid Kleckner via llvm-dev
2018-Jun-20  01:36 UTC
[llvm-dev] [RFC] Removing debug locations from ConstantSDNodes
On Tue, Jun 19, 2018 at 5:46 PM Vedant Kumar <vsk at apple.com> wrote:> Someone (Reid?) mentioned that we could try sinking constants to their > point of first use as an alternative, and (IIUC) create new nodes with > distinct DebugLocs for each use of a constant. I don't recall the details > of that alternative clearly. Based on my (likely incorrect) understanding > of it, dropping locations from constants outright might be simpler. >Our use case was in fastisel, so things are different. I don't think my solution will help you. In our case, users were complaining about code like this: volatile int do_something; void f() { ++do_something; foo(1, 2, 3); ++do_something; } We'd generate locations like: .loc line 1 incl do_something(%rip) movl $1, %ecx movl $2, %edx movl $3, %r8d .loc line 2 # line 2 starts here, instead of 3 instructions earlier callq foo .loc line 3 incl do_something(%rip) Our users really wanted the line table entry for line 2 to include the constant materialization code for some VS debugger feature. I think if you remove locations from ConstantSDNodes, you might want to add a late pass that propagates source locations backwards onto location-less instructions. This would also avoid some special cases when a basic block starts with an instruction that lacks location information. See CodeViewDebug::beginInstruction and DwarfDebug::beginInstruction for what we do today. -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20180619/460de41a/attachment.html>
via llvm-dev
2018-Jun-20  02:20 UTC
[llvm-dev] [RFC] Removing debug locations from ConstantSDNodes
Our users really wanted the line table entry for line 2 to include the constant
materialization code for some VS debugger feature.
Same here.  I have serious doubts that the fastisel tactic of bubbling constants
up to the top of the block really helps code size or performance that much, and
it distinctly hurts the debugging experience.  It's reordering code at –O0
and that is generally a Bad Thing™.
--paulr
From: Reid Kleckner [mailto:rnk at google.com]
Sent: Tuesday, June 19, 2018 9:37 PM
To: Vedant Kumar; Robinson, Paul
Cc: llvm-dev; Justin Bogner; David Li; David Blaikie
Subject: Re: [RFC] Removing debug locations from ConstantSDNodes
On Tue, Jun 19, 2018 at 5:46 PM Vedant Kumar <vsk at apple.com<mailto:vsk
at apple.com>> wrote:
Someone (Reid?) mentioned that we could try sinking constants to their point of
first use as an alternative, and (IIUC) create new nodes with distinct DebugLocs
for each use of a constant. I don't recall the details of that alternative
clearly. Based on my (likely incorrect) understanding of it, dropping locations
from constants outright might be simpler.
Our use case was in fastisel, so things are different. I don't think my
solution will help you.
In our case, users were complaining about code like this:
volatile int do_something;
void f() {
  ++do_something;
  foo(1, 2, 3);
  ++do_something;
}
We'd generate locations like:
  .loc line 1
  incl do_something(%rip)
  movl $1, %ecx
  movl $2, %edx
  movl $3, %r8d
  .loc line 2 # line 2 starts here, instead of 3 instructions earlier
  callq foo
  .loc line 3
  incl do_something(%rip)
Our users really wanted the line table entry for line 2 to include the constant
materialization code for some VS debugger feature.
I think if you remove locations from ConstantSDNodes, you might want to add a
late pass that propagates source locations backwards onto location-less
instructions. This would also avoid some special cases when a basic block starts
with an instruction that lacks location information. See
CodeViewDebug::beginInstruction and DwarfDebug::beginInstruction for what we do
today.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20180620/85e0dcb0/attachment-0001.html>
Xinliang David Li via llvm-dev
2018-Jun-20  03:10 UTC
[llvm-dev] [RFC] Removing debug locations from ConstantSDNodes
+Dehao Chen <dehao at google.com> to comment on the potential impact. In general, I prefer the debug info is kept when sample PGO is on. David On Tue, Jun 19, 2018 at 5:46 PM Vedant Kumar <vsk at apple.com> wrote:> Some time ago I saw a report of strange single-stepping behavior given > Swift code that looks like this: > > 1| var input = [16, 14, 10, 9, 8, 7, 4, 3, 2, 1] //< Only number 1 is > relevant. > 2| print("start") > > TL;DR: The debugger steps from line 2 -> 1 -> 2 in this code example. One > clean way to fix this bug is to remove debug locations from > ConstantSDNodes, and I'm asking if there are any objections to doing that. > > In more detail: > > The problem appears to be that we assign a debug location to the > ConstantSDNode for "1". When this constant is used again (as it happens to > be in the lowered version of the call to print()) the debugger steps back > to the first use of the constant. Here's a snippet of MIR output that > illustrates the problem: > > %19:gr64 = ADD64ri8 %18, 8, implicit-def dead %eflags; GR64:%19,%18 > dbg:destroy-after-foreach.swift:2:7 > %20:gr32 = MOV32ri64 1; GR32:%20 dbg:destroy-after-foreach.swift:1:44 > %21:gr64 = SUBREG_TO_REG 0, killed %20, sub_32bit; GR64:%21 GR32:%20 > dbg:destroy-after-foreach.swift:1:44 > %rdi = COPY %21; GR64:%21 dbg:destroy-after-foreach.swift:2:7 > > The out-of-order stepping behavior is confusing and unexpected to users. > ISTM that the simplest way to fix this bug is to not attach debug locations > to constant SDNodes. This matches the model used by IR (Constants don't > have DebugLocs). As an added benefit, it would let us delete a fair amount > of code which traffics in SDLocs. I'm not sure what (if any) impact this > would have on sampling-based profilers. SamplePGO, at least, has smoothing > methods specifically designed to paper over this sort of "out-of-place" > debug location issue (see > https://storage.googleapis.com/pub-tools-public-publication-data/pdf/45290.pdf, > Section 4.1.4, "Sources of Profile Inaccuracies"). > > I brought all of this up a while ago on IRC: some feedback was positive, > but I didn't get a clear +1 or -1 to proceed. > > Please let me know what you think, if there are cleaner alternatives to > consider, etc. Any comments/feedback would be appreciated. > > Someone (Reid?) mentioned that we could try sinking constants to their > point of first use as an alternative, and (IIUC) create new nodes with > distinct DebugLocs for each use of a constant. I don't recall the details > of that alternative clearly. Based on my (likely incorrect) understanding > of it, dropping locations from constants outright might be simpler. > > thanks, > vedant >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20180619/a3131a07/attachment.html>
Vedant Kumar via llvm-dev
2018-Jun-20  17:47 UTC
[llvm-dev] [RFC] Removing debug locations from ConstantSDNodes
> On Jun 19, 2018, at 6:36 PM, Reid Kleckner <rnk at google.com> wrote: > > On Tue, Jun 19, 2018 at 5:46 PM Vedant Kumar <vsk at apple.com <mailto:vsk at apple.com>> wrote: > Someone (Reid?) mentioned that we could try sinking constants to their point of first use as an alternative, and (IIUC) create new nodes with distinct DebugLocs for each use of a constant. I don't recall the details of that alternative clearly. Based on my (likely incorrect) understanding of it, dropping locations from constants outright might be simpler. > > Our use case was in fastisel, so things are different. I don't think my solution will help you. > > In our case, users were complaining about code like this: > volatile int do_something; > void f() { > ++do_something; > foo(1, 2, 3); > ++do_something; > } > > We'd generate locations like: > .loc line 1 > incl do_something(%rip) > movl $1, %ecx > movl $2, %edx > movl $3, %r8d > .loc line 2 # line 2 starts here, instead of 3 instructions earlier > callq foo > .loc line 3 > incl do_something(%rip) > > Our users really wanted the line table entry for line 2 to include the constant materialization code for some VS debugger feature.Got it. (For others following along, there's more discussion about this debugger feature in the commit description for r327581.)> I think if you remove locations from ConstantSDNodes, you might want to add a late pass that propagates source locations backwards onto location-less instructions. This would also avoid some special cases when a basic block starts with an instruction that lacks location information. See CodeViewDebug::beginInstruction and DwarfDebug::beginInstruction for what we do today.Interesting, I hadn't considered doing this. What happens in the special case where a block starts with a location-less instruction? Ah, I see that CodeViewDebug::beginInstruction does a forward scan to find the first instruction with location. This can fail though: there might not be such an instruction, in which case... I assume we either apply a line-0 location, or nothing at all, similar to Dwarf::beginInstruction? It's a bit unclear to me what the benefits of a late pass that backwards-propagated locations would be. I suppose we'd have fewer forward scans in CodeViewDebug::beginInstruction, so is the benefit a compile-time win (which would offset the cost of a late pass)? Would the final debug info quality would be better in some way I'm just missing? thanks! vedant -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20180620/b1d5f899/attachment.html>