Timur Iskhodzhanov
2013-Nov-19 18:42 UTC
[LLVMdev] Adding line table debug information to LLVM on Windows
Attached is a slightly updated patch. (it doesn't include D2222 yet). 2013/11/19 Eric Christopher <echristo at gmail.com>:> In general I do think we're going to need to abstract it out as much > as possible. I'm not sure what the previous patch looks like, but > abstracting the interface out would be general goodness for this. We > can talk about designs for that as we move on.How about http://llvm-reviews.chandlerc.com/D2222 ? // Ha! A lucky number!> As far as how to > migrate the decision down we can have it both as an option to code gen > maybe or, for now, make it dependent upon triple. The former is, I > think, the best option there.You mean LLVM CodeGen or Clang CodeGen? Can you suggest a few places in the code where I can find the clues for that? I'm not yet familiar with this part of the project...> I'm not sure if you're going to want to support both debug info at the > same time, but it's a readonly format at debug emission so I don't see > it as being a problem.Well, if two debug formats share the same section name, they might conflict with each other. I don't think this is the case for Dwarf&CodeView [yet?].> -Zmlt makes sense as the clang-cl name, or just > make it whatever the debug mode flag is for cl.exe - this is at least > a start down that path.2013/11/19 Reid Kleckner <rnk at google.com>:> I'd just use /Z7, since that's the cl.exe flag for the compatible format. > This will need a long-form clang -cc1 flag name, though.Z7 implies a much more complete debug info than what we'll emit short-term. Since Z7 is not widely used, I don't think threading Z7 is any simpler than Zmlt. I don't have a strong opinion though, WDYT?> I think the best way for the tests will be to write a set of parsers, > etc that can dump the info. I realize this is likely a very large > undertaking as well, but it seems like the only way to ensure we're > getting some decent testing out.As you can see from my patch, I actually succeeded in writing *some* tests without a dumper - just by using the MCAsmStreamer. Do you think we should really write a dumper too? That's kinda hard and we don't plan to fully support the CodeView format yet...> Any other questions I missed?Please see the TODOs in the attached patch. You are very likely to come up with a better design/ideas given I'm new to this part of the codebase. One particular question I'd like to emphasize is getting a full filepath for a given MDNode. As far as I can tell, the metadata for scopes holds pairs of <filename, directory>, which reflects how DWARF stores them. However, CodeView stores full paths as entire strings (I admit that ain't efficient). Currently, I concat the directory and filename together, but it a) requires some extra memory management b) requires special tricks to handle filenames starting from "./", "../", etc. c) the slashes in the directory name and filename are not consistent on Windows and is ugly in general. Do you think it's appropriate to change the scope metadata format to store <filename, directory, fullpath> instead? That'd require changing Clang, right?> -eric > > > On Mon, Nov 18, 2013 at 9:14 AM, Timur Iskhodzhanov <timurrrr at google.com> wrote: >> I wrote some more lit tests for my patch and realized I was generating >> some redundant info. This is fixed now. Attached is a new version >> of the prototype patch with some more tests. >> >> 2013/11/15 João Matos <ripzonetriton at gmail.com>: >>> Hi Timur, >>> >>> There's also a pending patch adding CodeView support in Phab: >>> http://llvm-reviews.chandlerc.com/D165 >> >> I haven't looked at your patch yet, but based on the very low phab >> review number, I'm pretty sure there are good reasons this wasn't >> committed. >> >>> Does your patch provide just a subset of the CodeView debug info provided in >>> the other patch? >> >> Yes. I prefer small incremental changes. >> Also, the file:line debug info is much much more important for me atm >> than the other types of debug info. >> >>> Looking at the patch, I think the approach the other patch took of >>> abstracting the emission of debug information is a bit cleaner and it will >>> probably make life easier when adding more debug formats in the future. >> >> Why wasn't the "abstract the emission of DI" part of that patch >> reviewed/committed separately? >> >>> On Fri, Nov 15, 2013 at 4:39 PM, Timur Iskhodzhanov <timurrrr at google.com> >>> wrote: >>>> >>>> 2013/11/14 Timur Iskhodzhanov <timurrrr at google.com>: >>>> > Hi David, Eric, LLVM devs, >>>> > >>>> > You've probably heard about AddressSanitizer (ASan) and other >>>> > sanitizers based on LLVM. One of the things that makes ASan >>>> > not as awesome on Windows as it is on Linux >>>> > is the symbolization of the stacks. >>>> > >>>> > Currently, ASan runtime on Windows uses >>>> > CaptureStackBackTrace/SymFromAddr/SymGetLineFromAddr64 >>>> > to unwind and symbolize stacks. This works like a charm >>>> > in-process for stack frames built with CL, but yields >>>> > "function+0x0ff537" for frames built with Clang. >>>> > >>>> > I came up with a prototype which emits "old-style debug info" COFF >>>> > sections that are sufficient to get function name / filename / >>>> > linenumber information. That's pretty much everything that's required >>>> > for ASan to work beautifully in terms of the completeness of error >>>> > reports. >>>> > >>>> > Attached is a prototype patch which I've tried on some simple tests, >>>> > including some more complex ones with weird #line constructions. >>>> > It also works just great on ASan/Win tests without any link/run-time >>>> > warnings (I had a bunch of those during development, so I can tell it >>>> > works rather than fails silently). >>>> > >>>> > I didn't have time to work on threading the command-line flags into >>>> > the AsmPrinter yet, so currently it just replaces DWARF entirely. >>>> > Of course, this should be fixed before this lands into trunk. >>>> > Currently, one can try this patch by using "clang-cl -Xclang -g ...". >>>> > Eventually we should have some dedicated flag for clang-cl. >>>> > >>>> > Can you please take a look at the patch and suggest a good path forward? >>>> > >>>> > I'm very unfamiliar with LLVM CodeGen/MC, so I'm pretty sure I made a >>>> > few weird things in the code... I've also put a few TODOs with >>>> > questions and suggestions. >>>> > >>>> > Some general questions: >>>> > 1) Threading flags from the driver down to CodeGen. >>>> > How do we do that? Should we support all 4 combinations >>>> > of no-info/DWARF/CVLT/both? >>>> > How about "-Zmlt" as the clang-cl flag name? ("minimal line tables") >>>> > >>>> > 2) Am I right that DWARF is pretty much the only debug info format >>>> > supported by LLVM/AsmPrinter right now? Do we want to take >>>> > an effort to come up with a generic debuginfogenerator interface >>>> > to share between DwarfDebug and WinCodeViewLineTables? >>>> > Then AsmPrinter should just hold a SmallVector<DebugInfoEmitter*> >>>> > rather than a pair of DD/DE pointers. >>>> > >>>> > 3) How would you suggest to write WinCodeViewLineTables tests >>>> > given that dumpbin is not available everywhere except Windows? >>>> > // Yeah, I should have looked at the DwarfDebug LIT tests and >>>> > // written some; but the prototype development went faster >>>> > // than I expected... >>>> >>>> I found the MCAsmStreamer being used by llc which gives a decent text >>>> format. >>>> I wrote a simple x86+x86_64 .ll test and FileCheck expectations for >>>> the llc asm output. >>>> Is this the right approach to write tests? >>>> If so, I'll convert my remaining C program test cases into such an >>>> .ll+llc tests. >>>> >>>> > Can you suggest ways to split this patch so it's easier >>>> > to review part-by-part before this hits trunk? >>>> >>>> Attached is an updated patch with a new test and a few minor things >>>> improved. >>>> I also removed the "TODO: test on X64" as I did try it on x64 and no >>>> changes were required. >>>> >>>> Looking forward to your feedback! >>>> >>>> > Thanks! >>>> > -- >>>> > Timur Iskhodzhanov, >>>> > Google >>>> >>>> _______________________________________________ >>>> LLVM Developers mailing list >>>> LLVMdev at cs.uiuc.edu http://llvm.cs.uiuc.edu >>>> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev >>>> >>> >>> >>> >>> -- >>> João Matos-------------- next part -------------- A non-text attachment was scrubbed... Name: codeview_linetables.patch Type: application/octet-stream Size: 60019 bytes Desc: not available URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20131119/a0543600/attachment.obj>
Reid Kleckner
2013-Nov-19 23:01 UTC
[LLVMdev] Adding line table debug information to LLVM on Windows
On Tue, Nov 19, 2013 at 10:42 AM, Timur Iskhodzhanov <timurrrr at google.com>wrote:> > 2013/11/19 Reid Kleckner <rnk at google.com>: > > I'd just use /Z7, since that's the cl.exe flag for the compatible format. > > This will need a long-form clang -cc1 flag name, though. > > Z7 implies a much more complete debug info than what we'll emit short-term. > Since Z7 is not widely used, I don't think threading Z7 is any simpler > than Zmlt. > I don't have a strong opinion though, WDYT?My thinking is that it works well with /fallback. We'll make Z7-style debug info, and for TUs compiled with cl.exe, we'll get the same style info. -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20131119/ef1844b6/attachment.html>
Timur Iskhodzhanov
2013-Nov-20 04:44 UTC
[LLVMdev] Adding line table debug information to LLVM on Windows
Ah, seamlessly integrating with /fallback is a good point. I'm not sure we really want Z7 rather than Zi when we fall back, but let's start with Z7 and see how it goes. 20 нояб. 2013 г. 3:01 пользователь "Reid Kleckner" <rnk at google.com> написал:> On Tue, Nov 19, 2013 at 10:42 AM, Timur Iskhodzhanov <timurrrr at google.com>wrote: >> >> 2013/11/19 Reid Kleckner <rnk at google.com>: >> > I'd just use /Z7, since that's the cl.exe flag for the compatible >> format. >> > This will need a long-form clang -cc1 flag name, though. >> >> Z7 implies a much more complete debug info than what we'll emit >> short-term. >> Since Z7 is not widely used, I don't think threading Z7 is any simpler >> than Zmlt. >> I don't have a strong opinion though, WDYT? > > > My thinking is that it works well with /fallback. We'll make Z7-style > debug info, and for TUs compiled with cl.exe, we'll get the same style info. > >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20131120/c777a64d/attachment.html>
Timur Iskhodzhanov
2013-Nov-20 17:46 UTC
[LLVMdev] Adding line table debug information to LLVM on Windows
Eric, David, 2013/11/19 Timur Iskhodzhanov <timurrrr at google.com>:> Attached is a slightly updated patch. > (it doesn't include D2222 yet).The new version of the patch stopped fitting into the llvmdev 100K limit, so I've uploaded it to http://llvm-reviews.chandlerc.com/D2232 (you need to apply D2222 first if you'd like to give D2232 a try)> 2013/11/19 Eric Christopher <echristo at gmail.com>: >> In general I do think we're going to need to abstract it out as much >> as possible. I'm not sure what the previous patch looks like, but >> abstracting the interface out would be general goodness for this. We >> can talk about designs for that as we move on. > > How about http://llvm-reviews.chandlerc.com/D2222 ? > // Ha! A lucky number! > >> As far as how to >> migrate the decision down we can have it both as an option to code gen >> maybe or, for now, make it dependent upon triple. The former is, I >> think, the best option there. > > You mean LLVM CodeGen or Clang CodeGen?On the second thought, "llc" seems to choose ELF vs COFF by using a triple. I think we should make the DWARF vs CodeView choice in sync with the object file format choice, at least to begin with. WDYT?> Can you suggest a few places in the code where I can find the clues for that? > I'm not yet familiar with this part of the project... > >> I'm not sure if you're going to want to support both debug info at the >> same time, but it's a readonly format at debug emission so I don't see >> it as being a problem. > > Well, if two debug formats share the same section name, they might > conflict with each other. > I don't think this is the case for Dwarf&CodeView [yet?]. > >> -Zmlt makes sense as the clang-cl name, or just >> make it whatever the debug mode flag is for cl.exe - this is at least >> a start down that path. > > 2013/11/19 Reid Kleckner <rnk at google.com>: >> I'd just use /Z7, since that's the cl.exe flag for the compatible format. >> This will need a long-form clang -cc1 flag name, though. > > Z7 implies a much more complete debug info than what we'll emit short-term. > Since Z7 is not widely used, I don't think threading Z7 is any simpler > than Zmlt. > I don't have a strong opinion though, WDYT?[I agreed to use Z7 on this thread]>> I think the best way for the tests will be to write a set of parsers, >> etc that can dump the info. I realize this is likely a very large >> undertaking as well, but it seems like the only way to ensure we're >> getting some decent testing out. > > As you can see from my patch, I actually succeeded in writing *some* > tests without a dumper - just by using the MCAsmStreamer. > Do you think we should really write a dumper too? > That's kinda hard and we don't plan to fully support the CodeView format yet...I tried my patch on Chromium and it hit the llvm_unreachable I wrote in WinCOFFStreamer::EmitCOFFStaticOffset(). Now that it also supports using a fixup to calculate the offset (that happens as the second pass, not supported by MCAsmStreamer, right?), I think I do have to write at least a simple dumper... Any hints on how to do that? Should it be a separate app or built into some COFF reader readily available? Should it have its own tests, i.e. binary COFF files added to the repo? Good news - other than that, the emitter seems to work fine on some medium-sized Chromium tests and generate symbolized ASan reports if I manually introduce an error in the binary.>> Any other questions I missed? > > Please see the TODOs in the attached patch. > You are very likely to come up with a better design/ideas given I'm > new to this part of the codebase. > > One particular question I'd like to emphasize is getting a full > filepath for a given MDNode. > As far as I can tell, the metadata for scopes holds pairs of > <filename, directory>, which reflects how DWARF stores them. > However, CodeView stores full paths as entire strings (I admit that > ain't efficient). > Currently, I concat the directory and filename together, but it > a) requires some extra memory management > b) requires special tricks to handle filenames starting from "./", "../", etc. > c) the slashes in the directory name and filename are not consistent on Windows > and is ugly in general. > > Do you think it's appropriate to change the scope metadata format to > store <filename, directory, fullpath> instead? > That'd require changing Clang, right?ping>> -eric >> >> >> On Mon, Nov 18, 2013 at 9:14 AM, Timur Iskhodzhanov <timurrrr at google.com> wrote: >>> I wrote some more lit tests for my patch and realized I was generating >>> some redundant info. This is fixed now. Attached is a new version >>> of the prototype patch with some more tests. >>> >>> 2013/11/15 João Matos <ripzonetriton at gmail.com>: >>>> Hi Timur, >>>> >>>> There's also a pending patch adding CodeView support in Phab: >>>> http://llvm-reviews.chandlerc.com/D165 >>> >>> I haven't looked at your patch yet, but based on the very low phab >>> review number, I'm pretty sure there are good reasons this wasn't >>> committed. >>> >>>> Does your patch provide just a subset of the CodeView debug info provided in >>>> the other patch? >>> >>> Yes. I prefer small incremental changes. >>> Also, the file:line debug info is much much more important for me atm >>> than the other types of debug info. >>> >>>> Looking at the patch, I think the approach the other patch took of >>>> abstracting the emission of debug information is a bit cleaner and it will >>>> probably make life easier when adding more debug formats in the future. >>> >>> Why wasn't the "abstract the emission of DI" part of that patch >>> reviewed/committed separately? >>> >>>> On Fri, Nov 15, 2013 at 4:39 PM, Timur Iskhodzhanov <timurrrr at google.com> >>>> wrote: >>>>> >>>>> 2013/11/14 Timur Iskhodzhanov <timurrrr at google.com>: >>>>> > Hi David, Eric, LLVM devs, >>>>> > >>>>> > You've probably heard about AddressSanitizer (ASan) and other >>>>> > sanitizers based on LLVM. One of the things that makes ASan >>>>> > not as awesome on Windows as it is on Linux >>>>> > is the symbolization of the stacks. >>>>> > >>>>> > Currently, ASan runtime on Windows uses >>>>> > CaptureStackBackTrace/SymFromAddr/SymGetLineFromAddr64 >>>>> > to unwind and symbolize stacks. This works like a charm >>>>> > in-process for stack frames built with CL, but yields >>>>> > "function+0x0ff537" for frames built with Clang. >>>>> > >>>>> > I came up with a prototype which emits "old-style debug info" COFF >>>>> > sections that are sufficient to get function name / filename / >>>>> > linenumber information. That's pretty much everything that's required >>>>> > for ASan to work beautifully in terms of the completeness of error >>>>> > reports. >>>>> > >>>>> > Attached is a prototype patch which I've tried on some simple tests, >>>>> > including some more complex ones with weird #line constructions. >>>>> > It also works just great on ASan/Win tests without any link/run-time >>>>> > warnings (I had a bunch of those during development, so I can tell it >>>>> > works rather than fails silently). >>>>> > >>>>> > I didn't have time to work on threading the command-line flags into >>>>> > the AsmPrinter yet, so currently it just replaces DWARF entirely. >>>>> > Of course, this should be fixed before this lands into trunk. >>>>> > Currently, one can try this patch by using "clang-cl -Xclang -g ...". >>>>> > Eventually we should have some dedicated flag for clang-cl. >>>>> > >>>>> > Can you please take a look at the patch and suggest a good path forward? >>>>> > >>>>> > I'm very unfamiliar with LLVM CodeGen/MC, so I'm pretty sure I made a >>>>> > few weird things in the code... I've also put a few TODOs with >>>>> > questions and suggestions. >>>>> > >>>>> > Some general questions: >>>>> > 1) Threading flags from the driver down to CodeGen. >>>>> > How do we do that? Should we support all 4 combinations >>>>> > of no-info/DWARF/CVLT/both? >>>>> > How about "-Zmlt" as the clang-cl flag name? ("minimal line tables") >>>>> > >>>>> > 2) Am I right that DWARF is pretty much the only debug info format >>>>> > supported by LLVM/AsmPrinter right now? Do we want to take >>>>> > an effort to come up with a generic debuginfogenerator interface >>>>> > to share between DwarfDebug and WinCodeViewLineTables? >>>>> > Then AsmPrinter should just hold a SmallVector<DebugInfoEmitter*> >>>>> > rather than a pair of DD/DE pointers. >>>>> > >>>>> > 3) How would you suggest to write WinCodeViewLineTables tests >>>>> > given that dumpbin is not available everywhere except Windows? >>>>> > // Yeah, I should have looked at the DwarfDebug LIT tests and >>>>> > // written some; but the prototype development went faster >>>>> > // than I expected... >>>>> >>>>> I found the MCAsmStreamer being used by llc which gives a decent text >>>>> format. >>>>> I wrote a simple x86+x86_64 .ll test and FileCheck expectations for >>>>> the llc asm output. >>>>> Is this the right approach to write tests? >>>>> If so, I'll convert my remaining C program test cases into such an >>>>> .ll+llc tests. >>>>> >>>>> > Can you suggest ways to split this patch so it's easier >>>>> > to review part-by-part before this hits trunk? >>>>> >>>>> Attached is an updated patch with a new test and a few minor things >>>>> improved. >>>>> I also removed the "TODO: test on X64" as I did try it on x64 and no >>>>> changes were required. >>>>> >>>>> Looking forward to your feedback! >>>>> >>>>> > Thanks! >>>>> > -- >>>>> > Timur Iskhodzhanov, >>>>> > Google >>>>> >>>>> _______________________________________________ >>>>> LLVM Developers mailing list >>>>> LLVMdev at cs.uiuc.edu http://llvm.cs.uiuc.edu >>>>> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev >>>>> >>>> >>>> >>>> >>>> -- >>>> João Matos
Reid Kleckner
2013-Nov-20 18:01 UTC
[LLVMdev] Adding line table debug information to LLVM on Windows
On Wed, Nov 20, 2013 at 9:46 AM, Timur Iskhodzhanov <timurrrr at google.com>wrote:> Eric, David, > > 2013/11/19 Timur Iskhodzhanov <timurrrr at google.com>: > > Attached is a slightly updated patch. > > (it doesn't include D2222 yet). > > The new version of the patch stopped fitting into the llvmdev 100K limit, > so I've uploaded it to http://llvm-reviews.chandlerc.com/D2232 > (you need to apply D2222 first if you'd like to give D2232 a try) > > > 2013/11/19 Eric Christopher <echristo at gmail.com>: > >> In general I do think we're going to need to abstract it out as much > >> as possible. I'm not sure what the previous patch looks like, but > >> abstracting the interface out would be general goodness for this. We > >> can talk about designs for that as we move on. > > > > How about http://llvm-reviews.chandlerc.com/D2222 ? > > // Ha! A lucky number! > > > >> As far as how to > >> migrate the decision down we can have it both as an option to code gen > >> maybe or, for now, make it dependent upon triple. The former is, I > >> think, the best option there. > > > > You mean LLVM CodeGen or Clang CodeGen? > > On the second thought, "llc" seems to choose ELF vs COFF by using a triple. > I think we should make the DWARF vs CodeView choice in sync with the > object file format choice, at least to begin with. > WDYT?I think we can use the triple OS to choose a sensible default here (win32 -> CodeView, mingw -> DWARF), but eventually some users might want to force DWARF on win32 with a flag because it is more complete.> > Can you suggest a few places in the code where I can find the clues for > that? > > I'm not yet familiar with this part of the project... > > > >> I'm not sure if you're going to want to support both debug info at the > >> same time, but it's a readonly format at debug emission so I don't see > >> it as being a problem. > > > > Well, if two debug formats share the same section name, they might > > conflict with each other. > > I don't think this is the case for Dwarf&CodeView [yet?]. > > > >> -Zmlt makes sense as the clang-cl name, or just > >> make it whatever the debug mode flag is for cl.exe - this is at least > >> a start down that path. > > > > 2013/11/19 Reid Kleckner <rnk at google.com>: > >> I'd just use /Z7, since that's the cl.exe flag for the compatible > format. > >> This will need a long-form clang -cc1 flag name, though. > > > > Z7 implies a much more complete debug info than what we'll emit > short-term. > > Since Z7 is not widely used, I don't think threading Z7 is any simpler > > than Zmlt. > > I don't have a strong opinion though, WDYT? > > [I agreed to use Z7 on this thread] > > >> I think the best way for the tests will be to write a set of parsers, > >> etc that can dump the info. I realize this is likely a very large > >> undertaking as well, but it seems like the only way to ensure we're > >> getting some decent testing out. > > > > As you can see from my patch, I actually succeeded in writing *some* > > tests without a dumper - just by using the MCAsmStreamer. > > Do you think we should really write a dumper too? > > That's kinda hard and we don't plan to fully support the CodeView format > yet... > > I tried my patch on Chromium and it hit the llvm_unreachable I wrote > in WinCOFFStreamer::EmitCOFFStaticOffset(). > Now that it also supports using a fixup to calculate the offset (that > happens as the second pass, not supported by MCAsmStreamer, right?), I > think I do have to write at least a simple dumper... > > Any hints on how to do that? Should it be a separate app or built into > some COFF reader readily available? Should it have its own tests, i.e. > binary COFF files added to the repo? > > Good news - other than that, the emitter seems to work fine on some > medium-sized Chromium tests and generate symbolized ASan reports if I > manually introduce an error in the binary. > > >> Any other questions I missed? > > > > Please see the TODOs in the attached patch. > > You are very likely to come up with a better design/ideas given I'm > > new to this part of the codebase. > > > > One particular question I'd like to emphasize is getting a full > > filepath for a given MDNode. > > As far as I can tell, the metadata for scopes holds pairs of > > <filename, directory>, which reflects how DWARF stores them. > > However, CodeView stores full paths as entire strings (I admit that > > ain't efficient). > > Currently, I concat the directory and filename together, but it > > a) requires some extra memory management > > b) requires special tricks to handle filenames starting from "./", > "../", etc. > > c) the slashes in the directory name and filename are not consistent on > Windows > > and is ugly in general. > > > > Do you think it's appropriate to change the scope metadata format to > > store <filename, directory, fullpath> instead? > > That'd require changing Clang, right? > > pingI think the problems you mention can be resolved without changing the DI metadata format. Debug info metadata already consumes too much memory. -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20131120/dc175a28/attachment.html>
Anton Korobeynikov
2013-Nov-20 18:14 UTC
[LLVMdev] Adding line table debug information to LLVM on Windows
> On the second thought, "llc" seems to choose ELF vs COFF by using a triple. > I think we should make the DWARF vs CodeView choice in sync with the > object file format choice, at least to begin with. > WDYT?On Windows it's perfectly fine to have DWARF encoded into COFF objects. E.g. in case of mingw32/-w64 triple. -- With best regards, Anton Korobeynikov Faculty of Mathematics and Mechanics, Saint Petersburg State University
Eric Christopher
2013-Nov-20 21:02 UTC
[LLVMdev] Adding line table debug information to LLVM on Windows
On Wed, Nov 20, 2013 at 9:46 AM, Timur Iskhodzhanov <timurrrr at google.com> wrote:> Eric, David, > > 2013/11/19 Timur Iskhodzhanov <timurrrr at google.com>: >> Attached is a slightly updated patch. >> (it doesn't include D2222 yet). > > The new version of the patch stopped fitting into the llvmdev 100K limit, > so I've uploaded it to http://llvm-reviews.chandlerc.com/D2232 > (you need to apply D2222 first if you'd like to give D2232 a try) >That's a big patch. :)>> 2013/11/19 Eric Christopher <echristo at gmail.com>: >>> In general I do think we're going to need to abstract it out as much >>> as possible. I'm not sure what the previous patch looks like, but >>> abstracting the interface out would be general goodness for this. We >>> can talk about designs for that as we move on. >> >> How about http://llvm-reviews.chandlerc.com/D2222 ? >> // Ha! A lucky number! >> >>> As far as how to >>> migrate the decision down we can have it both as an option to code gen >>> maybe or, for now, make it dependent upon triple. The former is, I >>> think, the best option there. >> >> You mean LLVM CodeGen or Clang CodeGen? >LLVM... and we really need to change that directory in Clang. :)> On the second thought, "llc" seems to choose ELF vs COFF by using a triple. > I think we should make the DWARF vs CodeView choice in sync with the > object file format choice, at least to begin with. > WDYT? >Yep. Sounds good to me. I'd suggest triple, that way you can base it on "windows" versus "coff".>> Can you suggest a few places in the code where I can find the clues for that? >> I'm not yet familiar with this part of the project... >>lib/CodeGen/AsmPrinter/DwarfDebug.cpp has some checks for triple/os.> > I tried my patch on Chromium and it hit the llvm_unreachable I wrote > in WinCOFFStreamer::EmitCOFFStaticOffset(). > Now that it also supports using a fixup to calculate the offset (that > happens as the second pass, not supported by MCAsmStreamer, right?), I > think I do have to write at least a simple dumper... > > Any hints on how to do that? Should it be a separate app or built into > some COFF reader readily available? Should it have its own tests, i.e. > binary COFF files added to the repo? >Probably the same structure that we've been going down via lib/DebugInfo/. A set of files than handle reading and parsing and both some binary files and some files produced by the backend.> Good news - other than that, the emitter seems to work fine on some > medium-sized Chromium tests and generate symbolized ASan reports if I > manually introduce an error in the binary. >Awesome.>>> Any other questions I missed? >> >> Please see the TODOs in the attached patch. >> You are very likely to come up with a better design/ideas given I'm >> new to this part of the codebase. >>Will do.>> One particular question I'd like to emphasize is getting a full >> filepath for a given MDNode. >> As far as I can tell, the metadata for scopes holds pairs of >> <filename, directory>, which reflects how DWARF stores them. >> However, CodeView stores full paths as entire strings (I admit that >> ain't efficient). >> Currently, I concat the directory and filename together, but it >> a) requires some extra memory management >> b) requires special tricks to handle filenames starting from "./", "../", etc. >> c) the slashes in the directory name and filename are not consistent on Windows >> and is ugly in general. >> >> Do you think it's appropriate to change the scope metadata format to >> store <filename, directory, fullpath> instead? >> That'd require changing Clang, right? >It would require changing clang. Since this is a single string per file I'm not against adding the full path to the source file in the metadata along side the basename/compilation dir pair. -eric> ping > >>> -eric >>> >>> >>> On Mon, Nov 18, 2013 at 9:14 AM, Timur Iskhodzhanov <timurrrr at google.com> wrote: >>>> I wrote some more lit tests for my patch and realized I was generating >>>> some redundant info. This is fixed now. Attached is a new version >>>> of the prototype patch with some more tests. >>>> >>>> 2013/11/15 João Matos <ripzonetriton at gmail.com>: >>>>> Hi Timur, >>>>> >>>>> There's also a pending patch adding CodeView support in Phab: >>>>> http://llvm-reviews.chandlerc.com/D165 >>>> >>>> I haven't looked at your patch yet, but based on the very low phab >>>> review number, I'm pretty sure there are good reasons this wasn't >>>> committed. >>>> >>>>> Does your patch provide just a subset of the CodeView debug info provided in >>>>> the other patch? >>>> >>>> Yes. I prefer small incremental changes. >>>> Also, the file:line debug info is much much more important for me atm >>>> than the other types of debug info. >>>> >>>>> Looking at the patch, I think the approach the other patch took of >>>>> abstracting the emission of debug information is a bit cleaner and it will >>>>> probably make life easier when adding more debug formats in the future. >>>> >>>> Why wasn't the "abstract the emission of DI" part of that patch >>>> reviewed/committed separately? >>>> >>>>> On Fri, Nov 15, 2013 at 4:39 PM, Timur Iskhodzhanov <timurrrr at google.com> >>>>> wrote: >>>>>> >>>>>> 2013/11/14 Timur Iskhodzhanov <timurrrr at google.com>: >>>>>> > Hi David, Eric, LLVM devs, >>>>>> > >>>>>> > You've probably heard about AddressSanitizer (ASan) and other >>>>>> > sanitizers based on LLVM. One of the things that makes ASan >>>>>> > not as awesome on Windows as it is on Linux >>>>>> > is the symbolization of the stacks. >>>>>> > >>>>>> > Currently, ASan runtime on Windows uses >>>>>> > CaptureStackBackTrace/SymFromAddr/SymGetLineFromAddr64 >>>>>> > to unwind and symbolize stacks. This works like a charm >>>>>> > in-process for stack frames built with CL, but yields >>>>>> > "function+0x0ff537" for frames built with Clang. >>>>>> > >>>>>> > I came up with a prototype which emits "old-style debug info" COFF >>>>>> > sections that are sufficient to get function name / filename / >>>>>> > linenumber information. That's pretty much everything that's required >>>>>> > for ASan to work beautifully in terms of the completeness of error >>>>>> > reports. >>>>>> > >>>>>> > Attached is a prototype patch which I've tried on some simple tests, >>>>>> > including some more complex ones with weird #line constructions. >>>>>> > It also works just great on ASan/Win tests without any link/run-time >>>>>> > warnings (I had a bunch of those during development, so I can tell it >>>>>> > works rather than fails silently). >>>>>> > >>>>>> > I didn't have time to work on threading the command-line flags into >>>>>> > the AsmPrinter yet, so currently it just replaces DWARF entirely. >>>>>> > Of course, this should be fixed before this lands into trunk. >>>>>> > Currently, one can try this patch by using "clang-cl -Xclang -g ...". >>>>>> > Eventually we should have some dedicated flag for clang-cl. >>>>>> > >>>>>> > Can you please take a look at the patch and suggest a good path forward? >>>>>> > >>>>>> > I'm very unfamiliar with LLVM CodeGen/MC, so I'm pretty sure I made a >>>>>> > few weird things in the code... I've also put a few TODOs with >>>>>> > questions and suggestions. >>>>>> > >>>>>> > Some general questions: >>>>>> > 1) Threading flags from the driver down to CodeGen. >>>>>> > How do we do that? Should we support all 4 combinations >>>>>> > of no-info/DWARF/CVLT/both? >>>>>> > How about "-Zmlt" as the clang-cl flag name? ("minimal line tables") >>>>>> > >>>>>> > 2) Am I right that DWARF is pretty much the only debug info format >>>>>> > supported by LLVM/AsmPrinter right now? Do we want to take >>>>>> > an effort to come up with a generic debuginfogenerator interface >>>>>> > to share between DwarfDebug and WinCodeViewLineTables? >>>>>> > Then AsmPrinter should just hold a SmallVector<DebugInfoEmitter*> >>>>>> > rather than a pair of DD/DE pointers. >>>>>> > >>>>>> > 3) How would you suggest to write WinCodeViewLineTables tests >>>>>> > given that dumpbin is not available everywhere except Windows? >>>>>> > // Yeah, I should have looked at the DwarfDebug LIT tests and >>>>>> > // written some; but the prototype development went faster >>>>>> > // than I expected... >>>>>> >>>>>> I found the MCAsmStreamer being used by llc which gives a decent text >>>>>> format. >>>>>> I wrote a simple x86+x86_64 .ll test and FileCheck expectations for >>>>>> the llc asm output. >>>>>> Is this the right approach to write tests? >>>>>> If so, I'll convert my remaining C program test cases into such an >>>>>> .ll+llc tests. >>>>>> >>>>>> > Can you suggest ways to split this patch so it's easier >>>>>> > to review part-by-part before this hits trunk? >>>>>> >>>>>> Attached is an updated patch with a new test and a few minor things >>>>>> improved. >>>>>> I also removed the "TODO: test on X64" as I did try it on x64 and no >>>>>> changes were required. >>>>>> >>>>>> Looking forward to your feedback! >>>>>> >>>>>> > Thanks! >>>>>> > -- >>>>>> > Timur Iskhodzhanov, >>>>>> > Google >>>>>> >>>>>> _______________________________________________ >>>>>> LLVM Developers mailing list >>>>>> LLVMdev at cs.uiuc.edu http://llvm.cs.uiuc.edu >>>>>> http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev >>>>>> >>>>> >>>>> >>>>> >>>>> -- >>>>> João Matos
Apparently Analagous Threads
- [LLVMdev] Adding line table debug information to LLVM on Windows
- [LLVMdev] Adding line table debug information to LLVM on Windows
- [LLVMdev] Adding line table debug information to LLVM on Windows
- [LLVMdev] Adding line table debug information to LLVM on Windows
- [LLVMdev] Adding line table debug information to LLVM on Windows