thr3ads.net - llvm dev - [llvm-dev] RFC: Revisiting LLD-as-a-library design [Jun 2021]

If this information is useful, please help other people find it:
Share via:

Michael Spencer via llvm-dev

2021-Jun-12 03:20 UTC

[llvm-dev] RFC: Revisiting LLD-as-a-library design

Adding Erik (not subscribed) who has previously had issues with LLD not
being a library to provide some additional use cases.

- Michael Spencer


On Thu, Jun 10, 2021 at 12:15 PM Reid Kleckner via llvm-dev <
llvm-dev at lists.llvm.org> wrote:
> Hey all,
>
> Long ago, the LLD project contributors decided that they weren't going
to
> design LLD as a library, which stands in opposition to the way that the
> rest of LLVM strives to be a reusable library. Part of the reasoning was
> that, at the time, LLD wasn't done yet, and the top priority was to
finish
> making LLD a fast, useful, usable product. If sacrificing reusability
> helped LLD achieve its project goals, the contributors at the time felt
> that was the right tradeoff, and that carried the day.
>
> However, it is now ${YEAR} 2021, and I think we ought to reconsider this
> design decision. LLD was a great success: it works, it is fast, it is
> simple, many users have adopted it, it has many ports
> (COFF/ELF/mingw/wasm/new MachO). Today, we have actual users who want to
> run the linker as a library, and they aren't satisfied with the option
of
> launching a child process. Some users are interested in process reuse as a
> performance optimization, some are including the linker in the frontend.
> Who knows. I try not to pre-judge any of these efforts, I think we should
> do what we can to enable experimentation.
>
> So, concretely, what could change? The main points of reusability are:
> - Fatal errors and warnings exit the process without returning control to
> the caller
> - Conflicts over global variables between threads
>
> Error recovery is the big imposition here. To avoid a giant rewrite of all
> error handling code in LLD, I think we should *avoid* returning failure via
> the llvm::Error class or std::error_code. We should instead use an approach
> more like clang, where diagnostics are delivered to a diagnostic consumer
> on the side. The success of the link is determined by whether any errors
> were reported. Functions may return a simple success boolean in cases where
> higher level functions need to exit early. This has worked reasonably well
> for clang. The main failure mode here is that we miss an error check, and
> crash or report useless follow-on errors after an error that would normally
> have been fatal.
>
> Another motivation for all of this is increasing the use of parallelism in
> LLD. Emitting errors in parallel from threads and then exiting the process
> is risky business. A new diagnostic context or consumer could make this
> more reliable. MLIR has this issue as well, and I believe they use this
> pattern. They use some kind of thread shard index to order the diagnostics,
> LLD could do the same.
>
> Finally, we'd work to eliminate globals. I think this is mainly a small
> matter of programming (SMOP) and doesn't need much discussion, although
the
> `make` template presents interesting challenges.
>
> Thoughts? Tomatoes? Flowers? I apologize for the lack of context links to
> the original discussions. It takes more time than I have to dig those up.
>
> Reid
> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org
> https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20210611/b50cdcf2/attachment.html>

pawel k. via llvm-dev

2021-Jun-12 06:00 UTC

head link

[llvm-dev] RFC: Revisiting LLD-as-a-library design

Hello all,
Do I understand correctly lld is binary linker? Does it take arch/os
specific or llvm-specific object files as input or both?

Im very young devel llvm- and clang-wise albeit if libraryfication of lld
doesnt involve too heavy algorithmical and architectural changes and
someone can intro me on working with git (which branch to start from on
which branch to work whether to create separate branch and how), i could
try to prepare working prototype of this solution. Please note though im
still learning.

For macho code, we could use samples in testsuite where I could make sure
nothing breaks after my changes. We could use some such samples too for
lto. In perfect world, we could use some testset too for each arch/os combo.

As on avoiding merge conflicts with current devel including macho code, id
suggest finnishing work there and stabilizing lld first, then when no new
code is changed heavily or introduced anymore, i could try to focus on
architecture rework safely.

As on unification with jit or lto as was mentioned, i dont know enough
about this task yet.

As on avoiding/fixing memory leaks between compilation units etc, i think i
could try to fix those.

Please note though I still dont have clear view as on how to make lld
process more parallel so I may be not yet usable for this project.

Best regards,
Pawel

sob., 12.06.2021, 05:20 użytkownik Michael Spencer via llvm-dev <
llvm-dev at lists.llvm.org> napisał:
> Adding Erik (not subscribed) who has previously had issues with LLD not
> being a library to provide some additional use cases.
>
> - Michael Spencer
>
>
> On Thu, Jun 10, 2021 at 12:15 PM Reid Kleckner via llvm-dev <
> llvm-dev at lists.llvm.org> wrote:
>
>> Hey all,
>>
>> Long ago, the LLD project contributors decided that they weren't
going to
>> design LLD as a library, which stands in opposition to the way that the
>> rest of LLVM strives to be a reusable library. Part of the reasoning
was
>> that, at the time, LLD wasn't done yet, and the top priority was to
finish
>> making LLD a fast, useful, usable product. If sacrificing reusability
>> helped LLD achieve its project goals, the contributors at the time felt
>> that was the right tradeoff, and that carried the day.
>>
>> However, it is now ${YEAR} 2021, and I think we ought to reconsider
this
>> design decision. LLD was a great success: it works, it is fast, it is
>> simple, many users have adopted it, it has many ports
>> (COFF/ELF/mingw/wasm/new MachO). Today, we have actual users who want
to
>> run the linker as a library, and they aren't satisfied with the
option of
>> launching a child process. Some users are interested in process reuse
as a
>> performance optimization, some are including the linker in the
frontend.
>> Who knows. I try not to pre-judge any of these efforts, I think we
should
>> do what we can to enable experimentation.
>>
>> So, concretely, what could change? The main points of reusability are:
>> - Fatal errors and warnings exit the process without returning control
to
>> the caller
>> - Conflicts over global variables between threads
>>
>> Error recovery is the big imposition here. To avoid a giant rewrite of
>> all error handling code in LLD, I think we should *avoid* returning
failure
>> via the llvm::Error class or std::error_code. We should instead use an
>> approach more like clang, where diagnostics are delivered to a
diagnostic
>> consumer on the side. The success of the link is determined by whether
any
>> errors were reported. Functions may return a simple success boolean in
>> cases where higher level functions need to exit early. This has worked
>> reasonably well for clang. The main failure mode here is that we miss
an
>> error check, and crash or report useless follow-on errors after an
error
>> that would normally have been fatal.
>>
>> Another motivation for all of this is increasing the use of parallelism
>> in LLD. Emitting errors in parallel from threads and then exiting the
>> process is risky business. A new diagnostic context or consumer could
make
>> this more reliable. MLIR has this issue as well, and I believe they use
>> this pattern. They use some kind of thread shard index to order the
>> diagnostics, LLD could do the same.
>>
>> Finally, we'd work to eliminate globals. I think this is mainly a
small
>> matter of programming (SMOP) and doesn't need much discussion,
although the
>> `make` template presents interesting challenges.
>>
>> Thoughts? Tomatoes? Flowers? I apologize for the lack of context links
to
>> the original discussions. It takes more time than I have to dig those
up.
>>
>> Reid
>> _______________________________________________
>> LLVM Developers mailing list
>> llvm-dev at lists.llvm.org
>> https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>>
> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org
> https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20210612/1dee5c86/attachment.html>

Erik McClure via llvm-dev

2021-Jun-12 06:00 UTC

head link

[llvm-dev] RFC: Revisiting LLD-as-a-library design

I use LLVM to compile WebAssembly to native code. The primary use-case for
this is compiling WASM plugins for games - this is what Microsoft Flight
Simulator 2020 uses it for. Using the system linker is not an option on
Windows, which does not ship link.exe by default, making LLD a mandatory
requirement if you are using LLVM in any kind of end-user plugin scenario,
as the average user has not installed Visual Studio.

This puts users of LLVM's library capabilities on windows in an awkward
position, because in order to use LLVM as a library when compiling a
plugin, one must use LLD, which cannot be used as a library. My current
solution is to use LLD as a library anyway and maintain a fork of LLVM with
the various global cleanup bugs patched (most of which have now made it
into stable), along with a helper function that allows me to use LLD to
read out the symbols of a given shared library (which is used to perform
link-time validation of webassembly modules, because LLD makes it difficult
to access any errors that happen).

If LLD wanted to become an actual library, I think it would need a better
method of reporting errors than simply an stdout and stderr stream,
although I don't know what this would look like. It would also be nice for
it to expose the different link stages like LLVM does so that the
application has a bit more control over what's going on. However, I
don't
really have any concrete ideas about what LLD should look like as a
library, only that I would like it to stop crashing when I attempt to use
it as one.

--
Sincerely, Erik McClure


On Fri, Jun 11, 2021 at 8:20 PM Michael Spencer <bigcheesegs at gmail.com>
wrote:
> Adding Erik (not subscribed) who has previously had issues with LLD not
> being a library to provide some additional use cases.
>
> - Michael Spencer
>
>
> On Thu, Jun 10, 2021 at 12:15 PM Reid Kleckner via llvm-dev <
> llvm-dev at lists.llvm.org> wrote:
>
>> Hey all,
>>
>> Long ago, the LLD project contributors decided that they weren't
going to
>> design LLD as a library, which stands in opposition to the way that the
>> rest of LLVM strives to be a reusable library. Part of the reasoning
was
>> that, at the time, LLD wasn't done yet, and the top priority was to
finish
>> making LLD a fast, useful, usable product. If sacrificing reusability
>> helped LLD achieve its project goals, the contributors at the time felt
>> that was the right tradeoff, and that carried the day.
>>
>> However, it is now ${YEAR} 2021, and I think we ought to reconsider
this
>> design decision. LLD was a great success: it works, it is fast, it is
>> simple, many users have adopted it, it has many ports
>> (COFF/ELF/mingw/wasm/new MachO). Today, we have actual users who want
to
>> run the linker as a library, and they aren't satisfied with the
option of
>> launching a child process. Some users are interested in process reuse
as a
>> performance optimization, some are including the linker in the
frontend.
>> Who knows. I try not to pre-judge any of these efforts, I think we
should
>> do what we can to enable experimentation.
>>
>> So, concretely, what could change? The main points of reusability are:
>> - Fatal errors and warnings exit the process without returning control
to
>> the caller
>> - Conflicts over global variables between threads
>>
>> Error recovery is the big imposition here. To avoid a giant rewrite of
>> all error handling code in LLD, I think we should *avoid* returning
failure
>> via the llvm::Error class or std::error_code. We should instead use an
>> approach more like clang, where diagnostics are delivered to a
diagnostic
>> consumer on the side. The success of the link is determined by whether
any
>> errors were reported. Functions may return a simple success boolean in
>> cases where higher level functions need to exit early. This has worked
>> reasonably well for clang. The main failure mode here is that we miss
an
>> error check, and crash or report useless follow-on errors after an
error
>> that would normally have been fatal.
>>
>> Another motivation for all of this is increasing the use of parallelism
>> in LLD. Emitting errors in parallel from threads and then exiting the
>> process is risky business. A new diagnostic context or consumer could
make
>> this more reliable. MLIR has this issue as well, and I believe they use
>> this pattern. They use some kind of thread shard index to order the
>> diagnostics, LLD could do the same.
>>
>> Finally, we'd work to eliminate globals. I think this is mainly a
small
>> matter of programming (SMOP) and doesn't need much discussion,
although the
>> `make` template presents interesting challenges.
>>
>> Thoughts? Tomatoes? Flowers? I apologize for the lack of context links
to
>> the original discussions. It takes more time than I have to dig those
up.
>>
>> Reid
>> _______________________________________________
>> LLVM Developers mailing list
>> llvm-dev at lists.llvm.org
>> https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>>
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20210611/cd9a501a/attachment.html>

llvm dev - Jun 2021 - RFC: Revisiting LLD-as-a-library design

[llvm-dev] RFC: Revisiting LLD-as-a-library design

[llvm-dev] RFC: Revisiting LLD-as-a-library design

[llvm-dev] RFC: Revisiting LLD-as-a-library design