Chris Lattner via llvm-dev
2017-Jul-06 00:43 UTC
[llvm-dev] Should we split llvm Support and ADT?
Sure, I guess that splitting the arrayref/stringref headers out is a fine first step. -Chris> On Jul 5, 2017, at 5:07 PM, Zachary Turner <zturner at google.com> wrote: > > Re-writing StringRef / ArrayRef etc to use the exact same API is a good idea long term, but there's a lot of ugly messy details that need to be dealt with. There's thousands of uses of take_front / drop_front, etc that have to be converted. Then there's some methods that aren't in string_view at all, like consume_integer(), consume_front(), etc that would have to be raised up to global functions in StringExtras. All of this can certainly be done, but it's going to be a *ton* of churn and hours spent to get it all STL-ified. > > Do you consider this a blocker for doing such a split? Would it make sense to do it incrementally where we first just move StringRef et all wholesale, and then incrementally work to STL-ify the interface? > > On Wed, Jul 5, 2017 at 5:01 PM Chris Lattner <clattner at nondot.org <mailto:clattner at nondot.org>> wrote: > Yes, that proposal makes sense to me: the split would be between things that *are* known to be subsumed into later versions of C++, and therefore are a compatibility library. > > What do you think about this as an implementation approach: > > - Rewrite StringRef (et al) to use the exact same APIs as std::string_view. Keep the StringRef name for now. > - When cmake detects that C++’17 mode is supported, the build would set a -D flag. > - StringRef.h would just include the C++’17 header and typedef StringRef to that type. > - When we start requiring C++’17, someone can “StringRef”->RAUW(“std::string_view”) and nuke the header. > > This allows us to have a clean path out of these custom types, and makes it very clear that these headers are compatibility shims that go away in the future. It also makes it clear what the division is. > > -Chris > > > >> On Jul 5, 2017, at 10:38 AM, Zachary Turner <zturner at google.com <mailto:zturner at google.com>> wrote: >> >> So, here is an example of where I think a split would be really helpful. >> >> https://reviews.llvm.org/D34667 <https://reviews.llvm.org/D34667> >> >> This code would benefit vastly even from just being able to use StringRef and ArrayRef. We have other cases as well where we export some code that cannot depend on the rest of LLVM. >> >> Thinking about it some, StringRef, ArrayRef, and various other things like STLExtras and iterator.h basically can be summarized as "things that are either already already planned for, or wouldn't be entirely out of place in the STL itself". For example, StringRef is std::string_view. ArrayRef is std::array_view. iterator_facade_base is a better version of std::iterator. >> >> So I would drop my suggestion to split the libraries in such a way that it might benefit TableGen, and instead re-word my suggestion to include only classes such as StringRef, ArrayRef, and other STL-like utilities that can benefit utilities like our demangler etc that cannot depend on the rest of LLVM. If and when we ever require C++17 for building LLVM (a long ways away, obviously, but we might as well be forward thinking), we would certainly be able to use std::string_view and std::array_view in the demangler. So splitting things in a way such as this makes long term sense IMO. >> >> On Sun, Jun 4, 2017 at 10:50 AM Zachary Turner <zturner at google.com <mailto:zturner at google.com>> wrote: >> Fair enough, i sort of regret mentioning that specific method of splitting originally. >> >> For the record, i think any splitting should make sense on its own merit without considering tablegen, and hopefully the end result of "tablegen eventually depends on less stuff" would happen naturally >> On Sun, Jun 4, 2017 at 10:37 AM Chris Lattner <clattner at nondot.org <mailto:clattner at nondot.org>> wrote: >> >> > On May 26, 2017, at 5:47 PM, Zachary Turner via llvm-dev <llvm-dev at lists.llvm.org <mailto:llvm-dev at lists.llvm.org>> wrote: >> > >> > Changing a header file somewhere and having to spend 10 minutes waiting for a build leads to a lot of wasted developer time. >> > >> > The real culprit here is tablegen. Can we split support and ADT into two - the parts that tablegen depends on and the parts that it doesn’t? >> >> In all the comments downthread, I think there is one thing that hasn't been mentioned: doing a split like this makes tblgen evolution more difficult. If libsupport was split into “used by tblgen” and “not used by tblgen” sections, and then a new tblgen feature needs to use other parts of libsupport, they’d have to be moved into the “used by tblgen” directory. >> >> Splitting libsupport as a whole out into its own llvm subproject has come up many times though, and does make a lot of sense. >> >> -Chris >> >> >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20170705/fe280d9f/attachment.html>
Zachary Turner via llvm-dev
2017-Jul-06 01:26 UTC
[llvm-dev] Should we split llvm Support and ADT?
You mentioned that a good line to draw is one where we're adding things that are *known* to be added to c++ future. How strictly do we want to enforce this? There are lots of things have equally broad utility, but aren't necessarily known to be added to c++ in the future. For example, all of MathExtras and StringExtras, many member functions of StringRef that are not in string_view, etc. can we still have these in the top level compatibility library? We could still aim for interfaces that 1-to-1 match STL, but it would nice if we could have some equally low level extras to enhance these classes On Wed, Jul 5, 2017 at 5:44 PM Chris Lattner <clattner at nondot.org> wrote:> Sure, I guess that splitting the arrayref/stringref headers out is a fine > first step. > > -Chris > > On Jul 5, 2017, at 5:07 PM, Zachary Turner <zturner at google.com> wrote: > > Re-writing StringRef / ArrayRef etc to use the exact same API is a good > idea long term, but there's a lot of ugly messy details that need to be > dealt with. There's thousands of uses of take_front / drop_front, etc that > have to be converted. Then there's some methods that aren't in string_view > at all, like consume_integer(), consume_front(), etc that would have to be > raised up to global functions in StringExtras. All of this can certainly > be done, but it's going to be a *ton* of churn and hours spent to get it > all STL-ified. > > Do you consider this a blocker for doing such a split? Would it make > sense to do it incrementally where we first just move StringRef et all > wholesale, and then incrementally work to STL-ify the interface? > > On Wed, Jul 5, 2017 at 5:01 PM Chris Lattner <clattner at nondot.org> wrote: > >> Yes, that proposal makes sense to me: the split would be between things >> that *are* known to be subsumed into later versions of C++, and therefore >> are a compatibility library. >> >> What do you think about this as an implementation approach: >> >> - Rewrite StringRef (et al) to use the exact same APIs as >> std::string_view. Keep the StringRef name for now. >> - When cmake detects that C++’17 mode is supported, the build would set >> a -D flag. >> - StringRef.h would just include the C++’17 header and typedef StringRef >> to that type. >> - When we start requiring C++’17, someone can >> “StringRef”->RAUW(“std::string_view”) and nuke the header. >> >> This allows us to have a clean path out of these custom types, and makes >> it very clear that these headers are compatibility shims that go away in >> the future. It also makes it clear what the division is. >> >> -Chris >> >> >> >> On Jul 5, 2017, at 10:38 AM, Zachary Turner <zturner at google.com> wrote: >> >> So, here is an example of where I think a split would be really helpful. >> >> https://reviews.llvm.org/D34667 >> >> This code would benefit vastly even from just being able to use StringRef >> and ArrayRef. We have other cases as well where we export some code that >> cannot depend on the rest of LLVM. >> >> Thinking about it some, StringRef, ArrayRef, and various other things >> like STLExtras and iterator.h basically can be summarized as "things that >> are either already already planned for, or wouldn't be entirely out of >> place in the STL itself". For example, StringRef is std::string_view. >> ArrayRef is std::array_view. iterator_facade_base is a better version of >> std::iterator. >> >> So I would drop my suggestion to split the libraries in such a way that >> it might benefit TableGen, and instead re-word my suggestion to include >> only classes such as StringRef, ArrayRef, and other STL-like utilities that >> can benefit utilities like our demangler etc that cannot depend on the rest >> of LLVM. If and when we ever require C++17 for building LLVM (a long ways >> away, obviously, but we might as well be forward thinking), we would >> certainly be able to use std::string_view and std::array_view in the >> demangler. So splitting things in a way such as this makes long term sense >> IMO. >> >> On Sun, Jun 4, 2017 at 10:50 AM Zachary Turner <zturner at google.com> >> wrote: >> >>> Fair enough, i sort of regret mentioning that specific method of >>> splitting originally. >>> >>> For the record, i think any splitting should make sense on its own merit >>> without considering tablegen, and hopefully the end result of "tablegen >>> eventually depends on less stuff" would happen naturally >>> On Sun, Jun 4, 2017 at 10:37 AM Chris Lattner <clattner at nondot.org> >>> wrote: >>> >>>> >>>> > On May 26, 2017, at 5:47 PM, Zachary Turner via llvm-dev < >>>> llvm-dev at lists.llvm.org> wrote: >>>> > >>>> > Changing a header file somewhere and having to spend 10 minutes >>>> waiting for a build leads to a lot of wasted developer time. >>>> > >>>> > The real culprit here is tablegen. Can we split support and ADT into >>>> two - the parts that tablegen depends on and the parts that it doesn’t? >>>> >>>> In all the comments downthread, I think there is one thing that hasn't >>>> been mentioned: doing a split like this makes tblgen evolution more >>>> difficult. If libsupport was split into “used by tblgen” and “not used by >>>> tblgen” sections, and then a new tblgen feature needs to use other parts of >>>> libsupport, they’d have to be moved into the “used by tblgen” directory. >>>> >>>> Splitting libsupport as a whole out into its own llvm subproject has >>>> come up many times though, and does make a lot of sense. >>>> >>>> -Chris >>>> >>>> >>>> >> >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20170706/9b605e16/attachment-0001.html>
Chandler Carruth via llvm-dev
2017-Jul-06 01:46 UTC
[llvm-dev] Should we split llvm Support and ADT?
Having watched a similar library go through this exact evolution, I really doubt we want to make any split around "things known to be in C++ in the future"... It turns out that this is nearly impossible to predict and precludes a tremendous amount of useful utilities. For example, there is no indication that the range helpers LLVM provides will ever end up in C++'s standard library, but they certainly seem useful for the demangler (the concrete use case cited). What is the concrete problem with just linking the support library, in all its glory, into the demangler? Why *shouldn't* we do that? I feel like that has gotten lost (for me).... On Wed, Jul 5, 2017 at 6:27 PM Zachary Turner via llvm-dev < llvm-dev at lists.llvm.org> wrote:> You mentioned that a good line to draw is one where we're adding things > that are *known* to be added to c++ future. How strictly do we want to > enforce this? There are lots of things have equally broad utility, but > aren't necessarily known to be added to c++ in the future. > > For example, all of MathExtras and StringExtras, many member functions of > StringRef that are not in string_view, etc. can we still have these in the > top level compatibility library? > > We could still aim for interfaces that 1-to-1 match STL, but it would nice > if we could have some equally low level extras to enhance these classes > On Wed, Jul 5, 2017 at 5:44 PM Chris Lattner <clattner at nondot.org> wrote: > >> Sure, I guess that splitting the arrayref/stringref headers out is a fine >> first step. >> >> -Chris >> >> On Jul 5, 2017, at 5:07 PM, Zachary Turner <zturner at google.com> wrote: >> >> Re-writing StringRef / ArrayRef etc to use the exact same API is a good >> idea long term, but there's a lot of ugly messy details that need to be >> dealt with. There's thousands of uses of take_front / drop_front, etc that >> have to be converted. Then there's some methods that aren't in string_view >> at all, like consume_integer(), consume_front(), etc that would have to be >> raised up to global functions in StringExtras. All of this can certainly >> be done, but it's going to be a *ton* of churn and hours spent to get it >> all STL-ified. >> >> Do you consider this a blocker for doing such a split? Would it make >> sense to do it incrementally where we first just move StringRef et all >> wholesale, and then incrementally work to STL-ify the interface? >> >> On Wed, Jul 5, 2017 at 5:01 PM Chris Lattner <clattner at nondot.org> wrote: >> >>> Yes, that proposal makes sense to me: the split would be between things >>> that *are* known to be subsumed into later versions of C++, and therefore >>> are a compatibility library. >>> >>> What do you think about this as an implementation approach: >>> >>> - Rewrite StringRef (et al) to use the exact same APIs as >>> std::string_view. Keep the StringRef name for now. >>> - When cmake detects that C++’17 mode is supported, the build would set >>> a -D flag. >>> - StringRef.h would just include the C++’17 header and typedef >>> StringRef to that type. >>> - When we start requiring C++’17, someone can >>> “StringRef”->RAUW(“std::string_view”) and nuke the header. >>> >>> This allows us to have a clean path out of these custom types, and makes >>> it very clear that these headers are compatibility shims that go away in >>> the future. It also makes it clear what the division is. >>> >>> -Chris >>> >>> >>> >>> On Jul 5, 2017, at 10:38 AM, Zachary Turner <zturner at google.com> wrote: >>> >>> So, here is an example of where I think a split would be really helpful. >>> >>> https://reviews.llvm.org/D34667 >>> >>> This code would benefit vastly even from just being able to use >>> StringRef and ArrayRef. We have other cases as well where we export some >>> code that cannot depend on the rest of LLVM. >>> >>> Thinking about it some, StringRef, ArrayRef, and various other things >>> like STLExtras and iterator.h basically can be summarized as "things that >>> are either already already planned for, or wouldn't be entirely out of >>> place in the STL itself". For example, StringRef is std::string_view. >>> ArrayRef is std::array_view. iterator_facade_base is a better version of >>> std::iterator. >>> >>> So I would drop my suggestion to split the libraries in such a way that >>> it might benefit TableGen, and instead re-word my suggestion to include >>> only classes such as StringRef, ArrayRef, and other STL-like utilities that >>> can benefit utilities like our demangler etc that cannot depend on the rest >>> of LLVM. If and when we ever require C++17 for building LLVM (a long ways >>> away, obviously, but we might as well be forward thinking), we would >>> certainly be able to use std::string_view and std::array_view in the >>> demangler. So splitting things in a way such as this makes long term sense >>> IMO. >>> >>> On Sun, Jun 4, 2017 at 10:50 AM Zachary Turner <zturner at google.com> >>> wrote: >>> >>>> Fair enough, i sort of regret mentioning that specific method of >>>> splitting originally. >>>> >>>> For the record, i think any splitting should make sense on its own >>>> merit without considering tablegen, and hopefully the end result of >>>> "tablegen eventually depends on less stuff" would happen naturally >>>> On Sun, Jun 4, 2017 at 10:37 AM Chris Lattner <clattner at nondot.org> >>>> wrote: >>>> >>>>> >>>>> > On May 26, 2017, at 5:47 PM, Zachary Turner via llvm-dev < >>>>> llvm-dev at lists.llvm.org> wrote: >>>>> > >>>>> > Changing a header file somewhere and having to spend 10 minutes >>>>> waiting for a build leads to a lot of wasted developer time. >>>>> > >>>>> > The real culprit here is tablegen. Can we split support and ADT >>>>> into two - the parts that tablegen depends on and the parts that it doesn’t? >>>>> >>>>> In all the comments downthread, I think there is one thing that hasn't >>>>> been mentioned: doing a split like this makes tblgen evolution more >>>>> difficult. If libsupport was split into “used by tblgen” and “not used by >>>>> tblgen” sections, and then a new tblgen feature needs to use other parts of >>>>> libsupport, they’d have to be moved into the “used by tblgen” directory. >>>>> >>>>> Splitting libsupport as a whole out into its own llvm subproject has >>>>> come up many times though, and does make a lot of sense. >>>>> >>>>> -Chris >>>>> >>>>> >>>>> >>> >> _______________________________________________ > LLVM Developers mailing list > llvm-dev at lists.llvm.org > http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20170706/f3b7122c/attachment.html>
Chris Lattner via llvm-dev
2017-Jul-06 16:25 UTC
[llvm-dev] Should we split llvm Support and ADT?
> On Jul 5, 2017, at 6:26 PM, Zachary Turner <zturner at google.com> wrote: > > You mentioned that a good line to draw is one where we're adding things that are *known* to be added to c++ future. How strictly do we want to enforce this? There are lots of things have equally broad utility, but aren't necessarily known to be added to c++ in the future.In my opinion, this should be very strictly enforced. We need a black and white test for “what goes where”. The “already accepted into a future standard” metric satisfies that. I’d even go so far as to say that stuff in StringRef that isn’t in string_view (e.g. the atoi stuff) should be split out (e.g. to StringExtras) before the move. -Chris