thr3ads.net - llvm dev - [LLVMdev] lit: deprecating trailing \ in RUN lines [Dec 2013]

If this information is useful, please help other people find it:
Share via:

Alp Toker

2013-Dec-08 19:47 UTC

[LLVMdev] lit: deprecating trailing \ in RUN lines

On 08/12/2013 16:43, Daniel Dunbar wrote:>
>     With one-to-one mapping, it becomes possible to use simple tools
>     like grep to validate common mistakes like %clang / %clang_cc1
>     mixups, a missing -o flag and so on.
>
>     Right now there's no obvious way to do those checks and we've
>     ended up without an easy way to lint for broken tests as a result.
>     Each broken test has a high cost so we need to continually look at
>     ways to improve the situation.
>
>
> Can you elaborate on exactly what kind of checks you want to do?
Basically, it's for asking questions about what's currently tested..

    e.g. /Find me all tests for c++11 exceptions/:

    |git grep 'RUN:.*c++11' -- test/ | grep exceptions|

.. experimenting with new features or refactoring existing tests..

    e.g. /How many tests could we add -fsyntax-only to?/

    |giteach perl -pi -e 's/((RUN:.*%clang_cc1)(.*))$/$2
    -fsyntax-only$3/g unless

m/(-fsyntax-only|-analyze|-arcmt-check|-arcmt-migrate|-o|-emit-llvm|-E|-ast-print|-ast-dump|[|>]|\\$)/'|

    Answer, 405. Then git diff and page through to see if any are
    actionable.

and fixing broken tests..

    eg. /Let's find and fix %clang_cc1 substitution mistakes../

    giteach perl -pi -e 's/(RUN:.*%clang_cc1)[^\s]+/$1/g'

I committed the results of this run just a few minutes ago in r196729 
and r196730.

One was a test that had been disabled since 2007.

Losing 80 columns is a small price to pay if it'll help make tests 
easier to write, understand and validate in my opinion.

(The two editors I use, vim and XCode, both do a good job of virtual 
line wrapping so it turns out long RUN lines aren't that much of a big 
deal in practice. But I can see why it may be a bigger problem with 
editors that don't support wrapping. Is this still a problem in 2013?)

Alp.

-- 
http://www.nuanti.com
the browser experts

-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20131208/d0474880/attachment.html>

Daniel Dunbar

2013-Dec-09 16:59 UTC

head link

[LLVMdev] lit: deprecating trailing \ in RUN lines

Ok, that makes sense. I don't see this as a good enough argument to remove
backslash support though.

For problems like the clang_cc1 substitution mistakes, it would be much
better to just improve the substitution support so that those cause
immediate test failures. I would definitely support a move to make lit's
substitution machinery more strict. For problems like your first search, I
can appreciate how wrapped lines make this less trivial, but this isn't
enough of a use case to motivate removing the feature in my opinion.

My opinion can be summarized as:

1. If you are worried about bogus or broken tests, but you have an
automatic way to detect the brokenness, then the right way to attack this
is to build the check into the test runner so no one ever can write the
test to pass again.

2. If you are trying to do more fuzzy searches that are going to no be
fully automatic, the individual developer can deal with the existence of
wrapped lines. Or we could add a way to have lit run a regexp over all of
the (collapsed) test script lines if this is a really important use case
for you. Either way, its not a good enough argument to remove a useful
feature.

 - Daniel

On Sun, Dec 8, 2013 at 11:47 AM, Alp Toker <alp at nuanti.com> wrote:
>
> On 08/12/2013 16:43, Daniel Dunbar wrote:
>
> With one-to-one mapping, it becomes possible to use simple tools like grep
>> to validate common mistakes like %clang / %clang_cc1 mixups, a missing
-o
>> flag and so on.
>>
>> Right now there's no obvious way to do those checks and we've
ended up
>> without an easy way to lint for broken tests as a result. Each broken
test
>> has a high cost so we need to continually look at ways to improve the
>> situation.
>
>
>  Can you elaborate on exactly what kind of checks you want to do?
>
>
> Basically, it's for asking questions about what's currently
tested..
>
> e.g. *Find me all tests for c++11 exceptions*:
>
> git grep 'RUN:.*c++11' -- test/ | grep exceptions
>
>
> .. experimenting with new features or refactoring existing tests..
>
> e.g. *How many tests could we add -fsyntax-only to?*
>
> giteach perl -pi -e 's/((RUN:.*%clang_cc1)(.*))$/$2 -fsyntax-only$3/g
> unless
>
m/(-fsyntax-only|-analyze|-arcmt-check|-arcmt-migrate|-o|-emit-llvm|-E|-ast-print|-ast-dump|[|>]|\\$)/'
>
> Answer, 405. Then git diff and page through to see if any are actionable.
>
>
> and fixing broken tests..
>
> eg. *Let's find and fix %clang_cc1 substitution mistakes..*
>
> giteach perl -pi -e 's/(RUN:.*%clang_cc1)[^\s]+/$1/g'
>
>
> I committed the results of this run just a few minutes ago in r196729 and
> r196730.
>
> One was a test that had been disabled since 2007.
>
> Losing 80 columns is a small price to pay if it'll help make tests
easier
> to write, understand and validate in my opinion.
>
> (The two editors I use, vim and XCode, both do a good job of virtual line
> wrapping so it turns out long RUN lines aren't that much of a big deal
in
> practice. But I can see why it may be a bigger problem with editors that
> don't support wrapping. Is this still a problem in 2013?)
>
>
> Alp.
>
> -- http://www.nuanti.com
> the browser experts
>
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20131209/78ca5afa/attachment.html>

Alp Toker

2013-Dec-09 18:09 UTC

head link

[LLVMdev] lit: deprecating trailing \ in RUN lines

On 09/12/2013 16:59, Daniel Dunbar wrote:> Ok, that makes sense. I don't see this as a good enough argument to 
> remove backslash support though.
>
> For problems like the clang_cc1 substitution mistakes, it would be 
> much better to just improve the substitution support so that those 
> cause immediate test failures. I would definitely support a move to 
> make lit's substitution machinery more strict.
Totally -- I have a patch for this. Unfortunately the work to add 
BusyBox support for native Windows testing, ninja test driver and lit 
enhanced diagnostics patches all got squashed into a single commit due 
to a screwup and need to be split up before posting :-/

> For problems like your first search, I can appreciate how wrapped 
> lines make this less trivial, but this isn't enough of a use case to 
> motivate removing the feature in my opinion.
>
> My opinion can be summarized as:
>
> 1. If you are worried about bogus or broken tests, but you have an 
> automatic way to detect the brokenness, then the right way to attack 
> this is to build the check into the test runner so no one ever can 
> write the test to pass again.
Already doing this where possible, like r194919 which forbids direct use 
of the frontend in the driver tests.

This is great because it educates patch contributors rather than fixing 
up after the fact but we're obviously limited in the checks we can 
automate without preventing legitimate testing.
>
> 2. If you are trying to do more fuzzy searches that are going to no be 
> fully automatic, the individual developer can deal with the existence 
> of wrapped lines. Or we could add a way to have lit run a regexp over 
> all of the (collapsed) test script lines if this is a really important 
> use case for you. Either way, its not a good enough argument to remove 
> a useful feature.
This hits on the problem precisely.

For any given common mistake 'hypothesis', there are a large number of 
false positives which legitimately test incorrect usage. So the kind of 
validations I'm running are inherently fuzzy.

You're right that it's trivial to /detect/ these by putting a regex just
after the directives are combined.

The crux of the problem is that detection isn't sufficient to achieve 
real fixes on this scale.

There are 26,631 test runs between LLVM and clang. The only practical 
workflow I've found to fix these is to do in-place bulk edit using an 
assortment of command-line tools, then running and review the changes in 
context using word diff. Before this, we had no workflow so it's 
something of a step forward :-)

Some background: I've been working my way down a list of potential 
problems and committed ~150 fixes or cleanups to clang tests in the last 
few weeks, several of which were hiding crasher or invalid codegen 
regressions.

To put that in context, I only reached the letter 'a' in that list of 
potential issues a few days ago -- it's clear that there are more tests 
in a state of disrepair than anyone imagined and a lot of tests we 
though are fine just aren't working.

I like my 80 columns as much as the next guy, but in this case they're 
blocking important work that's already detected hundreds of issues in 
limited trial runs.

So in this light I think we need to take a close look at the 
cost/benefit of continuing to support split RUN lines.

Alp.

>
>  - Daniel
>
>
>
> On Sun, Dec 8, 2013 at 11:47 AM, Alp Toker <alp at nuanti.com 
> <mailto:alp at nuanti.com>> wrote:
>
>
>     On 08/12/2013 16:43, Daniel Dunbar wrote:
>>
>>         With one-to-one mapping, it becomes possible to use simple
>>         tools like grep to validate common mistakes like %clang /
>>         %clang_cc1 mixups, a missing -o flag and so on.
>>
>>         Right now there's no obvious way to do those checks and
we've
>>         ended up without an easy way to lint for broken tests as a
>>         result. Each broken test has a high cost so we need to
>>         continually look at ways to improve the situation.
>>
>>
>>     Can you elaborate on exactly what kind of checks you want to do?
>
>     Basically, it's for asking questions about what's currently
tested..
>
>         e.g. /Find me all tests for c++11 exceptions/:
>
>         |git grep 'RUN:.*c++11' -- test/ | grep exceptions|
>
>
>     .. experimenting with new features or refactoring existing tests..
>
>         e.g. /How many tests could we add -fsyntax-only to?/
>
>         |giteach perl -pi -e 's/((RUN:.*%clang_cc1)(.*))$/$2
>         -fsyntax-only$3/g unless
>        
m/(-fsyntax-only|-analyze|-arcmt-check|-arcmt-migrate|-o|-emit-llvm|-E|-ast-print|-ast-dump|[|>]|\\$)/'|
>
>         Answer, 405. Then git diff and page through to see if any are
>         actionable.
>
>
>     and fixing broken tests..
>
>         eg. /Let's find and fix %clang_cc1 substitution mistakes../
>
>         giteach perl -pi -e 's/(RUN:.*%clang_cc1)[^\s]+/$1/g'
>
>
>     I committed the results of this run just a few minutes ago in
>     r196729 and r196730.
>
>     One was a test that had been disabled since 2007.
>
>     Losing 80 columns is a small price to pay if it'll help make tests
>     easier to write, understand and validate in my opinion.
>
>     (The two editors I use, vim and XCode, both do a good job of
>     virtual line wrapping so it turns out long RUN lines aren't that
>     much of a big deal in practice. But I can see why it may be a
>     bigger problem with editors that don't support wrapping. Is this
>     still a problem in 2013?)
>
>
>     Alp.
>
>     -- 
>     http://www.nuanti.com
>     the browser experts
>
>
-- 
http://www.nuanti.com
the browser experts

Apparently Analagous Threads

Search for more maybe matching threads

llvm dev - Dec 2013 - [LLVMdev] lit: deprecating trailing \ in RUN lines

[LLVMdev] lit: deprecating trailing \ in RUN lines

[LLVMdev] lit: deprecating trailing \ in RUN lines

[LLVMdev] lit: deprecating trailing \ in RUN lines

Apparently Analagous Threads