Roman Grigoryev
2012-Mar-30 07:40 UTC
[Lustre-devel] [Twg] your opinion about testing improvements
(Sorry for cross-posting, I haven't gotten any answers in twg.)

Hi all,

I'm working in the testing team at Xyratex. After the time which I (and our team) have spent on testing and automation activities, I would like to ask the community's opinion about two testing problems and a suggestion. This is not a complete list of testing problems, but I think it is good enough to start a discussion.

Problem 1

Currently Lustre and its tests live in one code space, are built at the same time, and often have specific dependencies between test and code. This situation directly affects:

1) interoperability testing between different versions (because testing is started from a client whose test framework differs from the server's, and the client remotely executes the test framework on the other nodes as if they ran the same version; just copying the tests over to equalize them does not work when the difference between versions is large);

2) testing the tests themselves, which is not simple, especially in automation. For example, a bug is fixed and a test for it is added. Executing that test on an old revision (probably a previous release) should show a failed result, but with a big difference between the version where the bug was fixed and the version where the test is executed, the test framework can fail to start at all.

Possible solution: split Lustre and the Lustre tests at the code and build levels. This means that Lustre and the tests would no longer be tied to the same code revision, only connected logically, e.g. via keywords. At the same time an abstraction level should be added to the test framework which allows executing Lustre utilities from different versions of Lustre.

Problem 2

(To avoid terminology problems, here I call sanity a test suite, 130 a test, and 130c and 130a test cases.)

Different test cases ending with a letter (e.g. 130c) have different ideas of dependencies. Some test cases depend on previous test cases, some don't. All of them can currently be executed with the "ONLY" parameter, and all of them get a separate item in the result YAML file, just like standalone tests which don't have lettered test cases (e.g. sanity 129). Also, tests which have test cases but no body of their own can be executed with the ONLY parameter (but don't get their own result entry).

So, logically, every test which can be executed via ONLY should not depend on other tests. But we have tests which do depend on others. Moreover, some developers prefer to consider test cases as steps of one full test.

What are the entities that I call "test cases" and "tests" from your point of view? The answer to this question affects automated test execution and test development, and may require some test-framework changes.

--
Thanks,
Roman
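As an illustration of the "connected only by keywords" idea, here is a minimal, purely hypothetical sketch: the helper names (test_requires, installed_lustre_version, version_to_code) do not exist in the current test-framework and are only meant to show a test declaring the minimum Lustre version it needs, with the framework deciding whether to run or skip it based on the version reported by lctl.

```bash
#!/bin/bash
# Hypothetical sketch: these helpers are NOT part of test-framework.sh;
# they only illustrate tying tests to Lustre versions via declared keywords
# instead of via a shared source tree.

# Turn "major.minor.patch" into a comparable integer, e.g. 2.1.1 -> 0x020101.
version_to_code() {
    local major minor patch
    IFS=. read -r major minor patch _ <<< "$1"
    echo $(( (major << 16) | (minor << 8) | patch ))
}

# Version of the locally installed Lustre, as reported by lctl.
installed_lustre_version() {
    local v
    v=$(lctl get_param -n version 2>/dev/null | awk '/^lustre:/ { print $2 }')
    version_to_code "$v"
}

# Keyword-style annotation: skip the calling test if Lustre is too old.
test_requires() {
    local min=$1
    if [ "$(installed_lustre_version)" -lt "$(version_to_code "$min")" ]; then
        echo "SKIP: requires Lustre >= $min"
        return 1
    fi
    return 0
}

# Usage inside a test body (the test number is just an example):
test_999() {
    test_requires 2.2.0 || return 0
    # ... actual test steps ...
}
```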
Oleg Drokin
2012-Apr-02 03:08 UTC
[Lustre-devel] [Twg] your opinion about testing improvements
Hello!

On Mar 30, 2012, at 3:40 AM, Roman Grigoryev wrote:
> 2) testing the tests themselves, which is not simple, especially in
> automation. For example, a bug is fixed and a test for it is added.
> Executing that test on an old revision (probably a previous release)
> should show a failed result, but with a big difference between the
> version where the bug was fixed and the version where the test is
> executed, the test framework can fail to start at all.

I am not quite sure why you would want to constantly fail a test that is known not to work with a particular release due to a missing bugfix. I think it's enough if a developer (or somebody else) runs the test manually once on an unfixed codebase to make sure the test does fail without the fix.

The issue of running an older release against a newer one is a real one, but the truth is, when you run e.g. 1.8 vs 2.x, it's not just the tests that are different, the init code is different too, so it's not just a matter of separating the tests subdir into its own repository. On our side we just note the known broken tests for such configurations and ignore the failures, for lack of a better solution.

> Different test cases ending with a letter (e.g. 130c) have different
> ideas of dependencies. Some test cases depend on previous test cases,
> some don't.

Ideally dependencies should be eliminated (in my opinion, anyway).

Bye,
Oleg
--
Oleg Drokin
Senior Software Engineer
Whamcloud, Inc.
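For readers unfamiliar with how "noting known broken tests" is usually done: the suites carry an exclude list that skips listed subtests instead of running them. The sketch below is a simplified approximation of that mechanism; the variable and function names echo what sanity.sh/test-framework.sh do but are not verbatim code.

```bash
# Near the top of a suite: subtests listed here are skipped rather than run
# and reported as failures.  A caller can extend the list from the
# environment for a known-broken interoperability configuration, e.g.:
#   SANITY_EXCEPT="130c 130d" sh sanity.sh
ALWAYS_EXCEPT="$SANITY_EXCEPT"

# Simplified version of the check the framework performs before each test.
excepted() {
    local t
    for t in $ALWAYS_EXCEPT $EXCEPT; do
        [ "$t" = "$1" ] && return 0
    done
    return 1
}

run_one_test() {
    local num=$1; shift
    if excepted "$num"; then
        echo "SKIP test_$num: listed in ALWAYS_EXCEPT/EXCEPT"
        return 0
    fi
    "test_$num" "$@"
}
```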
Andreas Dilger
2012-Apr-02 05:33 UTC
[Lustre-devel] [Twg] your opinion about testing improvements
On 2012-04-01, at 9:08 PM, Oleg Drokin wrote:
> On Mar 30, 2012, at 3:40 AM, Roman Grigoryev wrote:
>> 2) testing the tests themselves, which is not simple, especially in
>> automation. For example, a bug is fixed and a test for it is added.
>> Executing that test on an old revision (probably a previous release)
>> should show a failed result, but with a big difference between the
>> version where the bug was fixed and the version where the test is
>> executed, the test framework can fail to start at all.
>
> I am not quite sure why you would want to constantly fail a test that is
> known not to work with a particular release due to a missing bugfix.
> I think it's enough if a developer (or somebody else) runs the test
> manually once on an unfixed codebase to make sure the test does fail
> without the fix.

I think it makes sense to be able to skip a test that is failing for versions of Lustre older than X, for cases where the test is exercising some fix on the server. We _do_ run interoperability tests and hit these failures, and it is much better to skip the test with a clear message instead of marking the test as failed.

Probably the easiest solution is for such tests to explicitly check the version of the server, with a new helper function like "skip_old_version" or similar.

Tests that are checking new features (as opposed to bugs) should normally be able to check via "lctl get_param {mdc,osc}.*.connect_flags" output whether the server supports a given feature or not.

> The issue of running an older release against a newer one is a real one,
> but the truth is, when you run e.g. 1.8 vs 2.x, it's not just the tests
> that are different, the init code is different too, so it's not just a
> matter of separating the tests subdir into its own repository.
> On our side we just note the known broken tests for such configurations
> and ignore the failures, for lack of a better solution.

As mentioned earlier - the presence of known failing tests causes confusion, and it would be better to annotate these tests in a clear manner by skipping them instead of just knowing that they will fail.

>> Different test cases ending with a letter (e.g. 130c) have different
>> ideas of dependencies. Some test cases depend on previous test cases,
>> some don't.
>
> Ideally dependencies should be eliminated (in my opinion, anyway).

Agreed - all of the sub-tests should be able to run independently, even though they are normally run in order.

Cheers, Andreas
--
Andreas Dilger                       Whamcloud, Inc.
Principal Lustre Engineer            http://www.whamcloud.com/
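To make the suggestion concrete, here is a hedged sketch of what a "skip_old_version"-style helper could look like. The name is only Andreas's proposal, not an existing function; skip(), version_code() and lustre_version_code() are assumed to behave like the helpers of the same names in test-framework.sh, and the exact signatures should be treated as assumptions rather than the real API.

```bash
# Sketch only: "skip_old_version" is a proposed name; skip(), version_code()
# and lustre_version_code() are assumed to work like the test-framework.sh
# helpers of the same names.

# Skip the current test unless facet $1 (e.g. mds) runs at least version $2.
skip_old_version() {
    local facet=$1 need=$2
    if [ "$(lustre_version_code "$facet")" -lt "$(version_code "$need")" ]; then
        skip "$facet runs a Lustre version older than $need"
        return 1
    fi
    return 0
}

# Usage in a test that exercises a server-side bugfix (numbers are examples):
test_130c() {
    skip_old_version mds 2.2.0 || return 0
    # ... test body ...
}
run_test 130c "needs the server-side fix first shipped in 2.2.0"

# For new features rather than bugfixes, the connect flags can be checked
# instead, as suggested above ("some_feature_flag" is a placeholder):
server_supports() {
    lctl get_param -n mdc.*.connect_flags osc.*.connect_flags 2>/dev/null |
        grep -qw "$1"
}
# e.g.: server_supports some_feature_flag || { skip "feature not supported"; return 0; }
```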
Roman Grigoryev
2012-Apr-02 09:43 UTC
[Lustre-devel] [Twg] your opinion about testing improvements
On 04/02/2012 09:33 AM, Andreas Dilger wrote:
> On 2012-04-01, at 9:08 PM, Oleg Drokin wrote:
>> On Mar 30, 2012, at 3:40 AM, Roman Grigoryev wrote:
>>> 2) testing the tests themselves, which is not simple, especially
>>> in automation. For example, a bug is fixed and a test for it is
>>> added. Executing that test on an old revision (probably a previous
>>> release) should show a failed result, but with a big difference
>>> between the version where the bug was fixed and the version where
>>> the test is executed, the test framework can fail to start at all.
>>
>> I am not quite sure why you would want to constantly fail a test
>> that is known not to work with a particular release due to a
>> missing bugfix. I think it's enough if a developer (or somebody
>> else) runs the test manually once on an unfixed codebase to make
>> sure the test does fail without the fix.

There can be more than one reason to execute tests which are expected to fail. I don't mean an obligation to execute these tests every time, but an execution on an automated platform in specific cases. The main problem now is compatibility testing (see below, Oleg). There is also the problematic coupling between the test code and the Lustre code, given that test exclusion is done in the test code.

> I think it makes sense to be able to skip a test that is failing for
> versions of Lustre older than X, for cases where the test is
> exercising some fix on the server. We _do_ run interoperability
> tests and hit these failures, and it is much better to skip the test
> with a clear message instead of marking the test as failed.

I absolutely agree with you about skipping (maybe it would be a good idea to somehow mark such a test not just as "skipped" but as "incompatible"?).

> Probably the easiest solution is for such tests to explicitly check
> the version of the server, with a new helper function like
> "skip_old_version" or similar.

Maybe we can just use some kind of keywords to tell the framework to ignore new tests on old setups. I don't think adding more logic at the test level is a good idea when that logic only processes meta-information. A test could simply carry an attribute which is then processed by the test framework.

> Tests that are checking new features (as opposed to bugs) should
> normally be able to check via "lctl get_param
> {mdc,osc}.*.connect_flags" output whether the server supports a given
> feature or not.

But the situation of executing new tests against an old server can also arise when you install new tests on an old Lustre. Oleg described below the situation where it is pretty hard to execute new tests (2.x) on old clients (1.8.x).

>> The issue of running an older release against a newer one is a real
>> one, but the truth is, when you run e.g. 1.8 vs 2.x, it's not just
>> the tests that are different, the init code is different too, so
>> it's not just a matter of separating the tests subdir into its own
>> repository.

Oleg, I'm answering here because you are in 'To'. I used the term "test-framework" for the init code, and I agree with you about the reasons. My idea of separating the tests is mostly not about the code tree but about logical separation, mostly in build, dependencies and versioning (although separating them in the code tree would force that too). I think it would be good to have lustre_tests version 1.2.3 installable on top of any Lustre version.
I think it could also be interesting for developers who make fixes for old or private branches to simply get new tests ready for them.

>> On our side we just note the known broken tests for such
>> configurations and ignore the failures, for lack of a better
>> solution.

Could you please publish that list?

> As mentioned earlier - the presence of known failing tests causes
> confusion, and it would be better to annotate these tests in a clear
> manner by skipping them instead of just knowing that they will fail.
>
>>> Different test cases ending with a letter (e.g. 130c) have
>>> different ideas of dependencies. Some test cases depend on
>>> previous test cases, some don't.
>>
>> Ideally dependencies should be eliminated (in my opinion, anyway).
>
> Agreed - all of the sub-tests should be able to run independently,
> even though they are normally run in order.

Maybe it is a good idea to define and publish rules like these:

1) a test (test scenario) must have a number-only name (1, 2, 3 .. 110 ... 999);
2) a test case (test step) must have a number+letter index (1f, 2b ... 99c).

A test can be executed via ONLY. Test cases can be executed only as part of their test. Tests must be independent; test cases can have dependencies. A test defines init and cleanup which are executed before and after its test cases (not around every single test case, but around the full group).

I think these rules could somewhat fix the current situation with dependencies without many changes in the tests (a rough sketch of the resulting structure follows below).

--
Thanks,
Roman
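A minimal sketch of the structure these rules imply. The test numbers, file names and helper functions below are invented for illustration (not taken from an existing suite); $DIR and error() are assumed to follow the usual test-framework.sh conventions. Only the naming rules and the shared init/cleanup around a group of test cases are the point.

```bash
#!/bin/bash
# Illustration only: numbers and bodies are made up; $DIR and error() are
# assumed to behave like the usual test-framework.sh conventions.

# Rule 1: a number-only test is independent and may be selected with ONLY.
test_131() {
    touch "$DIR/f131" || error "create failed"
    rm -f "$DIR/f131"
}

# Rule 2: lettered test cases are steps of one test.  test_130 owns the
# shared init/cleanup; 130a/130b may depend on each other but are never
# run outside test_130.
setup_130() {
    FILE130="$DIR/f130"
    dd if=/dev/zero of="$FILE130" bs=1M count=1 || error "setup 130 failed"
}
cleanup_130() {
    rm -f "$FILE130"
}
test_130a() {   # step 1: relies on the file created by setup_130
    [ -s "$FILE130" ] || error "130a: file missing"
}
test_130b() {   # step 2: may rely on state left behind by 130a
    cat "$FILE130" > /dev/null || error "130b: read failed"
}
test_130() {
    setup_130
    local rc=0
    test_130a && test_130b || rc=$?
    cleanup_130
    return $rc
}

# Under these rules ONLY=130 runs the whole group (init, 130a, 130b, cleanup),
# ONLY=131 runs the independent test, and 130a/130b are never selected directly.
```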