[LU-1193] test script incompatibility when running server as 2.1 and client as 2.2 Created: 06/Mar/12  Updated: 25/May/12  Resolved: 08/May/12

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: Lustre 2.3.0

Type: Bug Priority: Minor
Reporter: Sarah Liu Assignee: Yang Sheng
Resolution: Fixed Votes: 0
Labels: None

Issue Links:
Related
is related to LU-1209 sanity.sh subtest test_133d failed wi... Resolved
is related to LU-664 1.8<->2.1 interop: recovery-small tes... Resolved
Severity: 3
Rank (Obsolete): 4244

 Description   

When running conf-sanity.sh with a 2.1 server and a 2.2 client, sub-test 61 failed because the test script differs between server and client. The client's script was actually run on the server side, which caused the error. After copying conf-sanity from the server to the client, the test passed. A similar issue was also found in some sanity sub-tests, such as 133a, 133d, and 160.



 Comments   
Comment by Peter Jones [ 09/Mar/12 ]

Yangsheng

Could you please look into this one?

Thanks

Peter

Comment by Andreas Dilger [ 13/Mar/12 ]

This is causing many tests (14% or more) to fail during autotest. It makes sense to add a simple check to each of the failing subtests to verify whether the server is capable of running this test properly.

The easiest way would be something like "do_facet mds lctl get_param version" and check if it is new enough.

A better solution would be to have a test-specific check, like for sanity test_133d to see if the MDS has the right proc stats or not. In other cases it might not be so easy and a version check may be needed.
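
As a rough illustration of the version check suggested above, the comparison could look like the sketch below. The helper names version_to_code and server_at_least are hypothetical and are not taken from the Lustre test framework. The sketch also assumes the server version is already available as a plain "major.minor.patch" string.

```shell
# Hypothetical helper: convert "major.minor.patch" into one comparable integer.
# Missing fields count as 0; a non-numeric suffix such as "-RC1" is dropped.
version_to_code() {
    echo "$1" | {
        IFS=. read -r major minor patch rest
        major=${major%%[!0-9]*}
        minor=${minor%%[!0-9]*}
        patch=${patch%%[!0-9]*}
        echo $(( (${major:-0} << 16) | (${minor:-0} << 8) | ${patch:-0} ))
    }
}

# Hypothetical helper: succeed if server version $1 is at least version $2.
server_at_least() {
    [ "$(version_to_code "$1")" -ge "$(version_to_code "$2")" ]
}

# Example: a 2.1.0 server does not satisfy a test that needs 2.2.0.
if server_at_least 2.1.0 2.2.0; then
    echo "run test"
else
    echo "skip test: server too old"
fi
```

In a real subtest the first argument would come from the "do_facet mds lctl get_param version" command mentioned above. Parsing that output is left out here because its exact format is not shown in this ticket.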

Comment by Yang Sheng [ 15/Mar/12 ]

OK, so we have the list below:

conf-sanity-61
sanity-133a,133d,160

Please comment with any other tests that are not on the list.

Comment by Sarah Liu [ 21/Mar/12 ]

sanity-225a and sanity-225b: these two tests should not be run on 2.1.x, as they will cause an oops.

Comment by Sarah Liu [ 21/Mar/12 ]

recovery-small: subtests 100 to 105 are new in the 2.2 script and should not be run under 2.1.

Comment by Andreas Dilger [ 02/Apr/12 ]

Yang Sheng,
having a list is the starting point, but what still needs to be done is to fix the test scripts so that they are skipped if the server does not have the right functionality to run the test from the client. This can be checked at the start of these failing tests by looking at "lctl get_param mdc.*.connect_flags" (for features that have a connect flag) or by "do_facet $SINGLEMDS {some check}" for other features. The check might be looking at the presence of "lctl list_param mdt.*.rename_stats" for sanity.sh test_133d, or simply "lctl get_param version" for others.

Having just a list of failing tests here in Jira does not stop those tests from failing, and it pollutes the test results with failures, which wastes everyone's time.
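
A minimal sketch of the connect-flag check described in this comment is shown below. The helper names has_connect_flag and run_if_flag are hypothetical, and the sample flag text is made up for illustration. It assumes the output of "lctl get_param mdc.*.connect_flags" has already been captured into a variable and that each flag name appears in it as a separate word.

```shell
# Hypothetical helper: succeed if flag name $2 occurs as a whole word in $1,
# where $1 holds text captured from "lctl get_param mdc.*.connect_flags".
has_connect_flag() {
    printf '%s\n' "$1" | grep -qw -- "$2"
}

# Hypothetical wrapper: run command $3 only if the server advertises flag $2,
# otherwise report a skip instead of letting the test fail.
run_if_flag() {
    if has_connect_flag "$1" "$2"; then
        "$3"
    else
        echo "SKIP: server lacks $2"
    fi
}

# Made-up sample text standing in for real connect_flags output.
sample_flags=$(printf 'flags=0x1\nversion\nacl\n')

run_if_flag "$sample_flags" acl true
run_if_flag "$sample_flags" nosuchflag true
```

For features without a connect flag, the same skip pattern could wrap the exit status of a "do_facet $SINGLEMDS" check such as the "lctl list_param mdt.*.rename_stats" example above, assuming that command fails when the parameter is absent.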

Comment by Chris Gearing (Inactive) [ 02/Apr/12 ]

Yang Sheng,

Could you create a new ticket for each of these tests, or possibly for groups of tests? We don't group source code issues like this and shouldn't do so with test code issues either.

Thanks

Chris

Comment by Yang Sheng [ 08/May/12 ]

Closing this; please reopen if any further work is needed.

Generated at Sat Feb 10 01:14:23 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.