[LU-8333] replay-dual test_21b: can't check if COS works: rename replied w/o COS Created: 27/Jun/16  Updated: 11/Sep/20

Status: Reopened
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.7.0, Lustre 2.9.0, Lustre 2.10.0, Lustre 2.11.0
Fix Version/s: None

Type: Bug Priority: Critical
Reporter: Maloo Assignee: Hongchao Zhang
Resolution: Unresolved Votes: 0
Labels: always_except
Environment:

Hard failover EL7 Server/Client
master, build# 3399


Issue Links:
Duplicate
is duplicated by LU-2230 replay-dual test_21b: @@@@@@ FAIL: Th... Reopened
is duplicated by LU-4104 Failure on test suite replay-dual tes... Resolved
Related
is related to LU-4470 replay-dual test_21b: FAIL: lustre-MD... Resolved
is related to LU-9586 replay-dual test cases 15c 21b remov... Resolved
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for Saurabh Tandan <saurabh.tandan@intel.com>

This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/31581902-3ad4-11e6-bbf5-5254006e85c2.

The sub-test test_21b failed with the following error:

can't check if COS works: rename replied w/o COS

Test log:

Started clients trevis-56vm5: 
CMD: trevis-56vm5 mount | grep /mnt/lustre' '
10.9.6.124@tcp:10.9.6.120@tcp:/lustre on /mnt/lustre type lustre (rw,flock,user_xattr)
CMD: trevis-56vm5 PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/utils/gss:/usr/lib64/lustre/utils:/usr/lib64/qt-3.3/bin:/usr/lib64/compat-openmpi16/bin:/usr/bin:/bin:/usr/sbin:/sbin::/sbin:/bin:/usr/sbin: NAME=autotest_config sh rpc.sh set_default_debug \"-1\" \"all -lnet -lnd -pinger\" 4 
 replay-dual test_21b: @@@@@@ FAIL: can't check if COS works: rename replied w/o COS 

In past 15 days this issue has occurred around 36 times.



 Comments   
Comment by Peter Jones [ 13/Aug/16 ]

Hongchao

Could you please advise on this one?

Thanks

Peter

Comment by Gerrit Updater [ 15/Aug/16 ]

Hongchao Zhang (hongchao.zhang@intel.com) uploaded a new patch: http://review.whamcloud.com/21924
Subject: LU-8333 test: make sure COS is cleared
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: d0f614f780182e3ea56290e215cf5da73faf2cf1

Comment by Gerrit Updater [ 08/Sep/16 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/21924/
Subject: LU-8333 test: make sure COS is cleared
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: ae2ccbc6da32def014463ee98fab76a93835d85d

Comment by Peter Jones [ 08/Sep/16 ]

Landed for 2.9

Comment by Saurabh Tandan (Inactive) [ 07/Nov/16 ]

This issue is still seen on master regularly.
https://testing.hpdd.intel.com/test_sets/c4a7f648-a1c0-11e6-8ed2-5254006e85c2
https://testing.hpdd.intel.com/sub_tests/4cab1870-a2d6-11e6-8986-5254006e85c2
https://testing.hpdd.intel.com/sub_tests/ca5e0b42-a2ae-11e6-8b77-5254006e85c2
https://testing.hpdd.intel.com/sub_tests/05adec2a-a274-11e6-8986-5254006e85c2

Comment by Saurabh Tandan (Inactive) [ 09/Nov/16 ]

Reopening this ticket as the issue is still seen on master consistently. Please refer previous message above for latest failures.

Comment by Gerrit Updater [ 17/Nov/16 ]

Hongchao Zhang (hongchao.zhang@intel.com) uploaded a new patch: http://review.whamcloud.com/23830
Subject: LU-8333 test: debug log
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 4df7fb623fb09fe078d86d51361e1f39e2db792d

Comment by Hongchao Zhang [ 13/Jan/17 ]

I have checked the recent failures and found the transactions of the client2 (intended to introduce the transaction gap)
has been committed before failing over the MDT, which cause the client1 to recover successfully.

I'll consider some new means to test the COS feature.

Comment by Gerrit Updater [ 07/Mar/17 ]

Minh Diep (minh.diep@intel.com) uploaded a new patch: https://review.whamcloud.com/25848
Subject: LU-8333 test: disable replay-dual 21b
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 9d046935201bf2ea6b0e788f16c2d1bdaf843577

Comment by Joseph Gmitter (Inactive) [ 08/Mar/17 ]

Hi Mike,

Can you comment on the validity of this test case in relation to commit on share? Is it a valid test case, providing useful testing?

Thanks.
Joe

Comment by Gerrit Updater [ 16/Mar/17 ]

Hongchao Zhang (hongchao.zhang@intel.com) uploaded a new patch: https://review.whamcloud.com/26030
Subject: LU-8333 test: add more conflict operations
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: dad5c6ded7336ee7dba2a1cf79833840032b7306

Comment by Gerrit Updater [ 14/Apr/17 ]

James Casper (jamesx.casper@intel.com) uploaded a new patch: https://review.whamcloud.com/26631
Subject: LU-8333: Excepts replay-dual test_21b while a new COS check test can be developed
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 95fde52018764c4a88c1d5ff479063631cb9cb6b

Comment by Gerrit Updater [ 22/Apr/17 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/26631/
Subject: LU-8333 test: Add replay-dual 21b to ALWAYS_EXCEPT
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 83296931009c8bdc423a2374c921e610786870c3

Comment by Peter Jones [ 22/Apr/17 ]

Landed for 2.10

Comment by Saurabh Tandan (Inactive) [ 10/Aug/17 ]

Re-opening the ticket. Recently, replay-dual test 21b was run removing it from Always_Except list and the issue still persists.
https://testing.hpdd.intel.com/test_sets/3dc43c8a-7d49-11e7-9ce0-5254006e85c2
Looking at the comments history on this ticket it appears that Hongchao's proposed solution https://review.whamcloud.com/#/c/26030/ never got landed.

Comment by Hongchao Zhang [ 07/Dec/17 ]

the patch https://review.whamcloud.com/#/c/26030/ has been updated.

Comment by Gerrit Updater [ 06/Mar/18 ]

Hongchao Zhang (hongchao.zhang@intel.com) uploaded a new patch: https://review.whamcloud.com/31538
Subject: LU-8333 test: use async_commit_count to test COS
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: c78f0c429012757b1b5d0c6bd5a368557f1d929a

Generated at Sat Feb 10 02:16:37 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.