[LU-8972] conf-sanity test_101: File hasn't object on OST Created: 24/Dec/16  Updated: 11/Jun/18  Resolved: 11/Jun/18

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.9.0
Fix Version/s: Lustre 2.12.0

Type: Bug Priority: Major
Reporter: Maloo Assignee: Alex Zhuravlev
Resolution: Fixed Votes: 0
Labels: None

Issue Links:
Duplicate
Related
is related to LU-8562 osp_precreate_cleanup_orphans/osp_pre... Resolved
is related to LU-9285 revert LU-8367 and LU-8972 Resolved
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/57cc8712-c9db-11e6-af18-5254006e85c2.

The sub-test test_101 failed with the following error:

File hasn't object on OST

Info required for matching: conf-sanity 101

 

This is quite new issue which starts to happen since yesterday, it can be related to some recent commits.



 Comments   
Comment by Mikhail Pershin [ 24/Dec/16 ]

I see that this issue happens quite often right now in Maloo. Failure rate is about 50% right now

Comment by Mikhail Pershin [ 24/Dec/16 ]

failed test 101 was introduced in LU-8562

Comment by Gerrit Updater [ 25/Dec/16 ]

Oleg Drokin (oleg.drokin@intel.com) uploaded a new patch: https://review.whamcloud.com/24517
Subject: LU-8972 tests: Disable conf-sanity test 101
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 12a9eee4334df25f1445a53e5b544de561c33a1b

Comment by Gerrit Updater [ 27/Dec/16 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/24517/
Subject: LU-8972 tests: Disable conf-sanity test 101
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: fa722ccf04fe6c386628ea5a575165c173643800

Comment by Bob Glossman (Inactive) [ 27/Dec/16 ]

another on master:
https://testing.hpdd.intel.com/test_sets/08853232-cc1d-11e6-bb5d-5254006e85c2

Comment by Ned Bass [ 28/Dec/16 ]

I'm disappointed that https://review.whamcloud.com/24517/ was fast-tracked through the landing process with no explanation or discussion. The lack of transparency is a bit alarming given that the failing test is designed to catch filesystem data loss. To make matters worse, the test results are hidden behind a login screen so there is no public way to discern why you might have decided to disable the test.

Comment by Joseph Gmitter (Inactive) [ 28/Dec/16 ]

Assigning to James to reproduce in manual mode and validate where the test is going wrong to identify next steps.

Comment by Gerrit Updater [ 25/Jan/17 ]

Alex Zhuravlev (alexey.zhuravlev@intel.com) uploaded a new patch: https://review.whamcloud.com/25079
Subject: LU-8972 osp: skip subsequent orphan cleanups
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: c2c5dcb106efdf290a93e7c91dcf48a09f5512b6

Comment by Joseph Gmitter (Inactive) [ 25/Jan/17 ]

Reassigning to Alex since he has a patch in flight for the issue.

Comment by Gerrit Updater [ 31/Jan/17 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/25079/
Subject: LU-8972 osp: skip subsequent orphan cleanups
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 6f56f71b407a8c14db4c2accd37da5b4feecde1a

Comment by Peter Jones [ 31/Jan/17 ]

Landed for 2.10

Comment by Gerrit Updater [ 01/Mar/17 ]

Hongchao Zhang (hongchao.zhang@intel.com) uploaded a new patch: https://review.whamcloud.com/25687
Subject: LU-8972 osp: delete orphans when precreate failed
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: e1ed33d0adf87cf85e7ee645d68dfafa05e8ce6c

Comment by John Hammond [ 09/Mar/17 ]

It would be nice to have a better description of the problem here and why it's not so easy to fix.

Comment by Gerrit Updater [ 10/Mar/17 ]

Alex Zhuravlev (alexey.zhuravlev@intel.com) uploaded a new patch: https://review.whamcloud.com/25924
Subject: Revert "LU-8972 osp: skip subsequent orphan cleanups"
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 26f13abe4326d641e9e98f339400c26560117b78

Comment by Andreas Dilger [ 29/May/17 ]

The conf-sanity test_101 is still in ALWAYS_EXCEPT, so this needs to be fixed before closing this ticket.

Comment by Gerrit Updater [ 01/May/18 ]

James Nunez (james.a.nunez@intel.com) uploaded a new patch: https://review.whamcloud.com/32220
Subject: LU-8972 tests: remove conf-sanity test from ALWAYS_EXCEPT
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 9673a14d4427f3a107272d16e498aec2e282901a

Comment by Gerrit Updater [ 07/Jun/18 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/32220/
Subject: LU-8972 tests: remove conf-sanity test from ALWAYS_EXCEPT
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 60c5a6fd35d4d819b61b4f58b33fdd3baefd52e4

Comment by James Nunez (Inactive) [ 11/Jun/18 ]

Patch to remove conf-sanity from the ALWAYS_EXCEPT list landed to master. Resolving issue.

Generated at Sat Feb 10 02:22:08 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.