[LU-7005] conf-sanity test_50i: lustre-MDT0001-osp-MDT0000:osp_attr_get update error Created: 13/Aug/15  Updated: 09/Sep/16  Resolved: 09/Oct/15

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: Lustre 2.8.0

Type: Bug
Priority: Major
Reporter: Maloo
Assignee: Lai Siyao
Resolution: Fixed
Votes: 0
Labels: dne2

Issue Links:
Related
is related to LU-6831 The ticket for tracking all DNE2 bugs Reopened
is related to LU-6586 "lctl conf_param testfs-MDT0001.mdc.a... Resolved
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for Andreas Dilger <andreas.dilger@intel.com>

This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/ed5c043a-fa55-11e4-8c8b-5254006e85c2.

The conf-sanity test_50i that was landed as part of LU-6586 is failing about 50% of the time (7 failures, 8 passes, 21 skips). I'm going to disable the test rather than reverting the whole patch.

The sub-test test_50i failed with the following error:

08:14:15:Lustre: setting import lustre-MDT0001_UUID INACTIVE by administrator request
08:14:15:Lustre: Skipped 4 previous similar messages
08:14:15:LustreError: 7648:0:(osp_object.c:586:osp_attr_get()) lustre-MDT0001-osp-MDT0002:osp_attr_get update error [0x200000009:0x1:0x0]: rc = -108
08:14:15:LustreError: 7648:0:(lod_sub_object.c:957:lod_sub_prep_llog()) lustre-MDT0002-mdtlov: can't get id from catalogs: rc = -108
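
For triage, the source function and return code can be pulled out of console lines like the two above. The following is a minimal sketch, assuming the message layout seen in this log. The regular expression and the `parse_lustre_error` helper are illustrative and not part of Lustre. On Linux, errno 108 is ESHUTDOWN, which is consistent with the import having just been set INACTIVE.

```python
import errno
import re

# Layout assumed from the console lines quoted in this ticket:
#   HH:MM:SS:LustreError: pid:0:(file.c:line:func()) message: rc = N
LUSTRE_ERROR_RE = re.compile(
    r"LustreError: (?P<pid>\d+):\d+:"
    r"\((?P<file>[\w.]+):(?P<line>\d+):(?P<func>\w+)\(\)\) "
    r"(?P<msg>.*): rc = (?P<rc>-?\d+)$"
)


def parse_lustre_error(line):
    """Return a dict describing one LustreError line, or None if it does not match."""
    match = LUSTRE_ERROR_RE.search(line.strip())
    if match is None:
        return None
    rc = int(match.group("rc"))
    return {
        "file": match.group("file"),
        "line": int(match.group("line")),
        "func": match.group("func"),
        "msg": match.group("msg"),
        "rc": rc,
        # Symbolic errno name for the local platform; values differ between OSes.
        "errno_name": errno.errorcode.get(-rc, "unknown"),
    }


sample = (
    "08:14:15:LustreError: 7648:0:(osp_object.c:586:osp_attr_get()) "
    "lustre-MDT0001-osp-MDT0002:osp_attr_get update error "
    "[0x200000009:0x1:0x0]: rc = -108"
)
parsed = parse_lustre_error(sample)
print(parsed["func"], parsed["rc"], parsed["errno_name"])
```

The symbolic name is looked up on the machine running the script, so it should only be trusted when that machine uses the same errno numbering as the Lustre server.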

This failure was introduced with the original patch for LU-6586.

Info required for matching: conf-sanity 50i



 Comments   
Comment by Gerrit Updater [ 13/Aug/15 ]

Andreas Dilger (andreas.dilger@intel.com) uploaded a new patch: http://review.whamcloud.com/15980
Subject: LU-7005 tests: disable conf-sanity test_50i
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 8cb53eeb012c164076a0fcc452f108b3f49929d2

Comment by Gerrit Updater [ 13/Aug/15 ]

Andreas Dilger (andreas.dilger@intel.com) merged in patch http://review.whamcloud.com/15980/
Subject: LU-7005 tests: disable conf-sanity test_50i
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 4d4d00c26f248bdcb9e486589f9b4f5cd0723bb5

Comment by Di Wang [ 13/Aug/15 ]

Hmm, this is actually due to a bug in the test script. After "lctl conf_param testfs-MDT0001.mdc.active=1", we should wait until the connections between the clients and MDT0001 are fully recovered, and only then create the file. I will cook up a patch.
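
The wait described here amounts to polling a state until it reaches the desired value or a deadline passes. The following is a minimal sketch of that pattern only, not the code in the patch. `wait_for_state` and the `read_state` callable are hypothetical names, and "FULL" is assumed to be the state that indicates a recovered import. In a real test, `read_state` would query the client import; the sketch stubs it out.

```python
import time


def wait_for_state(read_state, wanted="FULL", timeout=60.0, interval=1.0,
                   sleep=time.sleep, clock=time.monotonic):
    """Poll read_state() until it returns `wanted` or `timeout` seconds elapse.

    Returns True if the wanted state was observed, False on timeout.
    """
    deadline = clock() + timeout
    while True:
        if read_state() == wanted:
            return True
        if clock() >= deadline:
            return False
        sleep(interval)


# Stubbed reader: the intermediate state names are placeholders for illustration.
states = iter(["DISCONN", "CONNECTING", "FULL"])
recovered = wait_for_state(lambda: next(states), timeout=5.0, interval=0.0)
print("import recovered:", recovered)
```

Bounding the wait with a deadline matters here: a test that creates the file too early hits the -108 error, while a test that waits without a limit can hang the whole run.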

Comment by Gerrit Updater [ 13/Aug/15 ]

wangdi (di.wang@intel.com) uploaded a new patch: http://review.whamcloud.com/15983
Subject: LU-7005 tests: wait client imports fully recovered
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 473f6368fd39c17a6fe77f4887783e58f389d0dc

Comment by Di Wang [ 14/Aug/15 ]

Lai: Could you please help update this patch? Please see my comments on the patch.

Comment by Richard Henwood (Inactive) [ 03/Sep/15 ]

Hi Lai;

Is there any progress on this issue?

Comment by Lai Siyao [ 10/Sep/15 ]

The previous fix failed because of timeout issues; the patch has been updated to avoid the timeout.

Comment by Richard Henwood (Inactive) [ 18/Sep/15 ]

Makes sense!

Is this still an open issue?

Comment by Gerrit Updater [ 09/Oct/15 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/15983/
Subject: LU-7005 tests: wait client imports fully recovered
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 7b569574a484bb781ed5796040e0eb357aaeefb9

Comment by Peter Jones [ 09/Oct/15 ]

Landed for 2.8

Generated at Sat Feb 10 02:05:10 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.