[LU-9860] conf-sanity test_33a fails with “mount -t lustre failed“ for separate MDS and MGS Created: 10/Aug/17  Updated: 26/Aug/19  Resolved: 17/Jun/19

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.11.0
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: James Nunez (Inactive) Assignee: James Nunez (Inactive)
Resolution: Fixed Votes: 0
Labels: tests
Environment:

Separate MGS and MDS or separate MGT and MDT


Issue Links:
Related
is related to LU-10717 several conf-sanity tests failed: FAI... Resolved
is related to LU-8688 All Lustre test suites should run/PAS... Open
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

conf-sanity test 33a fails when run on a Lustre file system configured with a separate MDS and MGS, even with a separate MDT and MGT on the same node. The error message is

conf-sanity test_33a: @@@@@@ FAIL: mount -t lustre failed 

In the test log, we see that mounting the MDS for the newly created file system fails

CMD: onyx-50vm7 mkdir -p /mnt/lustre-fs2mds; mount -t lustre   		                   /dev/lvm-Role_MDS/S1 /mnt/lustre-fs2mds
onyx-50vm7: mount.lustre: mount /dev/mapper/lvm--Role_MDS-S1 at /mnt/lustre-fs2mds failed: Operation already in progress
onyx-50vm7: The target service is already running. (/dev/mapper/lvm--Role_MDS-S1)

This failure looks like it is due to the mgs flags used when the MDT is formatted; “—mgs” flag. When an MGS already exists, specifying that this node will be the MGS for the new file system causes problems.

In addition, the lctl call in the following code needs to be run on the MGS

2418         do_facet $SINGLEMDS "$LCTL conf_param $FSNAME2.sys.timeout=200" ||
2419                 error "$LCTL conf_param $FSNAME2.sys.timeout=200 failed"

We see similar failures for conf-sanity tests 43b, 53b, 54b

Logs that capture test 33a failures are at
https://testing.hpdd.intel.com/test_sets/5176f130-729c-11e7-a0a2-5254006e85c2
https://testing.hpdd.intel.com/test_sets/8df40f4c-729e-11e7-a0a2-5254006e85c2

Note: This tickets description was modified to state the problem more clearly and not only subscribe a solution.



 Comments   
Comment by Gerrit Updater [ 11/Aug/17 ]

James Nunez (james.a.nunez@intel.com) uploaded a new patch: https://review.whamcloud.com/28478
Subject: LU-9860 tests: Run command on MGS for conf-sanity 33a
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: cf90d5c6dfd4b2acf39ab7dbddf120e39384fcd2

Comment by Gerrit Updater [ 11/Aug/17 ]

James Nunez (james.a.nunez@intel.com) uploaded a new patch: https://review.whamcloud.com/28497
Subject: LU-9860 tests: Add conf-sanity tests to ALWAYS_EXCEPT list
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: e3ace781ac6a4430b8071fbfb940fbb916389190

Comment by Gerrit Updater [ 28/Aug/17 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/28497/
Subject: LU-9860 tests: Add conf-sanity tests to ALWAYS_EXCEPT list
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 5297bfa671bd2cbb18fb3188c4d9e3b8edf3284a

Comment by James Nunez (Inactive) [ 28/Aug/17 ]

The patch to add conf-sanity tests to the ALWAYS_EXCEPT list has landed to master. Let's keep this ticket open until we get a fix for conf-sanity tests 33a, 43b, 53b, 54b.

Comment by Gerrit Updater [ 24/Oct/17 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/28478/
Subject: LU-9860 tests: Run command on MGS for conf-sanity 33a
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: d4f7bb22d87ba2b19e5ce56e1e9d019dccd0b2e0

Comment by Gerrit Updater [ 22/Nov/17 ]

James Nunez (james.a.nunez@intel.com) uploaded a new patch: https://review.whamcloud.com/30216
Subject: LU-9860 tests: Add conf-sanity tests to ALWAYS_EXCEPT list
Project: fs/lustre-release
Branch: b2_10
Current Patch Set: 1
Commit: 03450e06859b9f66298f3d616c67d4a4221effc4

Comment by Gerrit Updater [ 19/Dec/17 ]

John L. Hammond (john.hammond@intel.com) merged in patch https://review.whamcloud.com/30216/
Subject: LU-9860 tests: Add conf-sanity tests to ALWAYS_EXCEPT list
Project: fs/lustre-release
Branch: b2_10
Current Patch Set:
Commit: ad56044ff664e28643968b227a0b71d09baf7cfb

Comment by James Nunez (Inactive) [ 17/Jun/19 ]

All tests that were added to the ALWAYS_EXCEPT list have been removed or other tickets have been opened to track the issue.

Comment by Andreas Dilger [ 26/Aug/19 ]

This was resolved with the landing of patch https://review.whamcloud.com/33589 "LU-10717 tests: tests should not start mgs"

Generated at Sat Feb 10 02:29:58 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.