Details
-
Bug
-
Resolution: Fixed
-
Blocker
-
None
-
Lustre 2.8.0
-
Lustre build: https://build.hpdd.intel.com/job/lustre-master/3009
MDSCOUNT=2
-
3
-
9223372036854775807
Description
conf-sanity test 53b failed as follows under DNE configuration:
CMD: shadow-21vm8 /usr/sbin/lctl set_param mds.MDS.mdt.threads_min=8 mds.MDS.mdt.threads_min=8 CMD: shadow-21vm8 /usr/sbin/lctl set_param mds.MDS.mdt.threads_max=142 mds.MDS.mdt.threads_max=142 CMD: shadow-21vm8 lctl get_param -n mds.MDS.mdt.threads_min CMD: shadow-21vm8 lctl get_param -n mds.MDS.mdt.threads_max checking (($tmin2 == ($tmin + $nthrs))) (((8 == (6 + 2))))... checking (($tmax2 == ($tmax - $nthrs))) (((142 == (144 - 2))))... CMD: shadow-21vm8 lctl get_param -n mds.MDS.mdt.threads_started checking (($tstarted >= $tmin2)) (((7 >= 8)))... conf-sanity test_53b: @@@@@@ FAIL: Assertion 25 failed: (($tstarted >= $tmin2)) (expanded: ((7 >= 8)))
Maloo report: https://testing.hpdd.intel.com/test_sets/2a054c68-f25c-11e4-9f61-5254006e85c2
Attachments
Issue Links
- is related to
-
LU-6206 conf-sanity test_53a: Insane OST thread counts
-
- Resolved
-
I think the problem is that the test is increasing threads_min but it doesn't necessarily do anything to trigger the threads to start. The test probably needs to do something like "touch" or similar before sleeping to ensure the service thread is triggered and will check the ptlrpc_threads_enough() condition. The service thread probably handles some RPCs naturally via ping or DLM lock callback similar some of the time, but not consistently, which is why it is failing intermittently.