Details
-
Bug
-
Resolution: Fixed
-
Minor
-
Lustre 2.12.0
-
3
-
9223372036854775807
Description
This issue was created by maloo for James Nunez <james.a.nunez@intel.com>
This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/c386aaae-8b55-11e8-9028-52540065bddc
Since the patch for LU-11045 test: use provided directory in racer/racer.sh, commit d6a3908fee28, landed, we see parallel-scale-nfsv3 and parallel-scale-nfv4 test suites failing. racer uses the first input variable as the directory to run racer in, parallel-scale-nfs’ routine racer_on_nfs() is not calling racer correctly.
127 test_racer_on_nfs() {
128 $racer $CLIENTS
129 }
130 run_test racer_on_nfs "racer on NFS client"
We need to call racer with $TESTDIR.
parallel-scale-nfsv3 and parallel-scale-nfv4 test suites started failing on 2018-07-19 with no individual test failing, but the whole test suite is marked as failed. I think racer_on_nfs() calling racer with no directory is what is causing these failures.
In the suite_log for https://testing.whamcloud.com/test_sets/75546c6e-8dd4-11e8-8ee3-52540065bddc, the only signs of a failure are
layout: raid0 raid0 pfl pfl pfl flr flr flr layout: raid0 raid0 pfl pfl pfl flr flr flr layout: raid0 raid0 pfl pfl pfl flr flr flr racer cleanup ./file_create.sh: line 1: kill: (3030) - No such process ./file_create.sh: line 1: kill: (3044) - No such process ./file_create.sh: line 1: kill: (3045) - No such process Trace dump: = ./file_create.sh:1:main() parallel-scale-nfsv4: FAIL: test-framework exiting on error Trace dump: = ./file_create.sh:1:main() parallel-scale-nfsv4: FAIL: test-framework exiting on error Trace dump: = ./file_create.sh:1:main() parallel-scale-nfsv4: FAIL: test-framework exiting on error
In the output for racer_on_nfs(), we also see that we are using the wrong file system for this test
there should be NO racer processes: USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND Filesystem 1K-blocks Used Available Use% Mounted on /dev/vda1 20511312 2448832 16997520 13% / We survived /usr/lib64/lustre/tests/racer/racer.sh for 300 seconds.
Logs for more of these failures are at
https://testing.whamcloud.com/test_sets/5b0dee6e-8b6f-11e8-b0aa-52540065bddc
https://testing.whamcloud.com/test_sets/8d0547e8-8d61-11e8-87f3-52540065bddc
Attachments
Issue Links
- mentioned in
-
Page No Confluence page found with the given URL.