Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-11171

parallel-scale-nfs* running racer against wrong directory

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: Lustre 2.12.0
    • Fix Version/s: Lustre 2.12.0
    • Labels:
    • Severity:
      3
    • Rank (Obsolete):
      9223372036854775807

      Description

      This issue was created by maloo for James Nunez <james.a.nunez@intel.com>

      This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/c386aaae-8b55-11e8-9028-52540065bddc

      Since the patch for LU-11045 test: use provided directory in racer/racer.sh, commit d6a3908fee28, landed, we see parallel-scale-nfsv3 and parallel-scale-nfv4 test suites failing. racer uses the first input variable as the directory to run racer in, parallel-scale-nfs’ routine racer_on_nfs() is not calling racer correctly.

       127 test_racer_on_nfs() {
       128         $racer $CLIENTS
       129 }
       130 run_test racer_on_nfs "racer on NFS client"
      

      We need to call racer with $TESTDIR.

      parallel-scale-nfsv3 and parallel-scale-nfv4 test suites started failing on 2018-07-19 with no individual test failing, but the whole test suite is marked as failed. I think racer_on_nfs() calling racer with no directory is what is causing these failures.

      In the suite_log for https://testing.whamcloud.com/test_sets/75546c6e-8dd4-11e8-8ee3-52540065bddc, the only signs of a failure are

      layout: raid0 raid0 pfl pfl pfl flr flr flr
      layout: raid0 raid0 pfl pfl pfl flr flr flr
      layout: raid0 raid0 pfl pfl pfl flr flr flr
       racer cleanup
      ./file_create.sh: line 1: kill: (3030) - No such process
      ./file_create.sh: line 1: kill: (3044) - No such process
      ./file_create.sh: line 1: kill: (3045) - No such process
        Trace dump:
        = ./file_create.sh:1:main()
      parallel-scale-nfsv4: FAIL: test-framework exiting on error
        Trace dump:
        = ./file_create.sh:1:main()
      parallel-scale-nfsv4: FAIL: test-framework exiting on error
        Trace dump:
        = ./file_create.sh:1:main()
      parallel-scale-nfsv4: FAIL: test-framework exiting on error
      

      In the output for racer_on_nfs(), we also see that we are using the wrong file system for this test

      there should be NO racer processes:
      USER       PID %CPU %MEM    VSZ   RSS TTY      STAT START   TIME COMMAND
      Filesystem     1K-blocks    Used Available Use% Mounted on
      /dev/vda1       20511312 2448832  16997520  13% /
      We survived /usr/lib64/lustre/tests/racer/racer.sh for 300 seconds.
      

      Logs for more of these failures are at
      https://testing.whamcloud.com/test_sets/5b0dee6e-8b6f-11e8-b0aa-52540065bddc
      https://testing.whamcloud.com/test_sets/8d0547e8-8d61-11e8-87f3-52540065bddc

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                jamesanunez James Nunez
                Reporter:
                maloo Maloo
              • Votes:
                0 Vote for this issue
                Watchers:
                3 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: