Test failure on test suite sanity, subtest test_49

Details

    • Type: Bug
    • Resolution: Fixed
    • Priority: Blocker
    • Fix Version/s: Lustre 2.3.0
    • Affects Version/s: Lustre 2.3.0
    • None
    • 3
    • 4506

    Description

      This issue was created by maloo for sarah <sarah@whamcloud.com>

      This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/79266140-dba0-11e1-81e3-52540035b04c.

      The sub-test test_49 failed with the following error:

      test failed to respond and timed out

        Activity

          [LU-1696] Test failure on test suite sanity, subtest test_49
          pjones Peter Jones made changes -
          Fix Version/s New: Lustre 2.3.0 [ 10117 ]
          Resolution New: Fixed [ 1 ]
          Status Original: Open [ 1 ] New: Resolved [ 5 ]
          pjones Peter Jones added a comment -

          Landed for 2.3

          ys Yang Sheng added a comment - Patch uploaded to: http://review.whamcloud.com/3544
          ys Yang Sheng added a comment -

          It looks like 'dd' has hung, causing the following loop to run forever:

                  # loop until dd process exits
                  while ps ax -opid | grep -q $dd_pid; do
                          $LCTL set_param $osc1_mppc=$((RANDOM % 256 + 1))
                          sleep $((RANDOM % 5 + 1))
                  done
          

          But I cannot find a 'dd' process in the stack trace file. So I suspect the cause is simply that the script uses 'grep -q $dd_pid' to match the PID, which may match part of another, longer PID.
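
The suspected partial match can be reproduced without any Lustre setup. The following is a minimal sketch, not the landed patch: the `ps` output is simulated and all PIDs are made up for illustration. It compares the substring match used by the loop with a whole-word match (`grep -w`).

```shell
#!/bin/bash
# Simulated `ps ax -opid` output: a header plus right-aligned PIDs.
# These PIDs are hypothetical and chosen to show the collision.
ps_output='  PID
    1
 1234
51234'

dd_pid=123   # hypothetical PID of a dd process that has already exited

# Substring match, as in the test loop: "123" is found inside "1234",
# so the loop would keep running although PID 123 is gone.
if echo "$ps_output" | grep -q "$dd_pid"; then
    substring_match=yes
else
    substring_match=no
fi

# Whole-word match: -w only accepts "123" when it is not surrounded
# by other digits, so "1234" and "51234" no longer match.
if echo "$ps_output" | grep -qw "$dd_pid"; then
    word_match=yes
else
    word_match=no
fi

echo "substring: $substring_match, whole word: $word_match"
```

With the simulated data this prints `substring: yes, whole word: no`. Checks that avoid text matching altogether, such as `kill -0 $dd_pid` or `ps -p $dd_pid`, are another option. Which approach the actual patch takes is not stated in this ticket.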

          pjones Peter Jones made changes -
          Assignee Original: WC Triage [ wc-triage ] New: Yang Sheng [ ys ]
          pjones Peter Jones added a comment -

          yangsheng will look into this one

          jlevi Jodi Levi (Inactive) made changes -
          Affects Version/s New: Lustre 2.3.0 [ 10117 ]
          Priority Original: Minor [ 4 ] New: Blocker [ 1 ]
          maloo Maloo created issue -

          People

            Assignee: ys Yang Sheng
            Reporter: maloo Maloo