
replay-single test_74: Timeout occurred after 151 mins, last suite running was replay-single, restarting cluster to continue tests


Details

    • Bug
    • Resolution: Unresolved
    • Minor
    • None
    • Lustre 2.11.0
    • None
    • trevis, full
      servers: CentOS7.4, ldiskfs, branch master, v2.10.54, b3652
      clients: SLES12sp2, branch master, v2.10.54, b3652
    • 3
    • 9223372036854775807

    Description

      https://testing.hpdd.intel.com/test_sessions/7e242182-7767-4925-9974-77f1a02ec4f8

      No process hang, LBUG, panic, or "BUG:" message was found in the logs. The timeout occurs while the clients are being stopped, or shortly afterwards.

      From test_log:

      Stopping clients: trevis-4vm10,trevis-4vm9 /mnt/lustre (opts:)
      

      From client console:

      [ 5518.605357] Lustre: DEBUG MARKER: == replay-single test 74: Ensure applications don't fail waiting for OST recovery ==================== 01:04:51 (1507881891)
      [ 5518.764057] Lustre: DEBUG MARKER: running=$(grep -c /mnt/lustre' ' /proc/mounts);
      [ 5518.764057] if [ $running -ne 0 ] ; then
      [ 5518.764057] echo Stopping client $(hostname) /mnt/lustre opts:;
      [ 5518.764057] lsof /mnt/lustre || need_kill=no;
      [ 5518.764057] if [ x != x -a x$need_kill != xno ]; then
      [ 5518.764057]     pids=$(lsof -t /mnt/lustre | sort -u);
      [ 5518.764057]     if 
      [ 5519.407774] Lustre: Unmounted lustre-client
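
      The log check described above (no process hang, LBUG, panic, or "BUG:") could be repeated on other console logs with a small script. The following is only a sketch: the function name and the pattern list are assumptions made for illustration, not part of the Lustre test framework, and the "blocked for more than N seconds" pattern assumes the usual Linux hung-task message.

```python
import re

# Assumed failure signatures, based on the markers named in the description.
# The hung-task pattern assumes the standard Linux kernel message wording.
FAILURE_PATTERNS = [
    re.compile(r"\bLBUG\b"),
    re.compile(r"\bBUG:"),
    re.compile(r"kernel panic", re.IGNORECASE),
    re.compile(r"blocked for more than \d+ seconds"),
]


def find_failure_lines(log_text):
    """Return (line_number, line) pairs for lines matching any signature."""
    hits = []
    for number, line in enumerate(log_text.splitlines(), start=1):
        if any(pattern.search(line) for pattern in FAILURE_PATTERNS):
            hits.append((number, line))
    return hits


# Two lines copied from the client console excerpt in this ticket.
console_excerpt = (
    "[ 5518.605357] Lustre: DEBUG MARKER: == replay-single test 74: "
    "Ensure applications don't fail waiting for OST recovery "
    "==================== 01:04:51 (1507881891)\n"
    "[ 5519.407774] Lustre: Unmounted lustre-client\n"
)

if __name__ == "__main__":
    for number, line in find_failure_lines(console_excerpt):
        print(f"{number}: {line}")
```

      Run on the excerpt above, the script reports no matching lines, which is consistent with the observation that the client unmounted cleanly before the timeout.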
      

            People

              wc-triage WC Triage
              jcasper James Casper