Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-10102

conf-sanity test_105: test failed to respond and timed out

    XMLWordPrintable

Details

    • Bug
    • Resolution: Unresolved
    • Minor
    • None
    • Lustre 2.11.0
    • None
    • trevis, full
      servers: el7.4, ldiskfs, branch master, v2.10.53.1, b3642
      clients: sles12.2, branch master, v2.10.53.1, b3642
    • 3
    • 9223372036854775807

    Description

      https://testing.hpdd.intel.com/test_sessions/bce14d16-91a3-441a-b3ec-8a1817b00025

      Nothing found in any of the console or dmesg logs. It appears that the process of
      stopping the clients is not able to finish. This might be SLES related.

      From suite_log:

      umount lustre on /mnt/lustre.....
      CMD: trevis-3vm1 grep -c /mnt/lustre' ' /proc/mounts
      Stopping client trevis-3vm1 /mnt/lustre (opts:)
      CMD: trevis-3vm1 lsof -t /mnt/lustre
      CMD: trevis-3vm1 umount  /mnt/lustre 2>&1
      stop ost1 service on trevis-3vm3
      CMD: trevis-3vm3 grep -c /mnt/lustre-ost1' ' /proc/mounts
      Stopping /mnt/lustre-ost1 (opts:-f) on trevis-3vm3
      CMD: trevis-3vm3 umount -f /mnt/lustre-ost1
      CMD: trevis-3vm3 lsmod | grep lnet > /dev/null && lctl dl | grep ' ST '
      stop mds service on trevis-3vm4
      CMD: trevis-3vm4 grep -c /mnt/lustre-mds1' ' /proc/mounts
      Stopping /mnt/lustre-mds1 (opts:-f) on trevis-3vm4
      CMD: trevis-3vm4 umount -f /mnt/lustre-mds1
      CMD: trevis-3vm4 lsmod | grep lnet > /dev/null && lctl dl | grep ' ST '
      CMD: trevis-3vm1 lsmod | grep lnet > /dev/null && lctl dl | grep ' ST '
      modules unloaded.
      Stopping clients: trevis-3vm1,trevis-3vm2 /mnt/lustre (opts:)
      CMD: trevis-3vm1,trevis-3vm2 running=\$(grep -c /mnt/lustre' ' /proc/mounts);
      if [ \$running -ne 0 ] ; then
      echo Stopping client \$(hostname) /mnt/lustre opts:;
      lsof /mnt/lustre || need_kill=no;
      if [ x != x -a x\$need_kill != xno ]; then
          pids=\$(lsof -t /mnt/lustre | sort -u);
          if [ -n \"\$pids\" ]; then
                   kill -9 \$pids;
          fi
      fi;
      while umount  /mnt/lustre 2>&1 | grep -q busy; do
          echo /mnt/lustre is still busy, wait one second && sleep 1;
      done;
      fi
      

      Attachments

        Issue Links

          Activity

            People

              wc-triage WC Triage
              jcasper James Casper
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated: