Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-16536

MDS umount can get stuck due to LDLM locks

Details

    • Bug
    • Resolution: Fixed
    • Minor
    • Lustre 2.16.0
    • None
    • None
    • 3
    • 9223372036854775807

    Description

      the scenario is the following:

      • rmdir is trying to unlink a striped dir
      • a set of LDLM locks (from different MDT) are held by MDT
      • number of RPCs are sent to another MDTs to destroy the objects
      • another MDT umounts and no reply is sent back
      • this MDT starts to umount, deactivate OSP#0 (corresponding RPCs are interrupted) and waiting for LDLM lock in this namespace to release
      • the lock is still held by the original MDT thread doing rmdir
      • other OSPs are still active trying to reconnect to umounted MDTs
      • deadlock

      Attachments

        Issue Links

          Activity

            [LU-16536] MDS umount can get stuck due to LDLM locks
            pjones Peter Jones added a comment -

            Landed for 2.16

            pjones Peter Jones added a comment - Landed for 2.16

            "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/49925/
            Subject: LU-16536 osp: don't cleanup ldlm in precleanup phase
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: eed4d4c7523c26cfc5bc230986d96b2acf152dee

            gerrit Gerrit Updater added a comment - "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/49925/ Subject: LU-16536 osp: don't cleanup ldlm in precleanup phase Project: fs/lustre-release Branch: master Current Patch Set: Commit: eed4d4c7523c26cfc5bc230986d96b2acf152dee

            "Alex Zhuravlev <bzzz@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/49925
            Subject: LU-16536 osp: don't cleanup ldlm in precleanup phase
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: fcec71d596988513e1ba841eda30a7336705c400

            gerrit Gerrit Updater added a comment - "Alex Zhuravlev <bzzz@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/49925 Subject: LU-16536 osp: don't cleanup ldlm in precleanup phase Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: fcec71d596988513e1ba841eda30a7336705c400

            People

              bzzz Alex Zhuravlev
              bzzz Alex Zhuravlev
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: