  1. Lustre
  2. LU-16536

MDS umount can get stuck due to LDLM locks

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Fix Version/s: Lustre 2.16.0
    • Severity: 3

    Description

      The scenario is as follows:

      • rmdir is trying to unlink a striped directory
      • the MDT serving the rmdir holds a set of LDLM locks (from different MDTs)
      • a number of RPCs are sent to the other MDTs to destroy the objects
      • one of the other MDTs umounts, so no reply is sent back
      • this MDT starts to umount: it deactivates OSP#0 (the corresponding RPCs are interrupted) and waits for the LDLM locks in this namespace to be released
      • the lock is still held by the original MDT thread doing the rmdir
      • the other OSPs are still active, trying to reconnect to the umounted MDTs
      • deadlock
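
One way to read the scenario above is as a cycle in a wait-for graph. The following is a minimal sketch of that reading, not Lustre code: the node names are illustrative labels rather than Lustre identifiers, `find_cycle` is a hypothetical helper, and the last edge (the remaining OSPs are only deactivated once the umount thread gets past the lock wait) is an assumption inferred from the description.

```python
# Toy wait-for graph for the umount scenario.
# Each key waits for its value. Names are illustrative, not Lustre identifiers.
WAITS_FOR = {
    "umount thread": "LDLM lock in OSP#0 namespace",
    "LDLM lock in OSP#0 namespace": "rmdir thread",
    "rmdir thread": "RPC via another OSP (target MDT umounted)",
    # The RPC gets no reply, so it can only end when its OSP is deactivated.
    "RPC via another OSP (target MDT umounted)": "deactivation of that OSP",
    # Assumption: umount deactivates the remaining OSPs only after the lock wait.
    "deactivation of that OSP": "umount thread",
}


def find_cycle(waits_for, start):
    """Follow wait edges from start; return the cycle as a list, or None."""
    path = []
    seen = {}  # node -> index in path
    node = start
    while node is not None:
        if node in seen:
            # Came back to a node already on the path: everything from its
            # first occurrence onward forms the cycle.
            return path[seen[node]:]
        seen[node] = len(path)
        path.append(node)
        node = waits_for.get(node)  # None when the node waits for nothing
    return None


cycle = find_cycle(WAITS_FOR, "umount thread")
if cycle:
    print(" -> ".join(cycle + [cycle[0]]))
else:
    print("no cycle")
```

If any single edge is removed from the graph, `find_cycle` returns `None`, which matches the intuition that the hang needs every step in the list to hold at once.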

          People

            bzzz Alex Zhuravlev
            bzzz Alex Zhuravlev