  1. Lustre
  2. LU-4498

MDT thread hung, ls fails on directory

Details

    • Bug
    • Resolution: Duplicate
    • Major
    • None
    • Lustre 2.1.6
    • None
    • 3
    • 12303

    Description

      IU is seeing an issue where running ls on a particular file causes clients to be evicted. It appears that a hung MDT thread is holding a lock on the file. After the MDT is rebooted, listing the directory and the file works fine.

      We were able to capture client debug logs and a backtrace of all threads from the running system, but due to an issue with STONITH, we were unable to get a good vmcore. Also, when we tried to get debug logs from the MDT, the log overflowed, even with a 20GB buffer.

      We are currently waiting for the issue to reappear and will get debug logs on a quiesced system, as well as a good vmcore.

      I'll upload the logs we have. Is there anything else we should be looking to get?

          People

            bobijam Zhenyu Xu
            kitwestneat Kit Westneat (Inactive)