Details
Bug
Resolution: Duplicate
Major
None
Lustre 2.1.6
None
3
12303
Description
IU is hitting an issue where running ls on a particular file causes clients to be evicted. It appears that a hung MDT thread is holding a lock on the file. After the MDT is rebooted, listing the directory and the file works fine.
We were able to capture client debug logs and a backtrace of all threads from the running system, but due to an issue with STONITH we were unable to get a good vmcore. Also, when we tried to collect debug logs from the MDT, the log overflowed, even with a 20 GB buffer.
We are currently waiting for the issue to reappear, at which point we will collect debug logs on a quiesced system, as well as a good vmcore.
I'll upload the logs we have. Is there anything else we should be looking to get?