Details
-
Bug
-
Resolution: Cannot Reproduce
-
Critical
-
None
-
Lustre 2.1.6
-
None
-
3
-
13008
Description
Hi,
We are seeing all MDT threads on the MDS stuck in "cfs_waitq_wait". At the same time, the MDS is receiving a high rate of RPC requests (15k/s).
Looking closely at the console and the 'bt' output from crash, we can see that these threads all come through qos_statfs_update(), where they block in l_wait_event and never wake up.
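What is strange is that cfs_time_beforeq_64(max_age, obd->obd_osfs_age) should be true, meaning the cached statfs data should be recent enough that these threads would not need to wait at all.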
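To make the check above concrete, here is a minimal userspace sketch of the freshness comparison. It is not the Lustre source: time_beforeq_64() is a stand-in written in the style of the Linux kernel's wrap-safe time_before_eq64(), and statfs_is_fresh() is a hypothetical helper name used only for illustration.

```c
#include <stdbool.h>
#include <stdint.h>

/*
 * Wrap-safe "t1 <= t2" on 64-bit tick counts.
 * Stand-in for cfs_time_beforeq_64(); assumed to behave like the
 * kernel's time_before_eq64(), not copied from Lustre.
 */
static bool time_beforeq_64(uint64_t t1, uint64_t t2)
{
    return (int64_t)(t2 - t1) >= 0;
}

/*
 * Hypothetical helper: returns true when statfs data stamped at
 * osfs_age is no older than the max_age cutoff, i.e. when no
 * refresh (and therefore no wait) should be needed.
 */
static bool statfs_is_fresh(uint64_t max_age, uint64_t osfs_age)
{
    return time_beforeq_64(max_age, osfs_age);
}
```

With this reading, a thread should only reach the wait when statfs_is_fresh() is false for the values it sees, which is why the stuck threads look unexpected.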
This issue was hit 4 times during February.
Please find attached the dmesg and 'foreach bt' outputs.
Sebastien.