Details
-
Bug
-
Resolution: Duplicate
-
Major
-
Lustre 2.1.0
-
None
-
2.6.18-194.17.1.el5
-
3
-
10397
Description
The following deadlock was hit when I was running mmap tests:
mmap test thread: pagefault -> lock page -> release dlm lock -> cancel dlm lock -> cl_lock_mutex_get;
bl_ast handler thread: cancel dlm lock -> cl_lock_mutex_get -> flush pages -> lock page;
And because of this deadlock, ll_imp_inval thread is blocked on cl_lock_mutex_get, so the client eviction can never be finished. I think it's the root cause of LU-180, but I'm not 100 percent sure because they didn't provide the stack trace to prove it yet.
The stack trace is attached.