
Many MDS service threads blocked in ldlm_completion_ast()

Details

    • Type: Bug
    • Resolution: Won't Fix
    • Priority: Blocker
    • Fix Version/s: None
    • Affects Version/s: Lustre 2.4.2
    • Severity: 3

    Description

      Our production MDS systems occasionally end up with many service threads blocked in ldlm_completion_ast(). The details were described in LU-4579, but that issue was closed when the patch that fixed how timeouts are reported landed.

      When this happens, client access hangs and the MDS appears completely idle.
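
      For context on why the MDS appears idle while this happens: the enqueueing service thread simply sleeps in a completion wait until the conflicting lock goes away, and when the wait times out it logs a warning and goes back to sleep rather than giving up. The following is a minimal userspace sketch of that wait pattern; it is an illustration only (pthread-based, with made-up names), not the actual ldlm_completion_ast()/ldlm_expired_completion_wait() code.

      /* Conceptual model only -- NOT Lustre code.  A "service thread" waits
       * for a lock grant; each time the timed wait expires it complains and
       * keeps waiting, which is why a stuck MDS looks idle: the threads are
       * all asleep, not spinning. */
      #include <pthread.h>
      #include <stdbool.h>
      #include <stdio.h>
      #include <time.h>
      #include <unistd.h>

      struct lock_req {
          pthread_mutex_t mutex;
          pthread_cond_t  granted_cv;
          bool            granted;    /* set when the conflicting lock is dropped */
      };

      /* Rough analogue of ldlm_completion_ast(): sleep until granted. */
      static void completion_wait(struct lock_req *req, int timeout_sec)
      {
          struct timespec deadline;

          pthread_mutex_lock(&req->mutex);
          while (!req->granted) {
              clock_gettime(CLOCK_REALTIME, &deadline);
              deadline.tv_sec += timeout_sec;
              if (pthread_cond_timedwait(&req->granted_cv, &req->mutex,
                                         &deadline) != 0 && !req->granted)
                  /* analogue of ldlm_expired_completion_wait(): warn, retry */
                  fprintf(stderr, "lock wait expired, still waiting\n");
          }
          pthread_mutex_unlock(&req->mutex);
      }

      /* The other side: whoever held the conflicting lock finally drops it. */
      static void *grant_after_delay(void *arg)
      {
          struct lock_req *req = arg;

          sleep(3);
          pthread_mutex_lock(&req->mutex);
          req->granted = true;
          pthread_cond_signal(&req->granted_cv);
          pthread_mutex_unlock(&req->mutex);
          return NULL;
      }

      int main(void)
      {
          struct lock_req req = {
              .mutex      = PTHREAD_MUTEX_INITIALIZER,
              .granted_cv = PTHREAD_COND_INITIALIZER,
              .granted    = false,
          };
          pthread_t granter;

          pthread_create(&granter, NULL, grant_after_delay, &req);
          completion_wait(&req, 1);   /* logs "expired" a couple of times */
          pthread_join(granter, NULL);
          printf("lock granted, thread resumes\n");
          return 0;
      }

      In the hangs reported here the grant never arrives, so every such thread stays asleep indefinitely, which matches both the idle-looking MDS and the hung client access.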

    Attachments

    Issue Links

    Activity

            [LU-5497] Many MDS service threads blocked in ldlm_completion_ast()
            simmonsja James A Simmons made changes -
            Resolution: Won't Fix
            Status: Open → Closed

            simmonsja James A Simmons added a comment -

            Old blocker for unsupported version
            pjones Peter Jones made changes -
            Start date: 15/Aug/14
            End date: 13/Jan/16
            vinayakh Vinayak (Inactive) added a comment (edited) -

            We have faced this issue in the 2.1.0 build. There are many hung threads in ldlm_completion_ast. Is there any workaround for this?

            Oleg, can you please point to the patch that fixes this problem?

            PID: 240529  TASK: ffff8810288aab40  CPU: 8   COMMAND: "mdt_46"
            #0 [ffff880fc6d1b860] schedule at ffffffff814ea122
            #1 [ffff880fc6d1b928] cfs_waitq_wait at ffffffffa041c69e [libcfs]
            #2 [ffff880fc6d1b938] ldlm_completion_ast at ffffffffa0709722 [ptlrpc]
            #3 [ffff880fc6d1b9e8] ldlm_cli_enqueue_local at ffffffffa0708e11 [ptlrpc]
            #4 [ffff880fc6d1ba78] mdt_object_lock at ffffffffa0bf2a8d [mdt]
            #5 [ffff880fc6d1bb18] mdt_getattr_name_lock at ffffffffa0bff710 [mdt]
            #6 [ffff880fc6d1bbb8] mdt_intent_getattr at ffffffffa0c005cd [mdt]
            #7 [ffff880fc6d1bc08] mdt_intent_policy at ffffffffa0c01679 [mdt]
            #8 [ffff880fc6d1bc48] ldlm_lock_enqueue at ffffffffa06ea1f9 [ptlrpc]
            #9 [ffff880fc6d1bca8] ldlm_handle_enqueue0 at ffffffffa07120ef [ptlrpc]
            #10 [ffff880fc6d1bd18] mdt_enqueue at ffffffffa0c01a16 [mdt]
            #11 [ffff880fc6d1bd38] mdt_handle_common at ffffffffa0bf4ffa [mdt]
            #12 [ffff880fc6d1bd88] mdt_regular_handle at ffffffffa0bf5eb5 [mdt]
            #13 [ffff880fc6d1bd98] ptlrpc_main at ffffffffa0742e0e [ptlrpc]
            #14 [ffff880fc6d1bee8] kthread at ffffffff81090806
            #15 [ffff880fc6d1bf48] kernel_thread at ffffffff8100c14a
            

            mhanafi Mahmoud Hanafi added a comment -

            Looks like we have hit this issue 3 times in the past 48 hours. Lots of threads are stuck in ldlm_completion_ast. We are running 2.4.3. It is not clear from the conversation above which patch is needed.

            LNet: Service thread pid 4390 was inactive for 200.00s. The thread might be hung, or it might only be slow and will resume later. Dumping the stack trace for debugging purposes:
            Pid: 4390, comm: mdt00_094
            
            Call Trace:
             [<ffffffffa07a58b5>] ? _ldlm_lock_debug+0x2d5/0x660 [ptlrpc]
             [<ffffffff815404c2>] schedule_timeout+0x192/0x2e0
             [<ffffffff81080610>] ? process_timeout+0x0/0x10
             [<ffffffffa050d6d1>] cfs_waitq_timedwait+0x11/0x20 [libcfs]
             [<ffffffffa07c9fed>] ldlm_completion_ast+0x4ed/0x960 [ptlrpc]
             [<ffffffffa07c5780>] ? ldlm_expired_completion_wait+0x0/0x390 [ptlrpc]
             [<ffffffff81063be0>] ? default_wake_function+0x0/0x20
             [<ffffffffa07c9728>] ldlm_cli_enqueue_local+0x1f8/0x5d0 [ptlrpc]
             [<ffffffffa07c9b00>] ? ldlm_completion_ast+0x0/0x960 [ptlrpc]
             [<ffffffffa0da1a90>] ? mdt_blocking_ast+0x0/0x2a0 [mdt]
             [<ffffffffa0da7cbb>] mdt_object_lock0+0x33b/0xaf0 [mdt]
             [<ffffffffa0da1a90>] ? mdt_blocking_ast+0x0/0x2a0 [mdt]
             [<ffffffffa07c9b00>] ? ldlm_completion_ast+0x0/0x960 [ptlrpc]
             [<ffffffffa0da8534>] mdt_object_lock+0x14/0x20 [mdt]
             [<ffffffffa0db77a9>] mdt_getattr_name_lock+0xe19/0x1980 [mdt]
             [<ffffffffa07f2125>] ? lustre_msg_buf+0x55/0x60 [ptlrpc]
             [<ffffffffa081a636>] ? __req_capsule_get+0x166/0x700 [ptlrpc]
             [<ffffffffa07f43b4>] ? lustre_msg_get_flags+0x34/0xb0 [ptlrpc]
             [<ffffffffa0db85b2>] mdt_intent_getattr+0x2a2/0x4b0 [mdt]
             [<ffffffffa0da4f3e>] mdt_intent_policy+0x39e/0x720 [mdt]
             [<ffffffffa07aa831>] ldlm_lock_enqueue+0x361/0x8d0 [ptlrpc]
             [<ffffffffa07d11bf>] ldlm_handle_enqueue0+0x4ef/0x10b0 [ptlrpc]
             [<ffffffffa0da53c6>] mdt_enqueue+0x46/0xe0 [mdt]
             [<ffffffffa0dabad7>] mdt_handle_common+0x647/0x16d0 [mdt]
             [<ffffffffa0de58f5>] mds_regular_handle+0x15/0x20 [mdt]
             [<ffffffffa08033b8>] ptlrpc_server_handle_request+0x398/0xc60 [ptlrpc]
             [<ffffffffa050d5de>] ? cfs_timer_arm+0xe/0x10 [libcfs]
             [<ffffffffa051ed5f>] ? lc_watchdog_touch+0x6f/0x170 [libcfs]
             [<ffffffffa07fa719>] ? ptlrpc_wait_event+0xa9/0x290 [ptlrpc]
             [<ffffffff81055813>] ? __wake_up+0x53/0x70
             [<ffffffffa080474e>] ptlrpc_main+0xace/0x1700 [ptlrpc]
             [<ffffffffa0803c80>] ? ptlrpc_main+0x0/0x1700 [ptlrpc]
             [<ffffffff8100c0ca>] child_rip+0xa/0x20
             [<ffffffffa0803c80>] ? ptlrpc_main+0x0/0x1700 [ptlrpc]
             [<ffffffffa0803c80>] ? ptlrpc_main+0x0/0x1700 [ptlrpc]
             [<ffffffff8100c0c0>] ? child_rip+0x0/0x20
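
            For reference, the "Service thread pid ... was inactive for 200.00s" line above comes from a per-thread watchdog: each service thread touches its watchdog before handling a request, and a checker reports (and dumps the stack of) any thread whose last touch is older than the timeout. Below is a minimal userspace model of that mechanism; it is illustrative only and not the libcfs lc_watchdog implementation.

            /* Conceptual model only -- NOT the libcfs watchdog code. */
            #include <pthread.h>
            #include <stdio.h>
            #include <time.h>
            #include <unistd.h>

            struct watchdog {
                pthread_mutex_t lock;
                time_t          last_touch;   /* updated by the service thread */
                int             pid;
            };

            /* Analogue of lc_watchdog_touch(): called before each request. */
            static void watchdog_touch(struct watchdog *wd)
            {
                pthread_mutex_lock(&wd->lock);
                wd->last_touch = time(NULL);
                pthread_mutex_unlock(&wd->lock);
            }

            /* Checker: warn about a thread that has been inactive too long.
             * The real code would also dump that thread's stack trace. */
            static void watchdog_check(struct watchdog *wd, double timeout_sec)
            {
                double idle;

                pthread_mutex_lock(&wd->lock);
                idle = difftime(time(NULL), wd->last_touch);
                pthread_mutex_unlock(&wd->lock);

                if (idle >= timeout_sec)
                    printf("Service thread pid %d was inactive for %.2fs\n",
                           wd->pid, idle);
            }

            int main(void)
            {
                struct watchdog wd = {
                    .lock = PTHREAD_MUTEX_INITIALIZER,
                    .pid  = 4390,          /* pid from the log line above */
                };

                watchdog_touch(&wd);       /* thread picks up a request */
                sleep(3);                  /* ...then blocks in the lock wait */
                watchdog_check(&wd, 2.0);  /* demo timeout; Lustre's here is 200s */
                return 0;
            }

            On a healthy server the touch happens on every request, so the checker stays quiet; in the hangs reported in this ticket the threads never return from ldlm_completion_ast(), so the watchdog keeps firing and dumping traces like the one above.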
            

            morrone Christopher Morrone (Inactive) added a comment -

            We ran Patch Set 9 of change 9488 as part of 2.4.2, but we dropped it in favor of whatever landed on b2_5, now that we are based on 2.5.3.


            simmonsja James A Simmons added a comment -

            We have avoided this problem by going to 2.5 for most of our systems. The patch from LU-4584 should help with this, but if I remember right, Chris reported issues in LU-5525 due to that patch.


            paf Patrick Farrell (Inactive) added a comment -

            Oleg - LU-4584 is... long. Is the patch you're referring to this one: http://review.whamcloud.com/#/c/9488/ ?

            Also, Chris, James, et al. - Any further updates on this? Have the patches been tried? Are you still seeing the problems? Etc.

            green Oleg Drokin added a comment -

            Was this with LU-4584 in the original form?
            There's not too much data here, but I observed similar lockups in my testing using your chaos tree initially.

            Using http://review.whamcloud.com/#/c/6511/ + the latest form of the LU-4584 patch has a high chance of eliminating this problem too, I think.


            morrone Christopher Morrone (Inactive) added a comment -

            We are using LU-4584 with 2.4.


            People

              Assignee: green Oleg Drokin
              Reporter: nedbass Ned Bass (Inactive)
              Votes: 0
              Watchers: 13
