Details
- Improvement
- Resolution: Fixed
- Minor
Description
The default value for lru_max_age is currently 3900s (3900000 msec), or 1h5m.
For most systems and use cases this is far too long. Clients end up holding LDLM locks that they no longer need, which can cause problems for other clients when a lock-holding client has network issues and the server lock callback RPCs are slow. It may also hurt the LRU itself: locks that were used only once stay cached for a long time and can cause more important locks to be evicted sooner, though the DLM LRU algorithm still needs more work, which is tracked in LU-11509.
We regularly tune ldlm.namespaces.*.lru_max_age=300s on large clusters, and it makes sense to change this to be the default.
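As a rough illustration of the tuning described above, the following sketch shows how the value could be inspected and changed on a client with `lctl`. It assumes a mounted Lustre client, root privileges, and a release that accepts the `300s` suffix as written in this ticket; the `PARAM` and `NEW_AGE` variable names are only for illustration.

```shell
#!/bin/sh
# Sketch only: apply the 300s LRU max age discussed in this ticket.
# Assumes a Lustre client with lctl in PATH and root privileges.
PARAM='ldlm.namespaces.*.lru_max_age'   # quoted so lctl, not the shell, expands the wildcard
NEW_AGE='300s'                          # value taken from the ticket text

if command -v lctl >/dev/null 2>&1; then
    # Show the current setting for every LDLM namespace.
    lctl get_param "$PARAM"
    # Change it for the running system (not persistent across remounts).
    lctl set_param "$PARAM=$NEW_AGE"
    # Confirm the new value.
    lctl get_param "$PARAM"
else
    # No Lustre tools on this host: just print what would be run.
    echo "lctl not found; would run: lctl set_param $PARAM=$NEW_AGE"
fi
```

A setting made this way lasts only until the filesystem is remounted. Sites that want it to persist typically use `lctl set_param -P` on the MGS instead, which is worth checking against the documentation for the installed Lustre version.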
Issue Links
- duplicates
  - LU-14517 Decrease default lru_max_age value (Resolved)
- is related to
  - LU-7266 Fix LDLM pool to make LRUR working properly (Open)
  - LU-6402 reduce the value of LDLM_POOL_MAX_AGE (Open)
  - LU-15630 recovery-small test_10a: no eviction: before:1646723217 (Open)
  - LU-18288 sanity test_120e: FAIL: 2 cancel RPC occured (Resolved)
  - LU-6529 Server side lock limits to avoid unnecessary memory exhaustion (Closed)
  - LU-11509 LDLM: replace client lock LRU with improved cache algorithm (Open)
- is related to
  - LU-11092 NMI watchdog: BUG: soft lockup - CPU#12 stuck for 23s! [ptlrpcd_00_18:4222] (Open)