[LU-16596] Lustre client crashed with ASSERTION( ldlm_is_granted(lock) ) failed Created: 27/Feb/23 Updated: 30/Nov/23 Resolved: 30/Nov/23 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Major |
| Reporter: | Dominika Wanat | Assignee: | WC Triage |
| Resolution: | Duplicate | Votes: | 0 |
| Labels: | None | ||
| Environment: |
Rocky Linux 8.7, Kernel 4.18.0-425.10.1.el8_7.x86_64, Lustre Client 2.15.2 (compiled from b2_15 branch). |
||
| Issue Links: |
|
||||||||
| Severity: | 3 | ||||||||
| Rank (Obsolete): | 9223372036854775807 | ||||||||
| Description |
|
Yesterday one of our Lustre clients built on top of the 2.15.2 release (from the b2_15 branch) crashed with the following LBUG: [1361556.476660] LustreError: 882654:0:(ldlm_lock.c:1097:ldlm_grant_lock_with_skiplist()) ASSERTION( ldlm_is_granted(lock) ) failed: I found a similar LU: https://jira.whamcloud.com/browse/LU-13927 but with the Lustre 2.12.5 on MDS, patches linked in this LU are already there. |
| Comments |
| Comment by Etienne Aujames [ 01/Mar/23 ] |
|
The https://review.whamcloud.com/39854 is not included in 2.12.5: $ git log --oneline -1 dcbb023c2f57fff8c856cb5c777855266b7f7b6c dcbb023 LU-11276 ldlm: fix lock convert races $ git tag --contains=dcbb023c2f57fff8c856cb5c777855266b7f7b6c 2.12.6 2.12.6-RC1 2.12.6-RC2 2.12.7 2.12.7-RC1 2.12.8 2.12.9 2.12.9-RC1 v2_12_6 v2_12_6-RC1 v2_12_6-RC2 v2_12_7 v2_12_7-RC1 v2_12_8 v2_12_9 v2_12_9-RC1 |
| Comment by Dominika Wanat [ 06/Mar/23 ] |
|
Hello Etienne, thank you for your help. We have our own branch named "ares-client-2.15-rocky" and this commit seems to have different commit id in our case (probably because this branch was merged from master some time ago and has nothing in common with b2_12). This patch was cherry-picked to master, has commit id: 6c0b676e41245c2f74bcf7f3f1ac9fcb0fd6c319 and you can find it there: https://review.whamcloud.com/c/fs/lustre-release/+/36466 $ git log --oneline -1 dcbb023c2f57fff8c856cb5c777855266b7f7b6c dcbb023c2f LU-11276 ldlm: fix lock convert races $ git log --oneline -1 6c0b676e41245c2f74bcf7f3f1ac9fcb0fd6c319 6c0b676e41 LU-11276 ldlm: fix lock convert races The second patch is visible in our branch (and in master): $ git branch --contains=6c0b676e41245c2f74bcf7f3f1ac9fcb0fd6c319 * ares-client-2.15-rocky (...) master And patch with commit id cited by you is related to the different branches 2.12.x based on b2_12 (it is clearly visible in Gerrit): $ git branch --contains dcbb023c2f57fff8c856cb5c777855266b7f7b6c prometheus-client-2.12 prometheus-client-2.12-sysofed So, I think that we can assume that this patch is here.
|
| Comment by Andreas Dilger [ 30/Nov/23 ] |
|
This should be fixed by the patch in |