Details
- Type: Bug
- Resolution: Fixed
- Priority: Minor
Description
One process got stalled on spin_lock(&pidmap->jp_lock):
PID: 15240 TASK: ffff8848c676edd0 CPU: 16 COMMAND: "wrf.exe"
#0 [ffff882f80005e48] crash_nmi_callback at ffffffff8104d342
#1 [ffff882f80005e58] nmi_handle at ffffffff816901b7
#2 [ffff882f80005eb0] do_nmi at ffffffff816903c3
#3 [ffff882f80005ef0] end_repeat_nmi at ffffffff8168f5d3
[exception RIP: _raw_spin_lock+50]
RIP: ffffffff8168ea92 RSP: ffff8848c7e47b48 RFLAGS: 00000206
RAX: 0000000000006d1c RBX: ffff883138e99240 RCX: 0000000000000004
RDX: 0000000000005a5a RSI: 0000000000005a5a RDI: ffff883138e99258
RBP: ffff8848c7e47b48 R8: 0000000000019b80 R9: ffffffffa091e1da
R10: ffff882f80019b80 R11: ffffea00bd578e00 R12: ffff883138e99258
R13: 0000000000000000 R14: 0000000000000020 R15: ffff882efccfeb24
ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018
--- <NMI exception stack> ---
#4 [ffff8848c7e47b48] _raw_spin_lock at ffffffff8168ea92
#5 [ffff8848c7e47b50] jobid_get_from_cache at ffffffffa0a739e8 [obdclass]
#6 [ffff8848c7e47bb8] lustre_get_jobid at ffffffffa0a7429c [obdclass]
#7 [ffff8848c7e47c20] vvp_io_init at ffffffffa0e89ff4 [lustre]
#8 [ffff8848c7e47c70] cl_io_submit_rw at ffffffffa0a6ebb8 [obdclass]
#9 [ffff8848c7e47ca8] cl_io_init at ffffffffa0a6ed4a [obdclass]
#10 [ffff8848c7e47cd8] cl_io_rw_init at ffffffffa0a6f743 [obdclass]
#11 [ffff8848c7e47d28] ll_file_io_generic at ffffffffa0e33232 [lustre]
#12 [ffff8848c7e47e40] ll_file_aio_write at ffffffffa0e33fad [lustre]
#13 [ffff8848c7e47ea0] ll_file_write at ffffffffa0e3413e [lustre]
#14 [ffff8848c7e47ef8] vfs_write at ffffffff811fe9fd
#15 [ffff8848c7e47f38] sys_write at ffffffff811ff51f
#16 [ffff8848c7e47f80] system_call_fastpath at ffffffff81697809
RIP: 00002b798058943d RSP: 00007ffeb53c6998 RFLAGS: 00010206
RAX: 0000000000000001 RBX: ffffffff81697809 RCX: 0000000000000000
RDX: 0000000000000033 RSI: 0000000007d4a6c0 RDI: 0000000000000002
RBP: 00007ffeb53c6180 R8: 0000000000000000 R9: 0000000007b6a7e0
R10: 0000000000000000 R11: 0000000000000293 R12: 0000000007d4a6c0
R13: 0000000000020000 R14: 0000000000000033 R1
Issue Links
- is related to: LU-12808 jobid_get_from_cache softlockup. (Resolved)
Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/35008/
Subject: LU-12225 obdclass: improve jobid memory reclaim policy
Project: fs/lustre-release
Branch: b2_12
Current Patch Set:
Commit: cffbcfd60b4a55387e968c92c8f6cfeb0d17a35f