Details
-
Bug
-
Resolution: Duplicate
-
Major
-
None
-
Lustre 2.1.0
-
None
-
3
-
6452
Description
Example of affected thread :
============================
PID: 7947 TASK: ffff881030721850 CPU: 1 COMMAND: "ptlrpcd_4"
#0 [ffff880044e27e90] crash_nmi_callback at ffffffff8101fd06
0000001 [ffff880044e27ea0] notifier_call_chain at ffffffff814837f5
0000002 [ffff880044e27ee0] atomic_notifier_call_chain at ffffffff8148385a
0000003 [ffff880044e27ef0] notify_die at ffffffff8108026e
0000004 [ffff880044e27f20] do_nmi at ffffffff81481443
0000005 [ffff880044e27f50] nmi at ffffffff81480d50
[exception RIP: _spin_lock+30]
RIP: ffffffff8148062e RSP: ffff881030757da0 RFLAGS: 00000202
RAX: 0000000000000000 RBX: ffff881030632540 RCX: 0000000000000000
RDX: 0000000000000001 RSI: ffff88103c45e3d0 RDI: ffff88103c45e498
RBP: ffff881030757da0 R8: ebc0de0100000000 R9: ffffffff00000100
R10: 0000000000000000 R11: 000000000000000f R12: ffff881030632540
R13: ffff881030632570 R14: ffff88103c45e3d0 R15: ffff88103c45e498
ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018
— <NMI exception stack> —
0000006 [ffff881030757da0] _spin_lock at ffffffff8148062e
0000007 [ffff881030757da8] ptlrpcd_check at ffffffffa05cfecc [ptlrpc]
0000008 [ffff881030757e38] ptlrpcd at ffffffffa05d03ff [ptlrpc]
0000009 [ffff881030757f48] kernel_thread at ffffffff810041aa
============================
Concerned "partner" ptlrpcds->pd_threads[]->pc_lock spin_lock seems not initialized causing current ptlrpcd thread to spin for-ever !!!
A possible fix for this problem should be to wait for all ptlrpcd/partner threads to fully initialize prior to start operations ....