Details
-
Bug
-
Resolution: Duplicate
-
Critical
-
None
-
Lustre 2.7.0
-
None
-
2.7.1-fe
-
3
-
9223372036854775807
Description
OSS deadlocked unable to ping ethernet or IB interfaces. Console showed no errors.
Attaching full trace of all threads. Most notable are kiblnd
ID: 8711 TASK: ffff882020b12ab0 CPU: 4 COMMAND: "kiblnd_sd_01_01"
#0 [ffff880060c86e90] crash_nmi_callback at ffffffff81032256
#1 [ffff880060c86ea0] notifier_call_chain at ffffffff81568515
#2 [ffff880060c86ee0] atomic_notifier_call_chain at ffffffff8156857a
#3 [ffff880060c86ef0] notify_die at ffffffff810a44fe
#4 [ffff880060c86f20] do_nmi at ffffffff8156618f
#5 [ffff880060c86f50] nmi at ffffffff815659f0
[exception RIP: _spin_lock+33]
RIP: ffffffff81565261 RSP: ffff882021b75b70 RFLAGS: 00000293
RAX: 0000000000002b8e RBX: ffff880ffe7dd240 RCX: 0000000000000000
RDX: 0000000000002b8b RSI: 0000000000000003 RDI: ffff88201ee3f140
RBP: ffff882021b75b70 R8: 6950000000000000 R9: 4a80000000000000
R10: 0000000000000001 R11: 0000000000000001 R12: 0000000000000018
R13: ffff881013262e40 R14: ffff8820268ecac0 R15: 0000000000000004
ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018
--- <NMI exception stack> ---
#6 [ffff882021b75b70] _spin_lock at ffffffff81565261
#7 [ffff882021b75b78] cfs_percpt_lock at ffffffffa049edab [libcfs]
#8 [ffff882021b75bb8] lnet_ptl_match_md at ffffffffa0529605 [lnet]
#9 [ffff882021b75c38] lnet_parse_local at ffffffffa05306e7 [lnet]
#10 [ffff882021b75cd8] lnet_parse at ffffffffa05316da [lnet]
#11 [ffff882021b75d68] kiblnd_handle_rx at ffffffffa0a16f3b [ko2iblnd]
#12 [ffff882021b75db8] kiblnd_scheduler at ffffffffa0a182be [ko2iblnd]
#13 [ffff882021b75ee8] kthread at ffffffff8109dc8e
#14 [ffff882021b75f48] kernel_thread at ffffffff8100c28a
Attachments
Issue Links
- is related to
-
LU-7980 Overrun in generic <size-128> kmem_cache Slabs causing OSS to crash
-
- Resolved
-