Details
-
Bug
-
Resolution: Fixed
-
Critical
-
Lustre 2.12.4
-
kmod-lustre-2.12.4_5.chaos
3.10.0-1127.0.0.1chaos.ch6.x86_64
-
3
-
9223372036854775807
Description
During an lnet shutdown, the router crashes with the following stack:
BUG: unable to handle kernel NULL pointer dereference at 0000000000000038 IP: [<ffffffffc0d43bda>] lnet_post_send_locked+0x7a/0xa40 [lnet] PGD 0 Oops: 0000 [#1] SMP CPU: 1 PID: 19931 Comm: kiblnd_sd_00_02 Kdump: loaded Tainted: G W OE ------------ T 3.10.0-1127.0.0.1chaos.ch6.x86_64 #1 Hardware name: Cray Inc. SERVER-1824X-GN/S2600GZ, BIOS SE5C600.86B.02.06.0002.101320150901 10/13/2015 task: ffff8e581c0562a0 ti: ffff8e601d554000 task.ti: ffff8e601d554000 RIP: 0010:[<ffffffffc0d43bda>] [<ffffffffc0d43bda>] lnet_post_send_locked+0x7a/0xa40 [lnet] RSP: 0018:ffff8e601d557c50 EFLAGS: 00010202 RAX: 0000000000000000 RBX: ffff8e5f8666fc00 RCX: dead000000000200 RDX: ffff8e5eb083e810 RSI: 0000000000000001 RDI: ffff8e5f8666fc00 RBP: ffff8e601d557cb0 R08: ffff8e5f8666fc10 R09: ffff8e5174561070 R10: 0000000000000008 R11: ffff8e58154e48b8 R12: 0000000000000000 R13: ffff8e6017c3a800 R14: ffff8e58170fa100 R15: ffff8e57f5a94000 FS: 0000000000000000(0000) GS:ffff8e581ea40000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000038 CR3: 000000101ae34000 CR4: 00000000000607e0 Call Trace: [<ffffffffc0d46238>] lnet_return_tx_credits_locked+0x238/0x4a0 [lnet] [<ffffffffc0d3895c>] lnet_msg_decommit+0xec/0x700 [lnet] [<ffffffffc0d39a2c>] lnet_finalize+0x34c/0xd40 [lnet] [<ffffffffc0aa875d>] kiblnd_tx_done+0x10d/0x3e0 [ko2iblnd] [<ffffffffc0ab3d19>] kiblnd_scheduler+0x8c9/0x1160 [ko2iblnd] [<ffffffffaac2d59e>] ? __switch_to+0xce/0x5a0 [<ffffffffaace29b0>] ? wake_up_state+0x20/0x20 [<ffffffffc0ab3450>] ? kiblnd_cq_event+0x90/0x90 [ko2iblnd] [<ffffffffaaccca01>] kthread+0xd1/0xe0 [<ffffffffaaccc930>] ? insert_kthread_work+0x40/0x40 [<ffffffffab3bff77>] ret_from_fork_nospec_begin+0x21/0x21 [<ffffffffaaccc930>] ? insert_kthread_work+0x40/0x40 Code: 45 c0 49 8b 45 50 4e 8b 34 e0 0f 85 8e 08 00 00 80 7b 72 00 0f 88 52 08 00 00 f6 43 6d 01 0f 84 b6 08 00 00 48 8b 05 56 74 03 00 <48> 8b 40 38 49 39 87 20 01 00 00 0f 84 fe 07 00 00 8b 7b 28 85 RIP [<ffffffffc0d43bda>] lnet_post_send_locked+0x7a/0xa40 [lnet] RSP <ffff8e601d557c50> CR2: 0000000000000038