Details
-
Bug
-
Resolution: Not a Bug
-
Minor
-
None
-
Lustre 2.15.4
-
None
-
zfs-2.1.15_2llnl-1.t4.x86_64 https://github.com/LLNL/zfs/tree/zfs-2.1.15_2llnl
lustre-2.15.4_2.llnl-2.t4.x86_64 https://github.com/LLNL/lustre/commits/2.15.4_4.llnl
TOSS 4.7-5
kernel 4.18.0-513.18.1.2toss.t4.x86_64
-
3
-
9223372036854775807
Description
An OST server node (copper50) crashed and failed over.
The stack trace is
2024-04-17 23:18:22 [556577.979144] general protection fault, probably for non-canonical address 0x3760003237332428: 0000 [#1] SMP PTI 2024-04-17 23:18:22 [556577.990410] CPU: 3 PID: 220137 Comm: ll_ost00_020 Kdump: loaded Tainted: P W OE KX --------- - - 4.18.0-513.18.1.2toss.t4.x86_64 #1 2024-04-17 23:18:22 [556578.004602] Hardware name: Intel Corporation S2600WTTR/S2600WTTR, BIOS SE5C610.86B.01.01.0024.021320181901 02/13/2018 2024-04-17 23:18:22 [556578.016541] RIP: 0010:strcmp+0xc/0x30 2024-04-17 23:18:22 [556578.020730] Code: 75 f7 31 d2 44 0f b6 04 16 44 88 04 11 48 83 c2 01 45 84 c0 75 ee c3 cc cc cc cc 0f 1f 00 31 c0 eb 08 48 83 c0 01 84 d2 74 13 <0f> b6 14 07 3a 14 06 74 ef 19 c0 83 c8 01 c3 cc cc cc cc 31 c0 c3 2024-04-17 23:18:22 [556578.041782] RSP: 0018:ffffbc9d2debbb58 EFLAGS: 00010246 2024-04-17 23:18:22 [556578.047712] RAX: 0000000000000000 RBX: 3760003237332400 RCX: 000000000000000f 2024-04-17 23:18:22 [556578.055772] RDX: 0000000000000032 RSI: ffffffffc18f2c7d RDI: 3760003237332428 2024-04-17 23:18:22 [556578.063829] RBP: ffffffffc18f2c7d R08: 00000000c063529e R09: ffffffffc18f2c8c 2024-04-17 23:18:22 [556578.071888] R10: ffff970622b19400 R11: ffff97115b98a000 R12: 000000000000000a 2024-04-17 23:18:22 [556578.079959] R13: ffffbc9d2debbc04 R14: ffff97022a6e6c80 R15: ffff9701d08a0e00 2024-04-17 23:18:22 [556578.088019] FS: 0000000000000000(0000) GS:ffff970fffac0000(0000) knlGS:0000000000000000 2024-04-17 23:18:22 [556578.097144] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 2024-04-17 23:18:22 [556578.103651] CR2: 00007fffea61b1a0 CR3: 0000000d9ae10005 CR4: 00000000003706e0 2024-04-17 23:18:22 [556578.111710] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 2024-04-17 23:18:22 [556578.119768] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 2024-04-17 23:18:22 [556578.127826] Call Trace: 2024-04-17 23:18:22 [556578.130651] ? __die_body+0x1a/0x60 2024-04-17 23:18:22 [556578.134643] ? die_addr+0x38/0x51 2024-04-17 23:18:22 [556578.138438] ? do_general_protection+0x14f/0x2a0 2024-04-17 23:18:22 [556578.143687] ? general_protection+0x1e/0x30 2024-04-17 23:18:22 [556578.148454] ? strcmp+0xc/0x30 2024-04-17 23:18:22 [556578.151962] nvt_lookup_name_type.isra.53+0x72/0xb0 [znvpair] 2024-04-17 23:18:22 [556578.158479] nvlist_lookup_common+0x32/0x80 [znvpair] 2024-04-17 23:18:22 [556578.164219] __osd_sa_xattr_get.isra.10.part.11+0x37/0xc0 [osd_zfs] 2024-04-17 23:18:22 [556578.171324] osd_xattr_get_internal+0x5b/0x150 [osd_zfs] 2024-04-17 23:18:22 [556578.177357] osd_xattr_get+0x35b/0x5f0 [osd_zfs] 2024-04-17 23:18:22 [556578.182613] dt_version_get+0x75/0x240 [obdclass] 2024-04-17 23:18:22 [556578.188016] ofd_version_get_check+0x20/0x1f0 [ofd] 2024-04-17 23:18:22 [556578.193571] ofd_attr_set+0x77/0x1090 [ofd] 2024-04-17 23:18:22 [556578.198340] ofd_setattr_hdl+0x3db/0x6e0 [ofd] 2024-04-17 23:18:22 [556578.203401] tgt_request_handle+0xce0/0x1a40 [ptlrpc] 2024-04-17 23:18:22 [556578.209235] ? ptlrpc_nrs_req_get_nolock0+0xff/0x1f0 [ptlrpc] 2024-04-17 23:18:22 [556578.215788] ptlrpc_server_handle_request+0x323/0xbe0 [ptlrpc] 2024-04-17 23:18:22 [556578.222458] ptlrpc_main+0xc24/0x1580 [ptlrpc] 2024-04-17 23:18:22 [556578.227566] ? ptlrpc_wait_event+0x5d0/0x5d0 [ptlrpc] 2024-04-17 23:18:22 [556578.233355] kthread+0x14c/0x170 2024-04-17 23:18:22 [556578.237053] ? set_kthread_struct+0x50/0x50 2024-04-17 23:18:22 [556578.241815] ret_from_fork+0x35/0x40