Details
-
Bug
-
Resolution: Unresolved
-
Medium
-
None
-
None
-
None
-
3
-
9223372036854775807
Description
Terminal A:
while /bin/true; do umount /mnt/lustre; mount mgs9@tcp:/lustre -t lustre -o rw,checksum,rchecksum_force,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,encrypt,statfs_project /mnt/lustre; done
Terminal B:
while /bin/true; do lctl set_param *.*.checksums=0; lctl set_param *.*.checksums=1; done
Triggers crash:
{nofomrat}BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 28f88fe067 P4D 28f88fe067 PUD 291ac56067 PMD 0
Oops: 0000 1 SMP NOPTI
CPU: 89 PID: 77137 Comm: lctl Kdump: loaded Tainted: G OE n 6.4.0-150600.23.47_15.0.10-cray_shasta_c #1 SLE15-SP6 (unreleased) 1ae119e3d64f842183db60cbfc05effc837d5d66
Hardware name: HPE HPE_CRAY_EX425/HPE CRAY EX425, BIOS 1.7.4 04-16-2024
RIP: 0010:lov_set_info_async+0x278/0x850 [lov]
Code: 01 00 00 00 e8 69 6b 49 ff 8b 44 24 20 85 db 0f 44 d8 49 83 c6 10 49 81 fe 00 10 00 00 0f 84 87 03 00 00 48 8b 85 e8 07 00 00 <4a> 8b 04 30 48 85 c0 74 df 8b 15 d5 42 4b ff 4c 8b b8 40 04 00 00
RSP: 0018:ffffb85d7236fcc0 EFLAGS: 00010246
RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
RDX: ffff8e9e8f461000 RSI: 00000000000000a0 RDI: ffffffffc2c108ad
RBP: ffff8e976390a280 R08: 0000000000000080 R09: 0000000000000000
R10: ffffb85d7236fc80 R11: 0000000000000070 R12: 00000000ffffffa1
R13: ffff8e975eaf5f00 R14: 0000000000000000 R15: ffff8e974978c560
FS: 00007fba08df3740(0000) GS:ffff8e9e8f280000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 00000030b9d50000 CR4: 0000000000350ee0
Call Trace:
<TASK>
? __die_body+0x1a/0x60
? page_fault_oops+0x131/0x540
? srso_return_thunk+0x5/0x5f
? __kmem_cache_alloc_node+0x1a5/0x2f0
? srso_return_thunk+0x5/0x5f
? slab_post_alloc_hook+0x69/0x2b0
? exc_page_fault+0x69/0x130
? asm_exc_page_fault+0x22/0x30
? lov_set_info_async+0x278/0x850 [lov 12a56ad90984a8fc47d8ac56d92a4193606c32fc]
checksums_store+0x119/0x420 [lustre d2945174a7916eb345bcb2a17de02f96ba554259]
kernfs_fop_write_iter+0x137/0x1b0
vfs_write+0x2a7/0x340
ksys_write+0xa5/0xe0
? srso_return_thunk+0x5/0x5f
do_syscall_64+0x5b/0xc0
? do_sys_openat2+0x1d7/0x320
? srso_return_thunk+0x5/0x5f
? do_sys_openat2+0x1d7/0x320
? srso_return_thunk+0x5/0x5f
? do_sys_open+0x57/0x80
? srso_return_thunk+0x5/0x5f
? syscall_exit_to_user_mode+0x1e/0x40
? srso_return_thunk+0x5/0x5f
? do_syscall_64+0x67/0xc0
? syscall_exit_to_user_mode+0x1e/0x40
? srso_return_thunk+0x5/0x5f
? do_syscall_64+0x67/0xc0
? do_syscall_64+0x67/0xc0
? srso_return_thunk+0x5/0x5f
? exc_page_fault+0x69/0x130
entry_SYSCALL_64_after_hwframe+0x7c/0xe6{nofomrat}
Or crash:
{nofomrat}general protection fault, probably for non-canonical address 0x5a5a5a5a5a5a5e9a: 0000 1 PREEMPT SMP NOPTI
CPU: 1 PID: 636809 Comm: lctl Kdump: loaded Tainted: G OE ------- — 5.14.0-427.42.1.ldiskfs.el9.x86_64 #1
Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.16.3-debian-1.16.3-2 04/01/2014
RIP: 0010:lov_set_info_async+0x216/0x620 [lov]
Code: c3 10 48 81 fb 00 10 00 00 0f 84 7a 02 00 00 48 8b 85 f8 07 00 00 48 85 c0 74 e3 48 8b 04 18 48 85 c0 74 da 8b 0d 26 22 6c ff <4c> 8b a0 40 04 00 00 83 e1 01 0f 85 d3 fe ff ff b8 ed ff ff ff 4d
RSP: 0018:ffffbb0f0a0cbcc0 EFLAGS: 00010206
RAX: 5a5a5a5a5a5a5a5a RBX: 0000000000000000 RCX: 00000000133f0584
RDX: 000000000000000d RSI: 0000000000000001 RDI: ffff9efb30052400
RBP: ffff9efa6610c560 R08: 0000000000000004 R09: 0000000000000000
R10: ffff9efa10833300 R11: 0000000000000000 R12: 0000000000000009
R13: ffffffffc18158c1 R14: 0000000000000000 R15: 0000000000000000
FS: 00007ffa6ba12740(0000) GS:ffff9f015fc80000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000555d2e7a34b8 CR3: 000000015f234000 CR4: 0000000000750ee0
PKRU: 55555554
Call Trace:
<TASK>
? srso_alias_return_thunk+0x5/0xfbef5
? show_trace_log_lvl+0x26e/0x2df
? show_trace_log_lvl+0x26e/0x2df
? obd_set_info_async.constprop.0+0x1a7/0x370 [lustre]
? __die_body.cold+0x8/0xd
? die_addr+0x39/0x60
? exc_general_protection+0x1aa/0x400
? asm_exc_general_protection+0x22/0x30
? lov_set_info_async+0x216/0x620 [lov]
obd_set_info_async.constprop.0+0x1a7/0x370 [lustre]
checksums_store+0x7d/0x120 [lustre]
kernfs_fop_write_iter+0x121/0x1b0
vfs_write+0x2cb/0x410
ksys_write+0x5f/0xe0
do_syscall_64+0x59/0x90
? srso_alias_return_thunk+0x5/0xfbef5
? syscall_exit_work+0x103/0x130
? srso_alias_return_thunk+0x5/0xfbef5
? syscall_exit_to_user_mode+0x19/0x40
? srso_alias_return_thunk+0x5/0xfbef5
? do_syscall_64+0x69/0x90
? srso_alias_return_thunk+0x5/0xfbef5
? syscall_exit_to_user_mode+0x19/0x40
? srso_alias_return_thunk+0x5/0xfbef5
? do_syscall_64+0x69/0x90
? srso_alias_return_thunk+0x5/0xfbef5
? syscall_exit_work+0x103/0x130
? srso_alias_return_thunk+0x5/0xfbef5
? syscall_exit_to_user_mode+0x19/0x40
? srso_alias_return_thunk+0x5/0xfbef5
? do_syscall_64+0x69/0x90
? do_syscall_64+0x69/0x90
? do_syscall_64+0x69/0x90
entry_SYSCALL_64_after_hwframe+0x77/0xe1
RIP: 0033:0x7ffa6b8ff117
Code: 0f 00 f7 d8 64 89 02 48 c7 c0 ff ff ff ff eb b7 0f 1f 00 f3 0f 1e fa 64 8b 04 25 18 00 00 00 85 c0 75 10 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 51 c3 48 83 ec 28 48 89 54 24 18 48 89 74 24
RSP: 002b:00007ffffa2d6f78 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
RAX: ffffffffffffffda RBX: 0000000000000002 RCX: 00007ffa6b8ff117
RDX: 0000000000000001 RSI: 00007ffffa2db2b5 RDI: 0000000000000003
RBP: 0000000000000003 R08: 0000000000000000 R09: 0000555d2e793800
R10: 0000000000000000 R11: 0000000000000246 R12: 00007ffffa2db2b5
R13: 0000555d2e793800 R14: 00007ffffa2d9188 R15: 0000555d2e793550
</TASK>
Modules linked in: obdecho(OE) mgc(OE) ptlrpc_gss(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) libcfs(OE) rpcrdma rdma_cm iw_cm ib_cm ib_core rfkill intel_rapl_msr intel_rapl_common kvm_amd ccp qxl iTCO_wdt drm_ttm_helper iTCO_vendor_support ttm kvm i2c_i801 drm_kms_helper i2c_smbus syscopyarea irqbypass pcspkr sysfillrect lpc_ich virtio_balloon sysimgblt fb_sys_fops joydev nfsd auth_rpcgss nfs_acl lockd grace dm_mod sunrpc drm fuse ext4 mbcache jbd2 sr_mod cdrom sg ahci libahci crct10dif_pclmul crc32_pclmul virtio_net libata crc32c_intel net_failover failover ghash_clmulni_intel virtio_console virtio_blk serio_raw