Details
-
Bug
-
Resolution: Unresolved
-
Minor
-
None
-
Lustre 2.16.0
-
None
-
3
-
9223372036854775807
Description
Hit this crash on master-next, apparently from a recently landed patch:
[67331.365407] Lustre: DEBUG MARKER: == sanity test 160s: changelog garbage collect on idle records acceptance-small.sh acl aiocp.c auster badarea_io badarea_io-badarea_io.o badarea_io.c cfg check_fallocate check_fallocate.c check_fallocate.o check_fhandle_syscalls check_fhandle_syscalls.c c
[67334.927458] Lustre: 31259:0:(mdd_dir.c:895:mdd_changelog_store()) lustre-MDD0000: starting changelog garbage collection
[67334.933112] Lustre: 13419:0:(mdd_trans.c:160:mdd_chlg_garbage_collect()) lustre-MDD0000: force deregister of changelog user cl24 idle for 864003s with 500000004 unprocessed records
[67334.950027] general protection fault: 0000 [#1] SMP DEBUG_PAGEALLOC
[67334.950643] Modules linked in: lustre(OE) ofd(OE) osp(OE) lod(OE) ost(OE) mdt(OE) mdd(OE) mgs(OE) osd_zfs(OE) lquota(OE) lfsck(OE) obdecho(OE) mgc(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc_gss(OE) ptlrpc(OE) obdclass(OE) ksocklnd(OE) lnet(OE) libcfs(OE) nfsd ext4 mbcache jbd2 loop zfs(PO) zunicode(PO) zlua(PO) zcommon(PO) znvpair(PO) zavl(PO) icp(PO) spl(O) crc32_generic crc_t10dif crct10dif_generic crct10dif_common virtio_balloon pcspkr i2c_piix4 virtio_console ip_tables rpcsec_gss_krb5 drm_kms_helper ata_generic ttm pata_acpi drm drm_panel_orientation_quirks virtio_blk ata_piix serio_raw floppy i2c_core libata [last unloaded: libcfs]
[67334.951005] CPU: 11 PID: 31259 Comm: mdt05_003 Kdump: loaded Tainted: P W OE ------------ 3.10.0-7.9-debug #2
[67334.951005] Hardware name: Red Hat KVM, BIOS 1.15.0-1.module_el8.6.0+1087+b42c8331 04/01/2014
[67334.951005] task: ffff8800b95449d0 ti: ffff8801f31e4000 task.ti: ffff8801f31e4000
[67334.951005] RIP: 0010:[<ffffffffa1164bac>] [<ffffffffa1164bac>] osd_attr_get+0x6c/0x4d0 [osd_zfs]
[67334.951005] RSP: 0018:ffff8801f31e76e0 EFLAGS: 00010282
[67334.951005] RAX: 6b6b6b6b6b6b6b6b RBX: ffff8801e3349738 RCX: ffff8801f31e7fd8
[67334.951005] RDX: 0000000000000000 RSI: 0000000000000015 RDI: ffffffff81a98ea7
[67334.951005] RBP: ffff8801f31e7730 R08: 00000000ffffffff R09: ffff8802f6b33fc8
[67334.951005] R10: ffff88028bbcea08 R11: ffff88023db9b548 R12: ffff8802f86f5648
[67334.951005] R13: ffff880187c08000 R14: ffff8801e3349738 R15: ffff8802f86f56e8
[67334.951005] FS: 0000000000000000(0000) GS:ffff880331cc0000(0000) knlGS:0000000000000000
[67334.951005] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[67334.951005] CR2: 00007f3d1b48d000 CR3: 0000000288b26000 CR4: 00000000000007e0
[67334.951005] Call Trace:
[67334.951005] [<ffffffffa0374680>] llog_osd_write_rec+0x330/0x1b60 [obdclass]
[67334.951005] [<ffffffffa13cb995>] ? lod_xattr_set_internal+0xa5/0x2b0 [lod]
[67334.951005] [<ffffffffa11e9a26>] mdd_changelog_write_rec+0xc6/0x120 [mdd]
[67334.951005] [<ffffffffa1167ca9>] ? osd_attr_set+0x4b9/0xf50 [osd_zfs]
[67334.951005] [<ffffffffa0363760>] llog_write_rec+0x290/0x590 [obdclass]
[67334.951005] [<ffffffffa0369691>] llog_cat_add_rec+0x1e1/0x990 [obdclass]
[67334.951005] [<ffffffffa03605ff>] llog_add+0x17f/0x1f0 [obdclass]
[67334.951005] [<ffffffffa11e9c8e>] mdd_changelog_store+0x17e/0x580 [mdd]
[67334.951005] [<ffffffffa11ea755>] mdd_changelog_ns_store+0x3d5/0x910 [mdd]
[67334.951005] [<ffffffffa11fdc18>] ? mdd_attr_set_internal+0xc8/0x2e0 [mdd]
[67334.951005] [<ffffffffa11fdea5>] ? mdd_update_time+0x75/0x1d0 [mdd]
[67334.951005] [<ffffffffa11ec7e4>] mdd_create+0x1554/0x1ce0 [mdd]
[67334.951005] [<ffffffffa1299ab1>] mdt_create+0xbf1/0x11f0 [mdt]
[67334.951005] [<ffffffffa129a450>] mdt_reint_create+0x3a0/0x460 [mdt]
[67334.951005] [<ffffffffa129e157>] mdt_reint_rec+0x87/0x240 [mdt]
[67334.951005] [<ffffffffa1273cec>] mdt_reint_internal+0x76c/0xb50 [mdt]
[67334.951005] [<ffffffffa127e4d7>] mdt_reint+0x67/0x150 [mdt]
[67334.951005] [<ffffffffa06ba02a>] tgt_request_handle+0x93a/0x19c0 [ptlrpc]
[67334.951005] [<ffffffffa02c7a5e>] ? libcfs_nid2str_r+0xfe/0x130 [lnet]
[67334.951005] [<ffffffffa06693a0>] ptlrpc_server_handle_request+0x250/0xc30 [ptlrpc]
[67334.951005] [<ffffffffa066b059>] ptlrpc_main+0xbd9/0x15f0 [ptlrpc]
[67334.951005] [<ffffffff814119f9>] ? do_raw_spin_unlock+0x49/0x90
[67334.951005] [<ffffffffa066a480>] ? ptlrpc_wait_event+0x620/0x620 [ptlrpc]
[67334.951005] [<ffffffff810ba114>] kthread+0xe4/0xf0
[67334.951005] [<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
[67334.951005] [<ffffffff817ede5d>] ret_from_fork_nospec_begin+0x7/0x21
[67334.951005] [<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
[67334.951005] Code: 48 3d 20 c5 18 a1 0f 84 73 02 00 00 e8 69 66 02 00 66 0f 1f 44 00 00 4d 8d bc 24 a0 00 00 00 4c 89 ff e8 28 b2 67 e0 49 8b 04 24 <f6> 40 1c 01 0f 84 d2 03 00 00 41 f6 84 24 74 01 00 00 01 0f 85
Crashdump and such in http://testing.linuxhacker.ru/lustre-reports/external/crashes/boilpot-bigmem-18-2022-06-08-17:49:52/