Details
-
Bug
-
Resolution: Fixed
-
Critical
-
Lustre 2.14.0
-
None
-
3
-
9223372036854775807
Description
This seems to only trigger when running in "one unmount/mount per subtest" mode like this:
FLAKEY=false REFORMAT=yes ONLY="160-999" MDSCOUNT=2 SETUP=setup CLEANUP=cleanup sh sanity.sh
The crash is 100% reproducible on a single node testing and looks like this:
[421223.630585] Lustre: server umount lustre-OST0001 complete [421223.879781] general protection fault: 0000 [#1] SMP DEBUG_PAGEALLOC [421223.881119] Modules linked in: ofd(OE) osp(OE) lod(OE) ost(OE) mdt(OE) mdd(OE) mgs(OE) osd_ldiskfs(OE) ldiskfs(OE) lquota(OE) lfsck(OE) obdecho(OE) mgc(OE) lov(OE) mdc(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc_gss(OE) ptlrpc(OE) obdclass(OE) ksocklnd(OE) lnet(OE) libcfs(OE) pcc_cpufreq loop zfs(PO) zunicode(PO) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) jbd2 mbcache crc32_generic crc_t10dif crct10dif_generic crct10dif_common virtio_balloon pcspkr virtio_console i2c_piix4 binfmt_misc ip_tables rpcsec_gss_krb5 ata_generic pata_acpi drm_kms_helper ttm drm ata_piix drm_panel_orientation_quirks serio_raw libata virtio_blk i2c_core floppy [last unloaded: lustre] [421223.890017] CPU: 1 PID: 18476 Comm: sh Kdump: loaded Tainted: P OE ------------ 3.10.0-7.7-debug #1 [421223.890922] Hardware name: Bochs Bochs, BIOS Bochs 01/01/2011 [421223.891370] task: ffff8803b185a340 ti: ffff8800ca418000 task.ti: ffff8800ca418000 [421223.892207] RIP: 0010:[<ffffffff8110a016>] [<ffffffff8110a016>] module_put+0x26/0xc0 [421223.893060] RSP: 0018:ffff8800ca41bda8 EFLAGS: 00010202 [421223.893504] RAX: ffff8800ca41bfd8 RBX: 6b6b6b6b6b6b6b6b RCX: ffff8800ca41bce0 [421223.894355] RDX: 000000000000000b RSI: 800000038530b060 RDI: 6b6b6b6b6b6b6b6b [421223.895185] RBP: ffff8800ca41bdc0 R08: 0000000000000001 R09: 0000000000000067 [421223.896016] R10: ffff880000000148 R11: 0000000000000000 R12: 6b6b6b6b6b6b6b6b [421223.896871] R13: ffff8803dd8d6d30 R14: ffff8803ca1ff168 R15: ffff8803d43ece60 [421223.897696] FS: 00007ffa9ed7f740(0000) GS:ffff8803f4640000(0000) knlGS:0000000000000000 [421223.898540] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [421223.898991] CR2: 00007f119516c8d0 CR3: 0000000001c10000 CR4: 00000000000006e0 [421223.899845] Call Trace: [421223.900249] [<ffffffff812448e0>] cdev_put+0x20/0x30 [421223.900684] [<ffffffff812415c6>] __fput+0x266/0x300 [421223.901129] [<ffffffff8124173e>] ____fput+0xe/0x10 [421223.901565] [<ffffffff810b4af5>] task_work_run+0xb5/0xf0 [421223.902010] [<ffffffff81092395>] do_exit+0x2f5/0xb30 [421223.902450] [<ffffffff817e0ee1>] ? system_call_after_swapgs+0xae/0x146 [421223.902941] [<ffffffff81092c4f>] do_group_exit+0x3f/0xa0 [421223.904158] [<ffffffff81092cc4>] SyS_exit_group+0x14/0x20 [421223.904605] [<ffffffff817e0f98>] system_call_fastpath+0x1f/0x24 [421223.905071] [<ffffffff817e0ee1>] ? system_call_after_swapgs+0xae/0x146 [421223.905550] Code: 00 00 00 00 00 0f 1f 44 00 00 55 48 85 ff 48 89 e5 41 55 41 54 49 89 fc 53 74 3c 65 48 8b 04 25 f8 0e 01 00 83 80 44 c0 ff ff 01 <48> 8b 87 30 02 00 00 65 48 ff 40 08 4c 8b 6d 08 0f 1f 44 00 00 [421223.907311] RIP [<ffffffff8110a016>] module_put+0x26/0xc0 [421223.907761] RSP <ffff8800ca41bda8>
I thin kit must be due to some recent module changes for upstreaming/cleanup reasons?
Attachments
Issue Links
- is related to
-
LU-12506 Client unable to mount filesystem with very large number of MDTs
- Resolved