Details
-
Bug
-
Resolution: Fixed
-
Major
-
Lustre 2.10.6
-
None
-
3
-
9223372036854775807
Description
We have seen this at least 4 time.
5426573.363714] Lustre: Mounted nbp16-client [5428374.627398] general protection fault: 0000 [#1] [5428374.627407] Lustre: Unmounted nbp14-client [5428374.636811] SMP [5428374.639106] 5428374.639307] Modules linked in: vtsspp(OEN) sep5(OEN) socperf3(OEN) pax(OEN) osc(OEN) mgc(OEN) lustre(OEN) lmv(OEN) fld(OEN) mdc(OEN) fid(OEN) lov(OEN) ko2iblnd(OEN) ptlrpc(OEN) obdclass(OEN) lnet(OEN) libcfs(OEN) beegfs(OEN) rdma_ucm(OEX) ib_ucm(OEX) rdma_cm(OEX) iw_cm(OEX) configfs(E) ib_ipoib(OEX) inet_lro(E) ib_cm(OEX) ib_uverbs(OEX) ib_umad(OEX) mlx4_ib(OEX) ib_core(OEX) mlx4_core(OEX) devlink(E) mlx_compat(OEX) iscsi_ibft(E) iscsi_boot_sysfs(E) msr(E) joydev(E) intel_rapl(E) x86_pkg_temp_thermal(E) intel_powerclamp(E) coretemp(E) kvm_intel(E) kvm(E) irqbypass(E) crct10dif_pclmul(E) crc32_pclmul(E) ghash_clmulni_intel(E) drbg(E) ansi_cprng(E) ipmi_ssif(E) iTCO_wdt(E) iTCO_vendor_support(E) aesni_intel(E) aes_x86_64(E) lrw(E) gf128mul(E) glue_helper(E) mgag200(E) ablk_helper(E) cryptd(E) ttm(E) [5428374.711255] acpi_cpufreq(E) drm_kms_helper(E) pcspkr(E) drm(E) syscopyarea(E) sysfillrect(E) sysimgblt(E) fb_sys_fops(E) lpc_ich(E) mei_me(E) i2c_i801(E) mfd_core(E) mei(E) ioatdma(E) shpchp(E) ipmi_si(E) wmi(E) ipmi_devintf(E) ipmi_msghandler(E) processor(E) button(E) tcp_bic(EN) hwperf(OEX) numatools(OEX) xpmem(OEX) gru(OEX) xvma(OEX) sg(E) dm_multipath(E) dm_mod(E) scsi_dh_rdac(E) scsi_dh_emc(E) scsi_dh_alua(E) autofs4(E) nfsv3(E) nfs_acl(E) nfs(E) lockd(E) grace(E) sunrpc(E) fscache(E) bridge(E) stp(E) llc(E) hid_generic(E) usbhid(E) ahci(E) libahci(E) ehci_pci(E) libata(E) ehci_hcd(E) igb(E) i2c_algo_bit(E) dca(E) ptp(E) scsi_mod(E) usbcore(E) pps_core(E) usb_common(E) af_packet(E) crc32c_intel(E) fjes(E) [last unloaded: socperf2_0] [5428374.776352] Supported: No, Unsupported modules are loaded [5428374.782187] CPU: 23 PID: 85345 Comm: umount Tainted: G OE NX 4.4.162-94.72.1.20181113-nasa #1 [5428374.792175] Hardware name: SGI.COM ICE-XIP113/X9DRT-Dakota, BIOS DA0E2016 02/01/2016 [5428374.800341] task: ffff88026ade1000 ti: ffff88026ade4000 task.ti: ffff88026ade4000 [5428374.808253] RIP: 0010:[<ffffffffa07a47dd>] [<ffffffffa07a47dd>] mdc_changelog_cdev_finish+0x3d/0x1b1 [mdc] [5428374.818437] RSP: 0018:ffff88026ade7b68 EFLAGS: 00010286 [5428374.824175] RAX: 5a5a5a5a5a5a4b62 RBX: ffff88040e20e008 RCX: ffff88037b826fb0 [5428374.831741] RDX: 5a5a5a5a5a5a5a5a RSI: ffff88037b826f40 RDI: ffff88040e20e008 [5428374.839306] RBP: 0000000000000000 R08: 0000000000000c3a R09: 0000000000000000 [5428374.846863] R10: 0000000000000000 R11: ffff8807c8d833c6 R12: 0000000000000000 [5428374.854421] R13: ffff88040e20e048 R14: ffff880d1635f000 R15: ffff880cf81e6b60 [5428374.861978] FS: 00007ffff7fd1880(0000) GS:ffff88085fb40000(0000) knlGS:0000000000000000 [5428374.870489] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [5428374.876661] CR2: 00007ffff7ff6000 CR3: 0000000371dbe000 CR4: 0000000000160670 [5428374.884227] Stack: [5428374.886678] ffff88040e20e008 0000000000000000 ffffffffa07904fa ffff88040e20e008 [5428374.894566] 0000000000000000 0000000000000000 ffffffffa0b8bc9c ffff88026ade7bf8 [5428374.902452] ffffffffa0a6afb7 ffff880200000010 ffff88026ade7c08 ffff88026ade7bc8 [5428374.910337] Call Trace: [5428374.913246] [<ffffffffa07904fa>] mdc_precleanup+0x2a/0x3f0 [mdc] [5428374.919816] [<ffffffffa0b8bc9c>] class_cleanup+0x26c/0xc40 [obdclass] [5428374.926811] [<ffffffffa0b8e5ba>] class_process_config+0x190a/0x2360 [obdclass] [5428374.934582] [<ffffffffa0b8f1ba>] class_manual_cleanup+0x1aa/0x6a0 [obdclass] [5428374.942177] [<ffffffffa0f6f341>] ll_put_super+0x111/0x9f0 [lustre] [5428374.948881] [<ffffffff81212a1c>] generic_shutdown_super+0x6c/0xf0 [5428374.955497] [<ffffffff81212aae>] kill_anon_super+0xe/0x20 [5428374.961416] [<ffffffff8121236f>] deactivate_locked_super+0x3f/0x70 [5428374.968117] [<ffffffff8122da1b>] cleanup_mnt+0x3b/0x80 [5428374.973775] [<ffffffff8109f718>] task_work_run+0x78/0x90 [5428374.979609] [<ffffffff8107d3cf>] exit_to_usermode_loop+0x91/0xc2 [5428374.986136] [<ffffffff81003ae5>] syscall_return_slowpath+0x85/0xa0 [5428374.992837] [<ffffffff8161dfec>] int_ret_from_sys_call+0x8/0x6d [5428375.002321] DWARF2 unwinder stuck at int_ret_from_sys_call+0x8/0x6d [5428375.009019] [5428375.010951] Leftover inexact backtrace: [5428375.017130] Code: 3d 90 21 7b a0 48 8d b0 78 ff ff ff 0f 84 d0 00 00 00 48 8b 56 70 48 8d 4e 70 48 39 d1 48 8d 82 08 f1 ff ff 75 1c e9 9d 00 00 00 <48> 8b 90 f8 0e 00 00 48 39 d1 48 8d 82 08 f1 ff ff 0f 84 86 00 [5428375.037514] RIP [<ffffffffa07a47dd>] mdc_changelog_cdev_finish+0x3d/0x1b1 [mdc] [5428375.045359] RSP <ffff88026ade7b68>
Attachments
Issue Links
- duplicates
-
LU-11626 mdc: obd might go away while referenced by code in mdc_changelog
- Resolved