Details
- Bug
- Resolution: Duplicate
- Minor
- None
- Lustre 2.16.0
- 3
Description
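The MDS crashes with a "list_add double add" BUG in upcall_cache_get_entry(), reached from mdt_identity_get() while handling a getattr intent. Stack trace: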
[28571.627000] list_add double add: new=ffff8953191d5d40, prev=ffff8953191d5d40, next=ffff8953191d5d40.
[28571.629294] ------------[ cut here ]------------
[28571.630471] kernel BUG at lib/list_debug.c:31!
[28571.631630] invalid opcode: 0000 [#1] SMP NOPTI
[28571.632700] CPU: 13 PID: 3392 Comm: mdt06_002 Kdump: loaded Tainted: G OE --------- - - 4.18.0-425.13.1.el8_lustre.ddn17.x86_64 #1
[28571.634620] Hardware name: DDN SFA7990XE, BIOS rel-1.16.0-0-gd239552ce722-prebuilt.qemu.org 04/01/2014
[28571.636131] RIP: 0010:__list_add_valid+0x45/0x50
[28571.637195] Code: 00 48 39 c7 74 0f 48 39 d7 74 0a b8 01 00 00 00 e9 90 e1 71 00 48 89 f2 4c 89 c1 48 89 fe 48 c7 c7 80 fb 92 ad e8 7f 10 c8 ff <0f> 0b 66 0f 1f 84 00 00 00 00 00 48 8b 07 48 8b 57 08 48 b9 00 01
[28571.640407] RSP: 0018:ffff9dff4ea0fa58 EFLAGS: 00010246
[28571.641506] RAX: 0000000000000058 RBX: ffff8953191d1800 RCX: 0000000000000000
[28571.642856] RDX: 0000000000000000 RSI: ffff897569b56698 RDI: ffff897569b56698
[28571.644200] RBP: ffff8953191d5d40 R08: 0000000000000000 R09: c0000000ffff7fff
[28571.645570] R10: 0000000000000001 R11: ffff9dff4ea0f878 R12: ffff8953191d5d40
[28571.646930] R13: ffff8953191d1800 R14: ffff8953191d5d40 R15: ffff8953191d5d40
[28571.648308] FS: 0000000000000000(0000) GS:ffff897569b40000(0000) knlGS:0000000000000000
[28571.649781] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[28571.650965] CR2: 000000c00081b000 CR3: 000000225a410003 CR4: 0000000000770ee0
[28571.652342] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[28571.653666] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[28571.654994] PKRU: 55555554
[28571.655767] Call Trace:
[28571.656501] upcall_cache_get_entry+0x1cf/0xb60 [obdclass]
[28571.657767] ? null_alloc_rs+0xb7/0x370 [ptlrpc]
[28571.658905] mdt_identity_get+0x2b/0x60 [mdt]
[28571.659994] old_init_ucred_common+0x17c/0x370 [mdt]
[28571.661077] mdt_init_ucred+0x224/0x2d0 [mdt]
[28571.662068] mdt_intent_getattr+0xac/0x440 [mdt]
[28571.663088] ? mdt_getattr_name+0x260/0x260 [mdt]
[28571.664106] mdt_intent_opc+0x452/0xa80 [mdt]
[28571.665057] mdt_intent_policy+0x1fd/0x390 [mdt]
[28571.666041] ldlm_lock_enqueue+0x469/0xa90 [ptlrpc]
[28571.667176] ? cfs_hash_bd_add_locked+0x1f/0x90 [libcfs]
[28571.668271] ldlm_handle_enqueue0+0x61a/0x16e0 [ptlrpc]
[28571.669414] tgt_enqueue+0xa4/0x200 [ptlrpc]
[28571.670437] tgt_request_handle+0xc94/0x1890 [ptlrpc]
[28571.671521] ? ptlrpc_nrs_req_get_nolock0+0xff/0x1f0 [ptlrpc]
[28571.672689] ptlrpc_server_handle_request+0x323/0xbd0 [ptlrpc]
[28571.673875] ? lprocfs_counter_add+0x12a/0x1a0 [obdclass]
[28571.674964] ptlrpc_main+0xbf3/0x1540 [ptlrpc]
[28571.675977] ? __schedule+0x2d9/0x860
[28571.676781] ? ptlrpc_register_service+0xfb0/0xfb0 [ptlrpc]
[28571.677938] kthread+0x10b/0x130
[28571.678681] ? set_kthread_struct+0x50/0x50
[28571.679553] ret_from_fork+0x1f/0x40
[28571.680339] Modules linked in: ofd(OE) ost(OE) osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc(OE) obdclass(OE) ksocklnd(OE) lnet(OE) libcfs(OE) sctp ip6_udp_tunnel udp_tunnel libcrc32c rdma_ucm(OE) intel_rapl_msr rdma_cm(OE) iw_cm(OE) intel_rapl_common ib_ipoib(OE) ib_cm(OE) isst_if_common nfit ib_umad(OE) libnvdimm kvm_intel kvm bochs drm_vram_helper irqbypass drm_ttm_helper crct10dif_pclmul crc32_pclmul sunrpc ghash_clmulni_intel ttm iTCO_wdt ppdev iTCO_vendor_support rapl drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm i2c_i801 joydev pcspkr lpc_ich i6300esb parport_pc parport sch_fq tcp_htcp ext4 mbcache jbd2 mlx5_ib(OE) ib_uverbs(OE) ib_core(OE) sr_mod sd_mod cdrom t10_pi sg mlx5_core(OE) mlxfw(OE) pci_hyperv_intf ahci tls libahci virtio_net psample libata mlxdevm(OE) crc32c_intel net_failover serio_raw virtio_blk igbvf mlx_compat(OE) virtio_scsi
[28571.680539] failover dm_mirror dm_region_hash dm_log dm_mod
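For readers unfamiliar with the first line of the trace: the message comes from the kernel's list debugging check (__list_add_valid in lib/list_debug.c), which refuses an insertion when the new entry is the same as one of its would-be neighbours. Here new, prev, and next are all ffff8953191d5d40, so the entry was being linked relative to itself. The following is only a simplified Python model of that kind of check, not the kernel code and not the Lustre upcall cache; the names ListHead, list_add_check, and list_add are hypothetical and chosen for illustration.

```python
class ListHead:
    """Minimal stand-in for a circular doubly linked list node."""

    def __init__(self, name):
        self.name = name
        # An initialized, empty node points at itself in both directions.
        self.prev = self
        self.next = self


def list_add_check(new, prev, next_):
    """Return an error string if the insertion looks corrupt, else None."""
    if next_.prev is not prev:
        return "list_add corruption: next->prev should be prev"
    if prev.next is not next_:
        return "list_add corruption: prev->next should be next"
    # The case reported in the trace: new equals prev and/or next.
    if new is prev or new is next_:
        return "list_add double add"
    return None


def list_add(new, head):
    """Insert new right after head, rejecting insertions the check flags."""
    prev, next_ = head, head.next
    err = list_add_check(new, prev, next_)
    if err is not None:
        raise RuntimeError(err)
    next_.prev = new
    new.next = next_
    new.prev = prev
    prev.next = new


def try_add(new, head):
    """Helper for the examples: return the error text or None on success."""
    try:
        list_add(new, head)
    except RuntimeError as exc:
        return str(exc)
    return None
```

In this model, adding a self-linked node after itself (try_add(e, e)) reproduces the new == prev == next pattern from the log, and adding the same node twice after the same head is rejected with the same message.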
Issue Links
- is related to LU-16498 change upcall uc_lock to read-write lock (Resolved)