[LU-15346] sanityn test_43g: kernel BUG at lib/list_debug.c:50! Created: 08/Dec/21  Updated: 08/Dec/21

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.15.0
Fix Version/s: None

Type: Bug Priority: Major
Reporter: Elena Gryaznova Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Attachments: Zip Archive 1638539191-sanityn-dectet_L300-43g.zip.zip    
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

Server 2.14.55_848738a
Client 2.14.55_848738a

[11944.879301] list_del corruption, ffff9d7d011c5048->prev is LIST_POISON2 (dead000000000200)
[11944.892269] ------------[ cut here ]------------
[11944.896306] kernel BUG at lib/list_debug.c:50!
[11944.899815] invalid opcode: 0000 [#1] SMP PTI
[11944.903330] CPU: 0 PID: 421777 Comm: kworker/0:3 Kdump: loaded Tainted: G           OE    ---------r-  - 4.18.0-305.10.2.x6.1.000.41.x86_64 #1
[11944.914498] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
[11944.925092] Workqueue: cgroup_destroy css_release_work_fn
[11944.941234] RIP: 0010:__list_del_entry_valid.cold.1+0x45/0x4c
[11944.947451] Code: e8 8a a3 cb ff 0f 0b 48 89 f2 48 89 fe 48 c7 c7 d8 70 f0 94 e8 76 a3 cb ff 0f 0b 48 89 fe 48 c7 c7 a0 70 f0 94 e8 65 a3 cb ff <0f> 0b 90 90 90 90 90 41 55 41 54 55 53 48 85 d2 74 5f 48 85 f6 74
[11944.963013] RSP: 0018:ffffba2e00f13e68 EFLAGS: 00010246
[11944.965520] RAX: 000000000000004e RBX: ffff9d7d011c5090 RCX: 0000000000000000
[11944.972538] RDX: 0000000000000000 RSI: ffff9d7d3bc167c8 RDI: ffff9d7d3bc167c8
[11944.977396] RBP: ffffffff95626640 R08: 0000000000001252 R09: 00000000ad55ad55
[11944.982439] R10: 0000000000000000 R11: 0000000000000002 R12: ffff9d7d011c5000
[11944.988464] R13: ffff9d7c9f4ce000 R14: ffff9d7d013efd80 R15: ffff9d7d011c5098
[11944.993034] FS:  0000000000000000(0000) GS:ffff9d7d3bc00000(0000) knlGS:0000000000000000
[11944.998445] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[11945.002132] CR2: 00007f8024ec1500 CR3: 000000010ee10000 CR4: 00000000000006f0
[11945.006908] Call Trace:
[11945.008996]  css_release_work_fn+0x3f/0x240
[11945.011934]  process_one_work+0x1a7/0x360
[11945.014740]  worker_thread+0x30/0x390
[11945.017327]  ? create_worker+0x1a0/0x1a0
[11945.020073]  kthread+0x116/0x130
[11945.022436]  ? kthread_flush_work_fn+0x10/0x10
[11945.025476]  ret_from_fork+0x35/0x40
[11945.028074] Modules linked in: dm_flakey dm_mod osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) ldiskfs(OE) lquota(OE) lustre(OE) lmv(OE) mdc(OE) lov(OE) osc(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) libcfs(OE) crc32_generic rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache rdma_ucm(OE) rdma_cm(OE) iw_cm(OE) ib_ipoib(OE) ib_cm(OE) ib_umad(OE) mlx5_ib(OE) mlx5_core(OE) mlxdevm(OE) auxiliary(OE) ib_uverbs(OE) ib_core(OE) mlx_compat(OE) tls psample mlxfw pci_hyperv_intf cirrus drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm sunrpc virtio_balloon joydev pcspkr i2c_piix4 binfmt_misc ip_tables ext4 mbcache jbd2 ata_generic ata_piix libata e1000 serio_raw virtio_blk [last unloaded: dm_mod]


 Comments   
Comment by Alexander Zarochentsev [ 08/Dec/21 ]

it looks as a kernel bug, see https://access.redhat.com/solutions/6094611
the same stack trace:

[570928.662950] RSP: 0018:ffffa22203613e68 EFLAGS: 00010246
[570928.662969] RAX: 000000000000004e RBX: ffff8b3c3b76b090 RCX: 0000000000000000
[570928.662992] RDX: 0000000000000000 RSI: ffff8b3f33d167c8 RDI: ffff8b3f33d167c8
[570928.663014] RBP: ffffffff95826040 R08: 00000000000005b7 R09: 0000000000aaaaaa
[570928.663037] R10: 0000000000000000 R11: ffffa22202dff200 R12: ffff8b3c3b76b000
[570928.663059] R13: ffff8b3f2c0b0000 R14: ffff8b3dfe60d240 R15: ffff8b3c3b76b098
[570928.663082] FS:  0000000000000000(0000) GS:ffff8b3f33d00000(0000) knlGS:0000000000000000
[570928.663107] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[570928.663126] CR2: 00007f2a5bddc500 CR3: 00000001e1a10005 CR4: 00000000003706e0
[570928.663184] Call Trace:
[570928.663204]  css_release_work_fn+0x3f/0x240
[570928.663254]  process_one_work+0x1a7/0x360
[570928.663276]  worker_thread+0x30/0x390
[570928.663291]  ? create_worker+0x1a0/0x1a0
[570928.663305]  kthread+0x116/0x130
[570928.663326]  ? kthread_flush_work_fn+0x10/0x10
[570928.663344]  ret_from_fork+0x35/0x40
Generated at Sat Feb 10 03:17:32 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.