[LU-14001] kernel BUG at ...ldiskfs/htree_lock.c:717 Created: 29/Sep/20  Updated: 29/Sep/20

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Sergey Cheremencev Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

https://testing.whamcloud.com/test_sets/6651bc93-8d34-44ba-ba7c-2bcf80357069

[21813.884656] Lustre: DEBUG MARKER: /usr/sbin/lctl mark == sanity-sec test 41: test race on encrypted file size \(1\) ========================================== 00:40:55 \(1601340055\)
[21814.324239] Lustre: DEBUG MARKER: == sanity-sec test 41: test race on encrypted file size (1) ========================================== 00:40:55 (1601340055)
[21814.694332] ------------[ cut here ]------------
[21814.695435] kernel BUG at /tmp/rpmbuild-lustre-jenkins-AuR3ylgp/BUILD/lustre-2.13.55_69_gbab39da/ldiskfs/htree_lock.c:717!
[21814.697865] invalid opcode: 0000 [#1] SMP PTI
[21814.698816] CPU: 1 PID: 1478787 Comm: mdt00_002 Kdump: loaded Tainted: G           OE    --------- -  - 4.18.0-193.6.3.el8_lustre.x86_64 #1
[21814.701439] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
[21814.702825] RIP: 0010:htree_lock_try+0x46/0x60 [ldiskfs]
[21814.703962] Code: 00 75 29 48 89 5f 08 89 57 1c f0 48 0f ba 2b 00 72 1c 89 ce e8 5b fe ff ff 85 c0 74 04 f0 80 23 fe f7 d0 5b c1 e8 1f c3 0f 0b <0f> 0b 0f 0b f3 90 48 8b 13 83 e2 01 75 f6 eb d0 66 2e 0f 1f 84 00
[21814.707883] RSP: 0018:ffffa6d2c0af7aa8 EFLAGS: 00010286
[21814.708999] RAX: ffff9a3bb1167750 RBX: ffff9a3bcc2aae40 RCX: 0000000000000001
[21814.710511] RDX: 0000000000000004 RSI: 0000000000000003 RDI: ffff9a3bb1167600
[21814.712023] RBP: ffff9a3bb1167600 R08: ffff9a3bcd2e8000 R09: ffff9a3bf1433480
[21814.713532] R10: ffffa6d2c0af7b68 R11: ffffffffc11db660 R12: ffff9a3bb8214f40
[21814.715041] R13: ffff9a3bf0ce2f40 R14: ffff9a3bcd2e6468 R15: ffff9a3bb1167600
[21814.716550] FS:  0000000000000000(0000) GS:ffff9a3bffd00000(0000) knlGS:0000000000000000
[21814.718251] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[21814.719469] CR2: 00007fe914c05988 CR3: 000000001080a004 CR4: 00000000000606e0
[21814.720986] Call Trace:
[21814.721588]  ldiskfs_htree_lock+0x52/0xc0 [ldiskfs]
[21814.722795]  osd_ea_lookup_rec+0x114/0xd80 [osd_ldiskfs]
[21814.724010]  ? ksocknal_send+0x14b/0x330 [ksocklnd]
[21814.725074]  ? osd_index_ea_lookup+0xea/0x160 [osd_ldiskfs]
[21814.726281]  osd_index_ea_lookup+0xea/0x160 [osd_ldiskfs]
[21814.727536]  __mdd_lookup.isra.19+0x298/0x3b0 [mdd]
[21814.728609]  mdd_lookup+0x109/0x150 [mdd]
[21814.729671]  mdt_lookup_version_check+0x59/0x2d0 [mdt]
[21814.730827]  mdt_create+0x37f/0xc50 [mdt]
[21814.732217]  ? ldlm_request_cancel+0x46/0x740 [ptlrpc]
[21814.733342]  mdt_reint_create+0x30b/0x3c0 [mdt]
[21814.734341]  mdt_reint_rec+0x11f/0x250 [mdt]
[21814.735287]  mdt_reint_internal+0x498/0x780 [mdt]
[21814.736335]  mdt_reint+0x5e/0x100 [mdt]
[21814.737239]  tgt_request_handle+0xc64/0x1840 [ptlrpc]
[21814.738406]  ptlrpc_server_handle_request+0x31a/0xba0 [ptlrpc]
[21814.739729]  ptlrpc_main+0xba4/0x14a0 [ptlrpc]
[21814.740746]  ? __schedule+0x257/0x650
[21814.741596]  ? ptlrpc_register_service+0xfb0/0xfb0 [ptlrpc]
[21814.742838]  kthread+0x112/0x130
[21814.743558]  ? kthread_flush_work_fn+0x10/0x10
[21814.744527]  ret_from_fork+0x35/0x40 


 Comments   
Comment by Sergey Cheremencev [ 29/Sep/20 ]

sanity-sec_41 crushed after failed sanity-sec_39 and sanity-sec_40(LU-14002). Probably related issues.

Generated at Sat Feb 10 03:06:00 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.