[LU-14457] racer test_1: crash with directory migration Created: 19/Feb/21  Updated: 09/Sep/21

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.14.0
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Issue Links:
Related
is related to LU-6963 racer test_1: lu_object_attr() ASSERT... Open
is related to LU-7145 mdd_object_type() uses need audit Open
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for Andreas Dilger <adilger@whamcloud.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/07632945-d143-4352-9ed9-297c98f1fdce

test_1 crashed after running for 3100s with the following error:

[ 3557.862509] LustreError: 31802:0:(lu_object.h:885:lu_object_attr()) ASSERTION( ((o)->lo_header->loh_attr & LOHA_EXISTS) != 0 ) failed: 
[ 3557.866245] LustreError: 31802:0:(lu_object.h:885:lu_object_attr()) LBUG
[ 3557.888908] Kernel panic - not syncing: LBUG
[ 3557.889650] CPU: 1 PID: 31802 Comm: mdt00_075 Kdump: loaded  3.10.0-1127.19.1.el7_lustre.x86_64 #1
[ 3557.891877] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
[ 3557.892830] Call Trace:
[ 3557.893247]  [<ffffffffb877ffa5>] dump_stack+0x19/0x1b
[ 3557.894137]  [<ffffffffb8779541>] panic+0xe8/0x21f
[ 3557.894929]  [<ffffffffc097d9eb>] lbug_with_loc+0x9b/0xa0 [libcfs]
[ 3557.895951]  [<ffffffffc131e7a0>] lu_object_attr.isra.21.part.22+0x36/0x36 [mdt]
[ 3557.897156]  [<ffffffffc12e0bb3>] mdt_link_parents_lock.isra.53+0x14c3/0x15a0 [mdt]
[ 3557.898477]  [<ffffffffc145e19b>] ? __mdd_lookup.isra.17+0x28b/0x430 [mdd]
[ 3557.899810]  [<ffffffffc0bbe3e9>] ? lu_object_find+0x19/0x20 [obdclass]
[ 3557.900900]  [<ffffffffc12d565a>] ? mdt_migrate_lookup.isra.42+0x47a/0x870 [mdt]
[ 3557.902171]  [<ffffffffc12be2ca>] ? mdt_remote_object_lock+0x2a/0x30 [mdt]
[ 3557.903298]  [<ffffffffc12e1545>] mdt_reint_migrate+0x8b5/0xfb0 [mdt]
[ 3557.905607]  [<ffffffffc12e1cc3>] mdt_reint_rec+0x83/0x210 [mdt]
[ 3557.906597]  [<ffffffffc12b9a30>] mdt_reint_internal+0x720/0xaf0 [mdt]
[ 3557.907667]  [<ffffffffc12c55c7>] mdt_reint+0x67/0x140 [mdt]
[ 3557.908671]  [<ffffffffc0f136fa>] tgt_request_handle+0x7ea/0x1750 [ptlrpc]
[ 3557.911001]  [<ffffffffc0eb31a6>] ptlrpc_server_handle_request+0x256/0xb10 [ptlrpc]
[ 3557.912283]  [<ffffffffc0eb7cfc>] ptlrpc_main+0xb3c/0x14e0 [ptlrpc]
[ 3557.914543]  [<ffffffffb80c6691>] kthread+0xd1/0xe0

Since this is during racer, it is entirely possible that the source or target file was removed at the same time.

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
racer test_1 - trevis-55vm4 crashed during racer test_1



 Comments   
Comment by Alex Zhuravlev [ 28/Jul/21 ]

hitting this as well:

 LustreError: 9866:0:(lu_object.h:883:lu_object_attr()) ASSERTION( ((o)->lo_header->loh_attr & LOHA_EXISTS) != 0 ) failed: in racer / 1

Trace:

PID: 9866   TASK: ffff8800da804000  CPU: 1   COMMAND: "mdt00_012"
 #0 [ffff8800da827a78] panic at ffffffff810af881
    /tmp/kernel/kernel/panic.c: 265
 #1 [ffff8800da827af8] mdt_link_parents_lock at ffffffffa0c91ade [mdt]
    /home/lustre/master-mine/lustre/include/lu_object.h: 883
 #2 [ffff8800da827be0] mdt_reint_migrate at ffffffffa0c93735 [mdt]
    /home/lustre/master-mine/lustre/mdt/mdt_reint.c: 2258
 #3 [ffff8800da827d18] mdt_reint_rec at ffffffffa0c95eb7 [mdt]
    /home/lustre/master-mine/lustre/mdt/mdt_reint.c: 3100
 #4 [ffff8800da827d48] mdt_reint_internal at ffffffffa0c6b24c [mdt]
    /home/lustre/master-mine/lustre/mdt/mdt_handler.c: 2760
 #5 [ffff8800da827d80] mdt_reint at ffffffffa0c76d37 [mdt]
    /home/lustre/master-mine/lustre/mdt/mdt_handler.c: 2834
 #6 [ffff8800da827da8] tgt_request_handle at ffffffffa05904b0 [ptlrpc]
    /home/lustre/master-mine/lustre/include/lu_target.h: 618
 #7 [ffff8800da827e20] ptlrpc_main at ffffffffa05469b0 [ptlrpc]
    /home/lustre/master-mine/lustre/include/lustre_net.h: 2379
 #8 [ffff8800da827f10] kthread at ffffffff810d02aa
    /tmp/kernel/kernel/kthread.c: 246
 #9 [ffff8800da827f50] ret_from_fork at ffffffff818001c4
    /tmp/kernel/arch/x86/entry/entry_64.S: 422
Generated at Sat Feb 10 03:09:56 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.