[LU-13816] LustreError: 18408:0:(mdt_handler.c:892:mdt_big_xattr_get()) ASSERTION( info->mti_big_lmm_used == 0 ) failed:
Created: 23/Jul/20  Updated: 05/Apr/23  Resolved: 05/Aug/20

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.12.3
Fix Version/s: None

Type: Bug
Priority: Critical
Reporter: Mahmoud Hanafi
Assignee: Lai Siyao
Resolution: Duplicate
Votes: 0
Labels: None

Issue Links:
Duplicate
duplicates LU-13599 LustreError: 30166:0:(service.c:189:p... Resolved
Related
is related to LU-16206 PCC crashes MDS: mdt_big_xattr_get())... Open
Severity: 3

Description

While running an MDT migration with the following command:

lfs migrate -v -m 2 dir1

we hit an LBUG on the mdt2 server:

[5435767.755756] LustreError: 18408:0:(mdt_handler.c:892:mdt_big_xattr_get()) ASSERTION( info->mti_big_lmm_used == 0 ) failed: 
[5435767.789499] LustreError: 18408:0:(mdt_handler.c:892:mdt_big_xattr_get()) LBUG
[5435767.811480] Pid: 18408, comm: mdt02_017 3.10.0-1062.12.1.el7_lustre2124.x86_64 #1 SMP Tue Mar 17 13:32:19 PDT 2020
[5435767.811481] Call Trace:
[5435767.811491] [<ffffffffc0b8d7cc>] libcfs_call_trace+0x8c/0xc0 [libcfs]
[5435767.816221] 
[5435767.816225] [<ffffffffc0b8d87c>] lbug_with_loc+0x4c/0xa0 [libcfs]
[5435767.816240] [<ffffffffc1870cc0>] mdt_big_xattr_get+0x640/0x810 [mdt]
[5435767.816247] [<ffffffffc18710c7>] mdt_stripe_get+0x237/0x400 [mdt]
[5435767.816257] [<ffffffffc18935eb>] mdt_reint_migrate+0x101b/0x1310 [mdt]
[5435767.816265] [<ffffffffc1893963>] mdt_reint_rec+0x83/0x210 [mdt]
[5435767.816272] [<ffffffffc1870273>] mdt_reint_internal+0x6e3/0xaf0 [mdt]
[5435767.816280] [<ffffffffc187b6e7>] mdt_reint+0x67/0x140 [mdt]
[5435767.816326] [<ffffffffc15b53ca>] tgt_request_handle+0xada/0x1570 [ptlrpc]
[5435767.816351] [<ffffffffc155947b>] ptlrpc_server_handle_request+0x24b/0xab0 [ptlrpc]
[5435767.816375] [<ffffffffc155cde4>] ptlrpc_main+0xb34/0x1470 [ptlrpc]
[5435767.816378] [<ffffffff836c61f1>] kthread+0xd1/0xe0
[5435767.816381] [<ffffffff83d8dd1d>] ret_from_fork_nospec_begin+0x7/0x21
[5435767.816395] [<ffffffffffffffff>] 0xffffffffffffffff
[5435767.816396] Kernel panic - not syncing: LBUG
[5435767.816397] CPU: 5 PID: 18408 Comm: mdt02_017 Kdump: loaded Tainted: G OE ------------ 3.10.0-1062.12.1.el7_lustre2124.x86_64 #1
[5435767.816398] Hardware name: HPE ProLiant DL380 Gen10/ProLiant DL380 Gen10, BIOS U30 06/15/2018
[5435767.816398] Call Trace:
[5435767.816402] [<ffffffff83d7ac43>] dump_stack+0x19/0x1b
[5435767.816404] [<ffffffff83d74987>] panic+0xe8/0x21f
[5435767.816410] [<ffffffffc0b8d8cb>] lbug_with_loc+0x9b/0xa0 [libcfs]
[5435767.816417] [<ffffffffc1870cc0>] mdt_big_xattr_get+0x640/0x810 [mdt]
[5435767.816425] [<ffffffffc18710c7>] mdt_stripe_get+0x237/0x400 [mdt]
[5435767.816433] [<ffffffffc18935eb>] mdt_reint_migrate+0x101b/0x1310 [mdt]
[5435767.816458] [<ffffffffc12d7039>] ? check_unlink_entry+0x19/0xd0 [obdclass]
[5435767.816472] [<ffffffffc12d7c88>] ? upcall_cache_get_entry+0x218/0x8b0 [obdclass]
[5435767.816481] [<ffffffffc1893963>] mdt_reint_rec+0x83/0x210 [mdt]
[5435767.816489] [<ffffffffc1870273>] mdt_reint_internal+0x6e3/0xaf0 [mdt]
[5435767.816496] [<ffffffffc187b6e7>] mdt_reint+0x67/0x140 [mdt]
[5435767.816523] [<ffffffffc15b53ca>] tgt_request_handle+0xada/0x1570 [ptlrpc]
[5435767.816529] [<ffffffffc0b93fa7>] ? libcfs_debug_msg+0x57/0x80 [libcfs]
[5435767.816554] [<ffffffffc155947b>] ptlrpc_server_handle_request+0x24b/0xab0 [ptlrpc]
[5435767.816557] [<ffffffff836d3903>] ? __wake_up+0x13/0x20
[5435767.816580] [<ffffffffc155cde4>] ptlrpc_main+0xb34/0x1470 [ptlrpc]
[5435767.816582] [<ffffffff83d805c2>] ? __schedule+0x402/0x840
[5435767.816605] [<ffffffffc155c2b0>] ? ptlrpc_register_service+0xf80/0xf80 [ptlrpc]
[5435767.816607] [<ffffffff836c61f1>] kthread+0xd1/0xe0
[5435767.816608] [<ffffffff836c6120>] ? insert_kthread_work+0x40/0x40
[5435767.816609] [<ffffffff83d8dd1d>] ret_from_fork_nospec_begin+0x7/0x21
[5435767.816611] [<ffffffff836c6120>] ? insert_kthread_work+0x40/0x40

 


Comments
Comment by Peter Jones [ 23/Jul/20 ]

Lai

Can you please advise?

Thanks

Peter

Comment by Gerrit Updater [ 28/Jul/20 ]

Lai Siyao (lai.siyao@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/39519
Subject: LU-13816 mdt: read stripe into temp buf
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 8dff94d71376c7dc08677294304723c289d6bb89

Comment by Lai Siyao [ 05/Aug/20 ]

This is a duplicate of LU-13599, and it's fixed there.
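
For readers unfamiliar with the assertion, here is a toy model of the failure mode, not Lustre code. It assumes that mti_big_lmm_used marks a single per-thread buffer as claimed, and that handling one migrate request can call mdt_stripe_get() more than once. The exact sequence inside mdt_reint_migrate() is not shown in this ticket. All names in the sketch (ThreadInfo, big_xattr_get, stripe_get_temp, migrate) are hypothetical. The temp-buffer variant only illustrates what the patch subject "read stripe into temp buf" suggests; it is not the actual patch.

```python
class BufferInUseError(Exception):
    """Stands in for the LBUG raised when the shared buffer is claimed twice."""


class ThreadInfo:
    """Hypothetical per-thread request state with one shared big buffer."""

    def __init__(self):
        self.big_buf = bytearray()
        self.big_buf_used = False


def big_xattr_get(info, data):
    """Load data into the shared buffer; fail if it is already claimed."""
    if info.big_buf_used:
        raise BufferInUseError("shared big buffer already in use")
    info.big_buf = bytearray(data)
    info.big_buf_used = True
    return info.big_buf


def stripe_get_temp(data):
    """Read into a private buffer, leaving the shared buffer unclaimed."""
    return bytearray(data)


def migrate(info, first_ea, second_ea, use_temp_buf):
    """Read two layouts while handling a single request."""
    if use_temp_buf:
        first = stripe_get_temp(first_ea)
    else:
        first = big_xattr_get(info, first_ea)
    # The second read claims the shared buffer; it fails if the first did too.
    second = big_xattr_get(info, second_ea)
    return bytes(first), bytes(second)
```

In this model, migrate(ThreadInfo(), b"a", b"b", use_temp_buf=False) raises BufferInUseError on the second read, while the same call with use_temp_buf=True returns both layouts.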
