[LU-11654] kernel crash on client eviction with CL_OPEN changelog enabled Created: 10/Nov/18  Updated: 24/May/19  Resolved: 17/Nov/18

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.12.0
Fix Version/s: Lustre 2.12.0

Type: Bug Priority: Minor
Reporter: Jinshan Xiong Assignee: Jinshan Xiong
Resolution: Fixed Votes: 0
Labels: None

Issue Links:
Related
Severity: 1
Rank (Obsolete): 9223372036854775807

 Description   

Encounter this issue in my test:

[5996825.601469] Hardware name: Quanta Cloud Technology Inc. QuantaGrid D51PH-1ULH/S2PH-MB, BIOS S2P_3B04 03/04/2016
{{ [5996825.611713] task: ffff8fbf68309fa0 ti: ffff8fbf6cc9c000 task.ti: ffff8fbf6cc9c000}}
{{ [5996825.619358] RIP: 0010:[<ffffffffc17c2b41>] [<ffffffffc17c2b41>] mdd_close+0x241/0xf40 [mdd]}}
{{ [5996825.627991] RSP: 0018:ffff8fbf6cc9fcb0 EFLAGS: 00010246}}
{{ [5996825.633468] RAX: 0000000000000000 RBX: ffff8fa7fda20ba0 RCX: 00000000000000c3}}
{{ [5996825.640766] RDX: 0000000000000001 RSI: ffffffffc0b3fec0 RDI: ffff8fa7fda20ba0}}
{{ [5996825.648064] RBP: ffff8fbf6cc9fd08 R08: 0000000000000001 R09: ffffffffc1026a71}}
{{ [5996825.655362] R10: 0000000000000000 R11: ffffd41edd731800 R12: ffff8fbf6cc9fd90}}
{{ [5996825.662659] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000001}}
{{ [5996825.669958] FS: 0000000000000000(0000) GS:ffff8fafffbc0000(0000) knlGS:0000000000000000}}
{{ [5996825.678218] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033}}
{{ [5996825.684128] CR2: 0000000000000018 CR3: 0000000f9fe0e000 CR4: 00000000003607e0}}
{{ [5996825.691427] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000}}
{{ [5996825.698732] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400}}
{{ [5996825.706030] Call Trace:}}
{{ [5996825.708679] [<ffffffffc163165b>] mdt_mfd_close+0x3cb/0x810 [mdt]}}
{{ [5996825.714951] [<ffffffffc15fe55e>] mdt_obd_disconnect+0x36e/0x670 [mdt]}}
{{ [5996825.721689] [<ffffffffc0ff9914>] class_fail_export+0x224/0x520 [obdclass]}}
{{ [5996825.728794] [<ffffffffc11e5a38>] expired_lock_main+0x5f8/0xc00 [ptlrpc]}}
{{ [5996825.735665] [<ffffffff9dccf670>] ? wake_up_state+0x20/0x20}}
{{ [5996825.741449] [<ffffffffc11e5440>] ? ldlm_server_glimpse_ast+0x3f0/0x3f0 [ptlrpc]}}
{{ [5996825.749012] [<ffffffff9dcbb621>] kthread+0xd1/0xe0}}
{{ [5996825.754065] [<ffffffff9dcbb550>] ? insert_kthread_work+0x40/0x40}}
{{ [5996825.760333] [<ffffffff9e3205f7>] ret_from_fork_nospec_begin+0x21/0x21}}
{{ [5996825.767030] [<ffffffff9dcbb550>] ? insert_kthread_work+0x40/0x40}}
{{ [5996825.773285] Code: 74 50 4c 89 e7 44 89 45 c0 e8 bc f5 87 ff 48 85 c0 44 8b 45 c0 74 06 80 78 70 00 74 35 48 8b 45 b8 8b 4d c4 48 89 df 44 89 45 c0 <8b> 50 18 8b 70 14 e8 34 ae ff ff 48 85 c0 48 89 c2 44 8b 45 c0}}
{{ [5996825.793410] RIP [<ffffffffc17c2b41>] mdd_close+0x241/0xf40 [mdd]}}
{{ [5996825.799703] RSP <ffff8fbf6cc9fcb0>}}
{{ [5996825.803360] CR2: 0000000000000018}}

}}

The root cause of this issue is that mdd_close() assumes that lu_ucred is always initialized.

 

Patch is coming soon



 Comments   
Comment by Gerrit Updater [ 10/Nov/18 ]

Jinshan Xiong (jinshan.xiong@gmail.com) uploaded a new patch: https://review.whamcloud.com/33638
Subject: LU-11654 mdd: check ucred before using it
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 6a87b1ec872956c6b31f44cf4190f2a350aa4f37

Comment by Gerrit Updater [ 17/Nov/18 ]

Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/33638/
Subject: LU-11654 mdd: check ucred before using it
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: ebf742028b57a88817b26d6fb7748110ec15d31c

Comment by Peter Jones [ 17/Nov/18 ]

Landed for 2.12

Generated at Sat Feb 10 02:45:46 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.