[LU-17446] BL AST should stop resending if lock is cancelled Created: 18/Jan/24  Updated: 01/Feb/24

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Major
Reporter: Oleg Drokin Assignee: Oleg Drokin
Resolution: Unresolved Votes: 0
Labels: None

Issue Links:
Blocker
Related
is related to LU-16357 a mechanism to inform other nodes to ... Open
is related to LU-17426 parallel cross-directory rename of re... Open
is related to LU-16004 Blocking callback for already cancell... Open
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

It was recently observed on at least two sites that AST resend logic will keep resending ASTs until timeout is hit, even if the cancel for that lock has long been received and therefore it makes no sense to insist on getting the AST reply.

Eventually once the timeout is hit, we even see teh ast error message saying as much, that "Hey, the lock was already cancelled" so the client is not even evicted as the result :

 

(client.c:1273:ptlrpc_import_delay_req()) @@@ send limit expired  req@00000000063d38a1
   x1788029434094272/t0(0) lustre-MDT0001@1.1.1.1@tcp:15/16 lens 328/224 e 0 to 1
   dl 1705530737 ref 1 fl Rpc:XQU/2/ffffffff rc 0/-1 job:''


(ldlm_lockd.c:739:ldlm_handle_ast_error()) ### blocking AST (req@00000000063d38a1 x1788029434094272)
   timeout from nid 1.1.1.1@tcp, but cancel was received (AST reply lost?)
   ns: lustre-MDT0001_UUID lock: 000000008783e209/0x4937295654f3a72a lrc: 1/0,0 mode: --/PR
   res: [0x24001144f:0x8c35:0x0].0x0 bits 0x13/0x0 rrc: 6 type: IBT gid 0 flags: 0x44a01400000020
   nid: 1.1.1.2@tcp remote: 0x6196afd4e206854a expref: 7 pid: 376559 timeout: 337171 lvb_type: 0


 Comments   
Comment by Gerrit Updater [ 19/Jan/24 ]

"Oleg Drokin <green@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/53739
Subject: LU-17446 ldlm: Do not wait for AST RPC completion on lock cancel
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 578d772b3481fab2fb5307cb34be8ffffb9f292e

Generated at Sat Feb 10 03:35:30 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.