[LU-9416] lmv_hsm_ct_register() should add kkuc before sending registration RPCs Created: 28/Apr/17  Updated: 24/Oct/17  Resolved: 19/Oct/17

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: Lustre 2.11.0, Lustre 2.10.2

Type: Bug Priority: Minor
Reporter: John Hammond Assignee: Henri Doreau (Inactive)
Resolution: Fixed Votes: 0
Labels: hsm

Severity: 3
Rank (Obsolete): 9223372036854775807

 Comments   
Comment by Gerrit Updater [ 28/Aug/17 ]

Henri Doreau (henri.doreau@cea.fr) uploaded a new patch: https://review.whamcloud.com/28751
Subject: LU-9416 hsm: add kkuc before sending registration RPCs
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: f76ade97e83d909f0247c93ae4e560ff2b442b0b

Comment by Bruno Faccini (Inactive) [ 04/Oct/17 ]

John,
Can you at least comment/detail here about the main reason/need for this change ??!!
Could this be to avoid that some KUC msgs/requests from Server to Client could be lost, or to cause other inconvenient, during some racy cases ??... Or simply to do local operations first and thus prevent from doing useless remote/MDTs registrations in case of failure ?

Comment by John Hammond [ 04/Oct/17 ]

> Could this be to avoid that some KUC msgs/requests from Server to Client could be lost, or to cause other inconvenient, during some racy cases ??

Correct. I have seen in sanity-hsm runs where the resistration completed and the CDTs sent HSM actions just before the kkuc registration happens. In this case the client drops the actions because there are no CT pipes in the KKUC list. Then the CDT must wait and resent the actions causing a 200+ second delay.

Comment by Bruno Faccini (Inactive) [ 04/Oct/17 ]

So this should be kind of a follow-on to my previous patch for LU-9038, that fixed the crashes that could occur before and during the same race but introduced this retry mechanism from CDT, right...

Comment by John Hammond [ 06/Oct/17 ]

Correct.

Comment by Gerrit Updater [ 19/Oct/17 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/28751/
Subject: LU-9416 hsm: add kkuc before sending registration RPCs
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: bdb04079570dd2de3b9310221e0206d64efb9ecc

Comment by Peter Jones [ 19/Oct/17 ]

Landed for 2.11

Comment by Gerrit Updater [ 19/Oct/17 ]

Minh Diep (minh.diep@intel.com) uploaded a new patch: https://review.whamcloud.com/29687
Subject: LU-9416 hsm: add kkuc before sending registration RPCs
Project: fs/lustre-release
Branch: b2_10
Current Patch Set: 1
Commit: 45b51a053c014a11b49768757ab6d08103f8547e

Comment by Gerrit Updater [ 24/Oct/17 ]

John L. Hammond (john.hammond@intel.com) merged in patch https://review.whamcloud.com/29687/
Subject: LU-9416 hsm: add kkuc before sending registration RPCs
Project: fs/lustre-release
Branch: b2_10
Current Patch Set:
Commit: 46aee8eba9000e458d231c975716feb275b8eb92

Generated at Sat Feb 10 02:26:00 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.