[LU-11468] LNet Health: Recovery interval Created: 03/Oct/18  Updated: 10/Nov/18  Resolved: 10/Nov/18

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.12.0
Fix Version/s: Lustre 2.12.0

Type: Bug Priority: Major
Reporter: Amir Shehata (Inactive) Assignee: Amir Shehata (Inactive)
Resolution: Fixed Votes: 0
Labels: lnet-health

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

Add a configurable parameter to determine the Recovery interval. It is likely that some sites might not want to do health pings on failed NIDs once per second. So allow the ability to configure the difference between each recovery ping.



 Comments   
Comment by Gerrit Updater [ 05/Oct/18 ]

Amir Shehata (ashehata@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/33296
Subject: LU-11468 lnet: configure recovery interval
Project: fs/lustre-release
Branch: multi-rail
Current Patch Set: 1
Commit: 6f14cfe69c6079e0b9b8aa2d3eec2c20b395a8ae

Comment by Gerrit Updater [ 05/Oct/18 ]

Amir Shehata (ashehata@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/33309
Subject: LU-11468 lnet: configure recovery interval
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 98a7670c9b7bc19e9e7410a28aa16313615c90ca

Comment by Gerrit Updater [ 26/Oct/18 ]

Amir Shehata (ashehata@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/33498
Subject: LU-11468 lnet: set recovery interval from lnetctl
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 1ecd232931b92fb86bfba66edd4dee3aa1a5c35a

Comment by Gerrit Updater [ 06/Nov/18 ]

Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/33309/
Subject: LU-11468 lnet: configure recovery interval
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: dc1f5f08b420aba99f613a6bc6b8acb7afd0e894

Comment by Peter Jones [ 06/Nov/18 ]

Landed for 2.12

Comment by Peter Jones [ 06/Nov/18 ]

one more patch to land still

 

Comment by Gerrit Updater [ 10/Nov/18 ]

Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/33498/
Subject: LU-11468 lnet: set recovery interval from lnetctl
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: b7f8d156db696fcc15fd37cfdfbee6549148fb69

Comment by Peter Jones [ 10/Nov/18 ]

Landed for 2.12

Generated at Sat Feb 10 02:44:08 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.