[LU-17519] Remove dead peers automatically Created: 09/Feb/24  Updated: 09/Feb/24

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Improvement Priority: Minor
Reporter: Cyril Bordage Assignee: Cyril Bordage
Resolution: Unresolved Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

When a client is decommissioned, it stays forever in the peer list of the servers.

It is cleaner to remove it from the list of peers. It avoids the need to change the value of lnet_recovery_limit to remove LNetError messages about this removed client. Moreover, having this parameter changed can mask a problem on an active but faulty node.

However, it can be cumbersome to remove it manually. That is why an automatic deletion could be relevant.

This feature could use a parameter to enable it and to set the delay before a client is considered to be removed.


Generated at Sat Feb 10 03:36:06 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.