Details
-
Improvement
-
Resolution: Unresolved
-
Minor
-
None
-
None
-
None
-
3
-
9223372036854775807
Description
Right now, it's possible for the server to initiate a connection to client if it has lost the connection and server has to notify the client, for example in case of a DLM lock callback (AST). This can cause issues in cloud environments because the server and client may not belong to the same virtual network.
The current solution is to configure the firewall so that it allows connection to client. @Andreas has shared me this PR: https://review.whamcloud.com/41021 ("LU-14224 misc: add firewalld service configuration") that makes things a little bit easier.
It would be helpful if we can avoid this totally. That being said, clients should maintain such connections if they have resource caching locally. If server detects that a connection is lost, it should wait for the client to connect back. LNET should notify server if connection is restored in order to reduce latency.