-
Type:
New Feature
-
Status: Open
-
Priority:
Major
-
Resolution: Unresolved
-
Affects Version/s: Lustre 2.4.3
-
Fix Version/s: None
-
Labels:None
-
Rank (Obsolete):15964
In order to minimize user disruptions NASA performs some system maintenance "Live". Typical maintance includes activities such as adding new compute node or reconfigurations of IB fabric. During such times users jobs are suspend via pbs. Although we are able to suspend user job, which does minimize usage of lustre, it does not stop all lustre client/server activity. Therefore NASA requires:
1. mechanism to halt and block all lustre client IO.
2. Halt client/server keep alive ping and all other network traffic.
3. Clients should be able to recover after the quiesce without eviction.
- is duplicated by
-
LU-13078 mgs trigger umount of clients
-
- Open
-
- is related to
-
LU-3290 disallow ptlrpc RPCs with old client XIDs
-
- Open
-
-
LU-13521 WBC: special readdir() handling for root WBC directory
-
- Open
-
- is related to
-
LU-13010 WBC: Reopen the file when WBC EX lock revoking
-
- Open
-
-
LU-18 Allow 100k open files on single client
-
- Open
-
-
LU-7236 connections on demand
-
- Resolved
-