Details
-
New Feature
-
Resolution: Unresolved
-
Major
-
None
-
Lustre 2.4.3
-
15964
Description
In order to minimize user disruptions NASA performs some system maintenance "Live". Typical maintance includes activities such as adding new compute node or reconfigurations of IB fabric. During such times users jobs are suspend via pbs. Although we are able to suspend user job, which does minimize usage of lustre, it does not stop all lustre client/server activity. Therefore NASA requires:
1. mechanism to halt and block all lustre client IO.
2. Halt client/server keep alive ping and all other network traffic.
3. Clients should be able to recover after the quiesce without eviction.
Attachments
Issue Links
- is duplicated by
-
LU-13078 mgs trigger umount of clients
- Open
- is related to
-
LU-3290 disallow ptlrpc RPCs with old client XIDs
- Open
-
LU-13521 WBC: special readdir() handling for root WBC directory
- Open
-
LU-15250 RPC Replay Signature
- Open
- is related to
-
LU-13010 WBC: Reopen the file when WBC EX lock revoking
- Open
-
LU-18 Allow 100k open files on single client
- Resolved
-
LU-7236 OST connect and disconnect on demand
- Resolved