Details
- Bug
- Resolution: Fixed
- Blocker
- Lustre 2.12.0
- None
- 3
- 9223372036854775807
Description
There are two issues:
- A router checker ping can time out, which invalidates the mdh (MD handle). The mdh needs to be recreated in that case.
- When re-transmitting a message, the message should still be re-transmitted until its retry quota is fulfilled, even if the peer is marked as down.
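The two behaviors described above can be illustrated with a minimal C sketch. This is not the LNet code and not the landed patch: every type, function name, and the retry limit of 3 below is hypothetical and chosen only for illustration. It shows (1) a timed-out ping invalidating a handle that is then recreated before the next ping, and (2) a resend decision that depends only on the remaining retry quota, not on whether the peer is marked down.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* All names below are hypothetical; they do not exist in LNet. */
enum { SKETCH_RETRY_LIMIT = 3 };   /* assumed retry quota */

struct sketch_msg {
    int retry_count;               /* re-transmissions already attempted */
};

struct sketch_peer {
    bool alive;                    /* false when the peer is marked down */
};

struct sketch_rc_ping {
    bool mdh_valid;                /* is the handle usable for the next ping? */
    int  mdh_generation;           /* bumped each time the handle is recreated */
};

/* Issue 1: a ping timeout leaves the handle invalid. */
static void sketch_ping_timed_out(struct sketch_rc_ping *rc)
{
    assert(rc != NULL);
    rc->mdh_valid = false;
}

/* Issue 1: recreate the handle before pinging if it was invalidated. */
static void sketch_prepare_ping(struct sketch_rc_ping *rc)
{
    assert(rc != NULL);
    if (!rc->mdh_valid) {
        rc->mdh_generation++;      /* stands in for allocating a new handle */
        rc->mdh_valid = true;
    }
}

/* Issue 2: keep resending until the quota is used up, even if the peer
 * is marked down. The peer state is deliberately not consulted. */
static bool sketch_should_resend(const struct sketch_msg *msg,
                                 const struct sketch_peer *peer)
{
    assert(msg != NULL);
    (void)peer;
    return msg->retry_count < SKETCH_RETRY_LIMIT;
}
```

In this sketch a caller would invoke `sketch_prepare_ping()` before each router checker ping and consult `sketch_should_resend()` on each send failure, incrementing `retry_count` after every attempt.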
Issue Links
- is related to: LU-9120 LNet Network Health Feature (Resolved)
Activity
Resolution | New: Fixed [ 1 ] | |
Status | Original: Open [ 1 ] | New: Resolved [ 5 ] |
Affects Version/s | New: Lustre 2.12.0 [ 13495 ] |
Fix Version/s | New: Lustre 2.12.0 [ 13495 ] |
Description |
Original:
There are two issues:
# A router checker ping can timeout, causing the mdh to be invalidated. We need to recreate the mdh in that case
# When re-transmitting a message, even if the peer is marked as down we should re-transmit the message to fullfill it's retry quota.
New:
There are two issues:
# A router checker ping can timeout, causing the mdh to be invalidated. We need to recreate the mdh in that case
# When re-transmitting a message, even if the peer is marked as down we should re-transmit the message to fulfill it's retry quota.
Landed for 2.12