Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-11272

LNet Health: handle routing special case

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: Lustre 2.12.0
    • Fix Version/s: Lustre 2.12.0
    • Labels:
      None
    • Severity:
      3
    • Rank (Obsolete):
      9223372036854775807

      Description

      There are two issues:

      1. A router checker ping can timeout, causing the mdh to be invalidated. We need to recreate the mdh in that case
      2. When re-transmitting a message, even if the peer is marked as down we should re-transmit the message to fulfill it's retry quota.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                ashehata Amir Shehata
                Reporter:
                ashehata Amir Shehata
              • Votes:
                0 Vote for this issue
                Watchers:
                5 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: