Details

    • Technical task
    • Resolution: Fixed
    • Minor
    • Lustre 2.13.0
    • None
    • 9223372036854775807

    Description

      In case of routers (as well as for the general case) it's important to update the health of the ni/lpni for incoming messages. For an lpni specifically when we receive a message is when we know that the lpni is up.

      A percentage router health is required in order to send a message to a gateway. That defaults to 100, meaning that a router interface has to be absolutely healthy in order to send to it. This matches the current behavior. So if a router interface goes down an its health goes down significantly, but then it comes back up again; either we receive a message from it or we discover it and get a reply, then in order to start using that router interface again we have to boost its health all the way up to maximum.

      This behavior is special cased for routers.

      Attachments

        Activity

          [LU-11477] handle health for both incoming and outgoing messages

          Olaf Faaland-LLNL (faaland1@llnl.gov) uploaded a new patch: https://review.whamcloud.com/38357
          Subject: LU-11477 lnet: handle health for incoming messages
          Project: fs/lustre-release
          Branch: b2_12
          Current Patch Set: 1
          Commit: 5cf18b910dc4bcc3308f64a8a282e62acdd39b03

          gerrit Gerrit Updater added a comment - Olaf Faaland-LLNL (faaland1@llnl.gov) uploaded a new patch: https://review.whamcloud.com/38357 Subject: LU-11477 lnet: handle health for incoming messages Project: fs/lustre-release Branch: b2_12 Current Patch Set: 1 Commit: 5cf18b910dc4bcc3308f64a8a282e62acdd39b03

          Olaf Faaland-LLNL (faaland1@llnl.gov) uploaded a new patch: https://review.whamcloud.com/38354
          Subject: LU-11477 lnet: handle health for incoming messages
          Project: fs/lustre-release
          Branch: b2_12
          Current Patch Set: 1
          Commit: 3f5a661a1d8ca65fe1412d6ec13630fc1f8ab394

          gerrit Gerrit Updater added a comment - Olaf Faaland-LLNL (faaland1@llnl.gov) uploaded a new patch: https://review.whamcloud.com/38354 Subject: LU-11477 lnet: handle health for incoming messages Project: fs/lustre-release Branch: b2_12 Current Patch Set: 1 Commit: 3f5a661a1d8ca65fe1412d6ec13630fc1f8ab394

          Work has landed as part of the MR Routing merge commit: https://review.whamcloud.com/#/c/34983/

          jgmitter Joseph Gmitter (Inactive) added a comment - Work has landed as part of the MR Routing merge commit: https://review.whamcloud.com/#/c/34983/

          Amir Shehata (ashehata@whamcloud.com) merged in patch https://review.whamcloud.com/33301/
          Subject: LU-11477 lnet: handle health for incoming messages
          Project: fs/lustre-release
          Branch: multi-rail
          Current Patch Set:
          Commit: 18c850cb91a64fcc38ae0801e6fab983607ded71

          gerrit Gerrit Updater added a comment - Amir Shehata (ashehata@whamcloud.com) merged in patch https://review.whamcloud.com/33301/ Subject: LU-11477 lnet: handle health for incoming messages Project: fs/lustre-release Branch: multi-rail Current Patch Set: Commit: 18c850cb91a64fcc38ae0801e6fab983607ded71

          Amir Shehata (ashehata@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/33301
          Subject: LU-11477 lnet: handle health for incoming messages
          Project: fs/lustre-release
          Branch: multi-rail
          Current Patch Set: 1
          Commit: 88c83415ebaeaf0fe473cb74f6801996f5808c2f

          gerrit Gerrit Updater added a comment - Amir Shehata (ashehata@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/33301 Subject: LU-11477 lnet: handle health for incoming messages Project: fs/lustre-release Branch: multi-rail Current Patch Set: 1 Commit: 88c83415ebaeaf0fe473cb74f6801996f5808c2f

          People

            ashehata Amir Shehata (Inactive)
            ashehata Amir Shehata (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: