Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-13575

LNet should ensure round-robin interface selection when interfaces are healthy

Details

    • Improvement
    • Resolution: Fixed
    • Minor
    • Lustre 2.15.0
    • None
    • None

    Description

      When an interface fails and stays out of commission for a period of time, and then is brought back into commission, the sequence numbers for the interface which has been currently in use would be far larger than the newly commissioned interface. This leads to the new interface being used continuously until its sequence number catches up with the in use interface. This is not ideal behavior, because the system has two available interfaces, but only one is being used simply because of the sequence number, which is intended to allow round robin. Ideally, once an interface comes back into service, it should immediately be used.

      A similar thing happens when there are a lot of source specified sends. One NI gets a bunch of sequence increments so then it takes a while for other NIs to "catch up".

      We should modify the sequence number manipulation to help ensure we actually round robin when desired, or otherwise modify the relevant code.

      Attachments

        Activity

          [LU-13575] LNet should ensure round-robin interface selection when interfaces are healthy
          pjones Peter Jones added a comment -

          Landed for 2.15

          pjones Peter Jones added a comment - Landed for 2.15

          "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/45004/
          Subject: LU-13575 lnet: Ensure round robin selection of peer NIs
          Project: fs/lustre-release
          Branch: master
          Current Patch Set:
          Commit: c51763948abfdbdc8e3f3ea7e73f2632320a095a

          gerrit Gerrit Updater added a comment - "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/45004/ Subject: LU-13575 lnet: Ensure round robin selection of peer NIs Project: fs/lustre-release Branch: master Current Patch Set: Commit: c51763948abfdbdc8e3f3ea7e73f2632320a095a

          "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/45003/
          Subject: LU-13575 lnet: Ensure round robin selection of local NIs
          Project: fs/lustre-release
          Branch: master
          Current Patch Set:
          Commit: a18c4a16246e6185919eda805eca52772bbc3efe

          gerrit Gerrit Updater added a comment - "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/45003/ Subject: LU-13575 lnet: Ensure round robin selection of local NIs Project: fs/lustre-release Branch: master Current Patch Set: Commit: a18c4a16246e6185919eda805eca52772bbc3efe

          "Chris Horn <chris.horn@hpe.com>" uploaded a new patch: https://review.whamcloud.com/45004
          Subject: LU-13575 lnet: Ensure round robin selection of peer NIs
          Project: fs/lustre-release
          Branch: master
          Current Patch Set: 1
          Commit: 3a60ea6a3c8f341b130c4176821a1ad8c5067033

          gerrit Gerrit Updater added a comment - "Chris Horn <chris.horn@hpe.com>" uploaded a new patch: https://review.whamcloud.com/45004 Subject: LU-13575 lnet: Ensure round robin selection of peer NIs Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 3a60ea6a3c8f341b130c4176821a1ad8c5067033

          "Chris Horn <chris.horn@hpe.com>" uploaded a new patch: https://review.whamcloud.com/45003
          Subject: LU-13575 lnet: Ensure round robin selection of local NIs
          Project: fs/lustre-release
          Branch: master
          Current Patch Set: 1
          Commit: 1c9b558346be925aa5e3e7e9e24fc5d7ff6ea3b8

          gerrit Gerrit Updater added a comment - "Chris Horn <chris.horn@hpe.com>" uploaded a new patch: https://review.whamcloud.com/45003 Subject: LU-13575 lnet: Ensure round robin selection of local NIs Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 1c9b558346be925aa5e3e7e9e24fc5d7ff6ea3b8

          People

            hornc Chris Horn
            hornc Chris Horn
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: