Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-15713

Round robin across nets can be broken

Details

    • Bug
    • Resolution: Fixed
    • Major
    • Lustre 2.16.0
    • None
    • None
    • 3
    • 9223372036854775807

    Description

      This issue is very similar to LU-13575, but it relates to the round robin across multiple nets whereas that ticket was about round robin across interfaces within a single net.

      Currently if a peer has multiple network types (either multiple LNDs or multiple nets on their interfaces) there are situations where traffic can be routed to the interfaces on one net (like if a peer is talking to another peer that only has interfaces on one of the nets, or if interfaces go down on the other net for an extended period of time). This causes the peer net/local net sequence numbers to diverge in the same manner documented in LU-13575. This can cause future traffic to funnel to just one of the available nets leading to degraded performance.

      Attachments

        Activity

          [LU-15713] Round robin across nets can be broken

          "Etienne AUJAMES <eaujames@ddn.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/51547
          Subject: LU-15713 lnet: Ensure round robin across nets
          Project: fs/lustre-release
          Branch: b2_15
          Current Patch Set: 1
          Commit: 4b616f00cacc448f1be6607754feb77dbb347167

          gerrit Gerrit Updater added a comment - "Etienne AUJAMES <eaujames@ddn.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/51547 Subject: LU-15713 lnet: Ensure round robin across nets Project: fs/lustre-release Branch: b2_15 Current Patch Set: 1 Commit: 4b616f00cacc448f1be6607754feb77dbb347167
          pjones Peter Jones added a comment -

          Landed for 2.16

          pjones Peter Jones added a comment - Landed for 2.16

          "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/46976/
          Subject: LU-15713 lnet: Ensure round robin across nets
          Project: fs/lustre-release
          Branch: master
          Current Patch Set:
          Commit: 05413b3d84f7d1febb89cf4e9c86a7e017d147df

          gerrit Gerrit Updater added a comment - "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/46976/ Subject: LU-15713 lnet: Ensure round robin across nets Project: fs/lustre-release Branch: master Current Patch Set: Commit: 05413b3d84f7d1febb89cf4e9c86a7e017d147df

          People

            hornc Chris Horn
            hornc Chris Horn
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: