Details

    • Technical task
    • Resolution: Fixed
    • Minor
    • Lustre 2.13.0, Lustre 2.12.3

    Description

      The problem occurs in the following scenario:

      Lustre calls LNetPrimaryNID() with 0@lo
      Discovery is initiated on 0@lo
      The peer is created with 0@lo and <addr>@<net>
      The interface health of the peer's <addr>@<net> is decremented
      LNetPut() to self
      The selection algorithm selects 0@lo to send to

      This exposes an issue: the send goes through the peer credit management algorithm, but because no credits are associated with 0@lo, the message is queued indefinitely. ptlrpc then gets stuck waiting for send completion on the message.
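
      The queuing behaviour described above can be illustrated with a toy model. This is a minimal Python sketch, not LNet code: the PeerNI class and both send functions are hypothetical names invented for illustration, and the loopback bypass is only an assumption about the general shape of a fix, based on the patch subject "select LO interface for sending".

```python
from collections import deque
from dataclasses import dataclass, field

LOOPBACK_NID = "0@lo"  # loopback NID named in the description


@dataclass
class PeerNI:
    """Toy stand-in for one peer interface; not the LNet data structure."""
    nid: str
    credits: int                      # send credits available on this interface
    queue: deque = field(default_factory=deque)


def send_with_credits(peer_ni: PeerNI, msg: str) -> str:
    """Credit-gated send: queue the message when no credit is available."""
    if peer_ni.credits <= 0:
        peer_ni.queue.append(msg)     # waits for a credit that is never returned
        return "queued"
    peer_ni.credits -= 1
    return "sent"


def send_with_loopback_bypass(peer_ni: PeerNI, msg: str) -> str:
    """Hypothetical variant: loopback traffic skips credit accounting."""
    if peer_ni.nid == LOOPBACK_NID:
        return "sent"
    return send_with_credits(peer_ni, msg)


lo = PeerNI(nid=LOOPBACK_NID, credits=0)   # 0@lo has no credits associated
stuck = send_with_credits(lo, "put-to-self")
bypassed = send_with_loopback_bypass(lo, "put-to-self")
```

      In this model the first call leaves the message on the queue with nothing to release it, which corresponds to ptlrpc waiting forever for send completion. The second call completes because the loopback check happens before any credit accounting.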

      This was exposed by conf-sanity test 32.

       

        Activity

          [LU-12339] LNet Health: selecting loopback interface for sending

          Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/36040/
          Subject: LU-12339 lnet: select LO interface for sending
          Project: fs/lustre-release
          Branch: b2_12
          Current Patch Set:
          Commit: bab7da820e36d3c00e888704fc2c8d6022786c42

          Minh Diep (mdiep@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/36040
          Subject: LU-12339 lnet: select LO interface for sending
          Project: fs/lustre-release
          Branch: b2_12
          Current Patch Set: 1
          Commit: 3d0d0149b832804c36d432370e23ed91b8ec43c8

          jgmitter Joseph Gmitter (Inactive) added a comment - Work has landed as part of the MR Routing merge commit: https://review.whamcloud.com/#/c/34983/

          Amir Shehata (ashehata@whamcloud.com) merged in patch https://review.whamcloud.com/34957/
          Subject: LU-12339 lnet: select LO interface for sending
          Project: fs/lustre-release
          Branch: multi-rail
          Current Patch Set:
          Commit: 69d1535ebdac139c6b19db2bca5f65663fe88467

          Amir Shehata (ashehata@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/34957
          Subject: LU-12339 lnet: select LO interface for sending
          Project: fs/lustre-release
          Branch: multi-rail
          Current Patch Set: 1
          Commit: d4fb79b10c186dbec3263885773e4b48f0e7cd6e

          People

            ashehata Amir Shehata (Inactive)