Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-13478

LNet: peer update adjustment on discovery toggle

Details

    • Bug
    • Resolution: Fixed
    • Minor
    • Lustre 2.14.0
    • None
    • None
    • 3
    • 9223372036854775807

    Description

      Do not delete a non-MR peer when it first connects. We should only delete a peer from our local database iff we've already discovered it and it's state is changing from discovery on to discovery off.

      I think it is better to do a push to the peers only when discovery is toggled from on to off. This lets the peers know to clear their representation of the node. When it attempts to connect to it after, it'll get the correct list of NIDs.

      However in the reverse case; discovery going from off to on, the PUSH can be sent on an interface which is not recorded on the far side. The push is dropped in this case. By not pushing when we go from off to on we avoid this scenario and allow the peer to rediscover when it commences communication.

      Attachments

        Activity

          [LU-13478] LNet: peer update adjustment on discovery toggle
          pjones Peter Jones added a comment -

          Landed for 2.14

          pjones Peter Jones added a comment - Landed for 2.14

          Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/38321/
          Subject: LU-13478 lnet: handle discovery off properly
          Project: fs/lustre-release
          Branch: master
          Current Patch Set:
          Commit: adae4295b62b1074f5c3c45543c586282394b1be

          gerrit Gerrit Updater added a comment - Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/38321/ Subject: LU-13478 lnet: handle discovery off properly Project: fs/lustre-release Branch: master Current Patch Set: Commit: adae4295b62b1074f5c3c45543c586282394b1be

          Amir Shehata (ashehata@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/38321
          Subject: LU-13478 lnet: handle discovery off properly
          Project: fs/lustre-release
          Branch: master
          Current Patch Set: 1
          Commit: 0aba7078644af2f8379a9dee58f4c8827263683c

          gerrit Gerrit Updater added a comment - Amir Shehata (ashehata@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/38321 Subject: LU-13478 lnet: handle discovery off properly Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 0aba7078644af2f8379a9dee58f4c8827263683c

          People

            ashehata Amir Shehata (Inactive)
            ashehata Amir Shehata (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: