Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-11272

LNet Health: handle routing special case

Details

    • Bug
    • Resolution: Fixed
    • Blocker
    • Lustre 2.12.0
    • Lustre 2.12.0
    • None
    • 3
    • 9223372036854775807

    Description

      There are two issues:

      1. A router checker ping can timeout, causing the mdh to be invalidated. We need to recreate the mdh in that case
      2. When re-transmitting a message, even if the peer is marked as down we should re-transmit the message to fulfill it's retry quota.

      Attachments

        Issue Links

          Activity

            [LU-11272] LNet Health: handle routing special case
            pjones Peter Jones added a comment -

            Landed for 2.12

            pjones Peter Jones added a comment - Landed for 2.12

            Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/33043/
            Subject: LU-11272 lnet: router handling
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 05becd69bc0c79fde00f0fddf4935ed8d8e3beb3

            gerrit Gerrit Updater added a comment - Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/33043/ Subject: LU-11272 lnet: router handling Project: fs/lustre-release Branch: master Current Patch Set: Commit: 05becd69bc0c79fde00f0fddf4935ed8d8e3beb3

            Amir Shehata (ashehata@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/33046
            Subject: LU-11272 lnet: router handling
            Project: fs/lustre-release
            Branch: multi-rail
            Current Patch Set: 1
            Commit: 0917bef280bf0abe7821c255d8d5f74f359bc9e2

            gerrit Gerrit Updater added a comment - Amir Shehata (ashehata@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/33046 Subject: LU-11272 lnet: router handling Project: fs/lustre-release Branch: multi-rail Current Patch Set: 1 Commit: 0917bef280bf0abe7821c255d8d5f74f359bc9e2

            Amir Shehata (ashehata@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/33043
            Subject: LU-11272 lnet: router handling
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 9381aeb15f68789d1e65cc3c5b6201362f4423dd

            gerrit Gerrit Updater added a comment - Amir Shehata (ashehata@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/33043 Subject: LU-11272 lnet: router handling Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 9381aeb15f68789d1e65cc3c5b6201362f4423dd

            People

              ashehata Amir Shehata (Inactive)
              ashehata Amir Shehata (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: