Dynamic Discovery is not triggered for router peers

Details

    • Bug
    • Resolution: Fixed
    • Minor

    Description

      It appears to me that dynamic discovery will not be triggered for LNet routers under normal operating conditions (i.e. without an explicit lnetctl discover of the routers).

      The discovery logic is early in lnet_select_pathway() and based on the peer associated with the dst_nid. The only LNetGet where dst_nid is a router is going to be the router checker traffic. But that traffic goes on the LNET_RESERVED_PORTAL, so it will never trigger discovery.
      Seems like we need to check for discovery of the router peer after selecting it later in lnet_select_pathway().
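
To make the reasoning above concrete, here is a minimal toy model in Python. It is not LNet code: `ToyLNet`, `Peer`, `select_pathway`, the `check_router` flag, the NID strings, and the portal numbers are all hypothetical stand-ins, and `RESERVED_PORTAL` only mimics the role of LNET_RESERVED_PORTAL. It sketches, under those assumptions, why a discovery check keyed only on the dst_nid peer never queues a router, and how a second check after router selection might change that.

```python
from dataclasses import dataclass, field

# Stand-in for LNET_RESERVED_PORTAL; the value is illustrative only.
RESERVED_PORTAL = 0


@dataclass
class Peer:
    nid: str
    discovered: bool = False


@dataclass
class ToyLNet:
    peers: dict   # nid -> Peer
    routes: dict  # remote destination nid -> router nid
    discovery_queue: list = field(default_factory=list)

    def _maybe_discover(self, peer, portal):
        # Traffic on the reserved portal never triggers discovery.
        if portal == RESERVED_PORTAL or peer.discovered:
            return
        if peer.nid not in self.discovery_queue:
            self.discovery_queue.append(peer.nid)

    def select_pathway(self, dst_nid, portal, check_router=False):
        # Early check: keyed only on the peer that owns dst_nid.
        self._maybe_discover(self.peers[dst_nid], portal)

        router_nid = self.routes.get(dst_nid)
        if router_nid is None:
            return dst_nid  # destination is directly reachable

        if check_router:
            # Hypothetical change: also consider the router once selected.
            self._maybe_discover(self.peers[router_nid], portal)
        return router_nid


def demo(check_router):
    net = ToyLNet(
        peers={"rtr": Peer("rtr"), "srv": Peer("srv")},
        routes={"srv": "rtr"},
    )
    # Router checker traffic: dst_nid is the router, on the reserved portal.
    net.select_pathway("rtr", RESERVED_PORTAL, check_router)
    # Ordinary message to a remote peer, forwarded through the router.
    net.select_pathway("srv", 12, check_router)
    return net.discovery_queue
```

In this model, `demo(False)` queues only `"srv"`, because the single message addressed to the router travels on the reserved portal. `demo(True)` queues `"srv"` and then `"rtr"`, since the router is examined after it has been chosen as the next hop.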

          Activity

            [LU-11144] Dynamic Discovery is not triggered for router peers
            hornc Chris Horn added a comment -

            Resolved with MR routing feature

            spitzcor Cory Spitz added a comment -

            ashehata, will you be resolving this issue then? Can you point at a specific commit or LU that resolved it?


            ashehata Amir Shehata (Inactive) added a comment -

            I believe this issue has been resolved in the new routing code.

            spitzcor Cory Spitz added a comment -

            ashehata, ready to resolve this and LU-11143?


            ashehata Amir Shehata (Inactive) added a comment -

            Both LU-11143 and LU-11144 are related.

            I address them here:

            https://wiki.whamcloud.com/display/LNet/Routing+and+MR+integration

            Might be a good idea to use that link for feedback on the proposals.

            hornc Chris Horn added a comment -

            I think what I was missing with router peers and dynamic discovery is the DD push. The router should queue other peers for discovery based on where it is forwarding traffic, and the discovery thread should push the router's MR info to those peers as part of that. Right?

            So while the description in this ticket is, I believe, accurate, it's probably by design and not necessarily a problem.


            People

              ashehata Amir Shehata (Inactive)
              hornc Chris Horn
              Votes: 0
              Watchers: 6
