Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-14386

LNet: select reachable remote peer nid

    XMLWordPrintable

Details

    • Bug
    • Resolution: Unresolved
    • Minor
    • None
    • Lustre 2.12.6
    • 3
    • 9223372036854775807

    Description

       The following results in a problem:

      NodeA --tcp0-- GW --tcp1-- NodeB 
      ------ NodeA ------
      lnetctl net show
          - net type: tcp9
            local NI(s):
              - nid: 192.168.122.10@tcp9
          - net type: tcp
            local NI(s):
              - nid: 192.168.122.142@tcp
      ------ NodeB ------
      lnetctl net show
      net:
          - net type: tcp1
            local NI(s):
              - nid: 192.168.122.40@tcp1
      ------ NodeB ------
      lnetctl peer show
      peer:
          - primary nid: 192.168.122.10@tcp9
            Multi-Rail: True
            peer ni:
              - nid: 192.168.122.142@tcp
                state: NA
              - nid: 192.168.122.10@tcp9
                state: NA

      Note that NodeB lists NodeA under the unreachable tcp9 primary nid. Even though NodeB is aware of the reachable nid for NodeA, it gets confused if using the primary nid:

      ------ NodeB ------
      lnetctl ping 192.168.122.10@tcp9
      manage:
          - ping:
                errno: -1
                descr: failed to ping 192.168.122.10@tcp9: Input/output error

      Attachments

        Issue Links

          Activity

            People

              ssmirnov Serguei Smirnov
              ssmirnov Serguei Smirnov
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated: