  1. Lustre
  2. LU-11297 Align LNet routing with Multi-Rail and LNet health
  3. LU-11478

LNet: discovery sequence numbers could be misleading

      Description

      Discovery messages carry a sequence number that is intended to detect stale messages. However, this number can be misleading if the peer reboots, because the peer's sequence number resets. The node then treats everything the peer sends as stale, even though the peer may in fact have changed its configuration.
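
      The failure mode can be sketched as follows. This is an illustrative Python model, not LNet code: `PeerState`, `accept_if_newer`, and the configuration strings are hypothetical names chosen for the example, and the "newer than the last one seen" rule is an assumption about how a stale check of this kind typically works.

```python
from dataclasses import dataclass


@dataclass
class PeerState:
    """Hypothetical record the local node keeps for one peer."""
    last_seqno: int = 0
    config: str = ""


def accept_if_newer(state: PeerState, seqno: int, config: str) -> bool:
    """Apply an update only if its sequence number is newer than the last one seen."""
    if seqno <= state.last_seqno:
        return False  # message is treated as stale and dropped
    state.last_seqno = seqno
    state.config = config
    return True


peer = PeerState()
first = accept_if_newer(peer, 5, "nids=[A]")  # normal update, accepted

# The peer reboots: its counter restarts at a low value, and its
# configuration has changed. The update is valid but looks stale.
after_reboot = accept_if_newer(peer, 1, "nids=[A, B]")
```

      In this model the second update is rejected, so the node keeps the pre-reboot configuration.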

      There is no reliable way to know whether a peer has rebooted, so we will always assume that the messages we receive are valid and process them on a first-come, first-served basis.
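
      A minimal sketch of the first-come, first-served policy, again as an illustrative Python model rather than LNet code. `PeerState` and `accept_in_arrival_order` are hypothetical names; the sketch assumes the sequence number is still recorded but no longer used to reject messages.

```python
from dataclasses import dataclass


@dataclass
class PeerState:
    """Hypothetical record the local node keeps for one peer."""
    last_seqno: int = 0
    config: str = ""


def accept_in_arrival_order(state: PeerState, seqno: int, config: str) -> bool:
    """Treat every received message as valid and apply it in arrival order."""
    state.last_seqno = seqno  # recorded for reference only, not compared
    state.config = config
    return True


peer = PeerState()
accept_in_arrival_order(peer, 5, "nids=[A]")

# After a reboot the peer's counter restarts, but the update is still applied.
applied = accept_in_arrival_order(peer, 1, "nids=[A, B]")
```

      The trade-off in this model is that a genuinely delayed message would also be applied if it arrived late, since nothing distinguishes it from a post-reboot update.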

              People

              Assignee:
              ashehata Amir Shehata
              Reporter:
              ashehata Amir Shehata