Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-16999

Revert caf6095ade LU-15595 lnet: LNet peer aliveness broken

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Critical
    • Lustre 2.16.0
    • None
    • None
    • 3
    • 9223372036854775807

    Description

      This patch restored the historic behavior of the LNet router peer health feature, but it did not account for the fact that the old lnet router checker behaved differently than the current implementation that leverages LNet discovery to perform the router checker pings. Because of this change to use discovery we can no longer guarantee that each router end point will be ping'd within peer aliveness window, and as a result the router may incorrectly determine that some peer NIs are not alive.

      Just revert this for now

      Attachments

        Activity

          People

            hornc Chris Horn
            hornc Chris Horn
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: