Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-8429

Add option for gnilnd to not reconnect after connection timeout

Details

    • Bug
    • Resolution: Fixed
    • Minor
    • Lustre 2.9.0
    • None
    • 3
    • 9223372036854775807

    Description

      When routers time out a client connection during a catastrophic
      network disturbance like a cabinet EPO, there still may be
      traffic from the file system that is using the router for the
      return path to the client. This will cause a new connection to try
      to be formed before the network has quiesced causing multiple failed
      connection attempts which need to be put in purgatory since they could
      possibly connect in the future. This can cause the gart space to be
      consumed with registrations.

      So we'll add an option to not reconnect after connection timeout

      Attachments

        Activity

          People

            hornc Chris Horn
            hornc Chris Horn
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: