Details

    • Type: Bug
    • Status: Open
    • Priority: Critical
    • Resolution: Unresolved
    • Affects Version/s: Lustre 2.12.2
    • Fix Version/s: None
    • Labels:
      None
    • Severity:
      2
    • Rank (Obsolete):
      9223372036854775807

      Description

      We have been setting ko2iblnd timeout = 150 (default of 50) for our cluster. From reading the code this is no longer being used and instead lnet_lnd_timeout is used.

      For example in kiblnd_queue_tx_locked

          timeout_ns = lnet_get_lnd_timeout() * NSEC_PER_SEC;
          tx->tx_queued = 1;
          tx->tx_deadline = ktime_add_ns(ktime_get(), timeout_ns);
      

      and
      lnet_get_lnd_timeout() returns the new default of 5. Does this mean we went from 150 to 5!

      In the documentation it says that lnet_lnd_timeout derived from lnet_transaction_timeout and retry_count. But that is not getting set for tx->tx_deadline.

      Am I reading the code correctly?

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                ashehata Amir Shehata
                Reporter:
                mhanafi Mahmoud Hanafi
              • Votes:
                0 Vote for this issue
                Watchers:
                8 Start watching this issue

                Dates

                • Created:
                  Updated: