Lustre / LU-13020

ko2iblnd tuning


    • Type: Bug
    • Resolution: Cannot Reproduce
    • Priority: Critical
    • Lustre 2.12.2

      We have been setting the ko2iblnd timeout to 150 (the default is 50) on our cluster. From reading the code, this setting appears to be no longer used; lnet_lnd_timeout is used instead.

      For example, in kiblnd_queue_tx_locked:

          timeout_ns = lnet_get_lnd_timeout() * NSEC_PER_SEC;
          tx->tx_queued = 1;
          tx->tx_deadline = ktime_add_ns(ktime_get(), timeout_ns);
      

      and lnet_get_lnd_timeout() returns the new default of 5. Does this mean our timeout went from 150 to 5?
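
      To make the arithmetic concrete, here is a minimal userspace sketch of the same deadline calculation. It is not the kernel code: tx_deadline_ns and tx_window_s are hypothetical helpers, plain 64-bit integers stand in for ktime_t, and the timeout is assumed to be in seconds because the excerpt multiplies it by NSEC_PER_SEC.

```c
#include <assert.h>
#include <stdint.h>

#ifndef NSEC_PER_SEC
#define NSEC_PER_SEC 1000000000LL
#endif

/* Hypothetical stand-in for ktime_add_ns(ktime_get(), timeout_ns):
 * absolute deadline, in nanoseconds, for a tx queued at now_ns. */
static int64_t tx_deadline_ns(int64_t now_ns, int64_t timeout_s)
{
	assert(timeout_s >= 0);
	return now_ns + timeout_s * NSEC_PER_SEC;
}

/* Whole seconds a tx queued at now_ns may wait before its deadline. */
static int64_t tx_window_s(int64_t now_ns, int64_t timeout_s)
{
	return (tx_deadline_ns(now_ns, timeout_s) - now_ns) / NSEC_PER_SEC;
}
```

      With these helpers, a timeout of 150 gives each queued tx a 150-second window, while a timeout of 5 gives it only 5 seconds, which is the reduction being asked about.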

      The documentation says that lnet_lnd_timeout is derived from lnet_transaction_timeout and retry_count. But that does not seem to be what is applied to tx->tx_deadline.
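
      As a rough illustration of what such a derivation could look like, the sketch below splits a transaction timeout evenly across the first attempt and its retries. The rule itself is an assumption made for illustration, and sketch_lnd_timeout is a hypothetical name; the derivation actually used by a given Lustre release may differ and should be checked in the source.

```c
#include <assert.h>

/* ASSUMED rule, for illustration only: divide the transaction timeout
 * evenly across the first attempt plus retry_count retries. This is
 * not confirmed to match the Lustre implementation. */
static unsigned int sketch_lnd_timeout(unsigned int transaction_timeout,
				       unsigned int retry_count)
{
	assert(transaction_timeout > 0);
	return transaction_timeout / (retry_count + 1);
}
```

      Under this assumed rule, a retry count of 0 leaves the per-attempt timeout equal to the transaction timeout, and a transaction timeout of 150 with 2 retries gives 50 per attempt.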

      Am I reading the code correctly?

            Assignee: Amir Shehata (ashehata, Inactive)
            Reporter: Mahmoud Hanafi (mhanafi)
