Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-10875

Optimize o2iblnd wrq size

    XMLWordPrintable

Details

    • Bug
    • Resolution: Unresolved
    • Major
    • None
    • None
    • None
    • 3
    • 9223372036854775807

    Description

      Description written by Mike Marciniszyn

      The issue grows out of the issues:

      • Lustre creates a send queue size of 16193 with our default module parameters

       

      The key here is that the LNET_MAX_IOV sizes the QP based on the anticipated number of WQEs to consume a credit.

       

      Modern Lustre memory region use (FMR,FRMR*) uses much fewer WQEs to consume a credit.   Worse case 3 (vs. 256).

       

      It is possible to detour the memory foot print by setting hfi1 max_qp_wrs to 4095.  This will reduce the credits available to Lustre to 8, and will probably impact parallelism on the receive side.

      Attachments

        Issue Links

          Activity

            People

              sharmaso Sonia Sharma (Inactive)
              ashehata Amir Shehata (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

                Created:
                Updated: