[LU-10875] Optimize o2iblnd wrq size Created: 03/Apr/18  Updated: 06/Apr/18

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Major
Reporter: Amir Shehata (Inactive) Assignee: Sonia Sharma (Inactive)
Resolution: Unresolved Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

Description written by Mike Marciniszyn

This issue arises from the following observation:

  • Lustre creates a send queue of size 16193 with our default module parameters

 

The key point is that LNET_MAX_IOV is used to size the QP, based on the anticipated number of WQEs needed to consume one credit.

 

Modern Lustre memory region usage (FMR, FRMR*) needs far fewer WQEs to consume a credit: worst case 3 (vs. 256).
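
To make the gap concrete, here is a rough sketch of the arithmetic. It assumes a simplified model in which the send queue depth is just the per-credit WQE budget multiplied by the number of credits; the real o2iblnd sizing may include extra terms, so this does not attempt to reproduce the 16193 figure exactly. The function name and the credit count of 32 are hypothetical and chosen only for illustration; the 256 and 3 values come from the description above.

```python
# Figures taken from the ticket text.
LNET_MAX_IOV = 256        # WQEs budgeted per credit when the QP is sized today
MR_WORST_CASE_WQES = 3    # worst-case WQEs per credit with FMR/FRMR


def send_queue_wqes(wqes_per_credit: int, credits: int) -> int:
    """Simplified model (assumption): queue depth = WQEs per credit * credits."""
    if wqes_per_credit <= 0 or credits < 0:
        raise ValueError("wqes_per_credit must be > 0 and credits must be >= 0")
    return wqes_per_credit * credits


if __name__ == "__main__":
    credits = 32  # hypothetical credit count, not a Lustre default
    current = send_queue_wqes(LNET_MAX_IOV, credits)
    proposed = send_queue_wqes(MR_WORST_CASE_WQES, credits)
    print(f"sized by LNET_MAX_IOV: {current} WQEs")
    print(f"sized by MR worst case: {proposed} WQEs")
```

Under this model the queue shrinks by the ratio 256/3 for any credit count, which is why sizing by the memory-region worst case would cut the QP footprint without reducing credits.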

 

It is possible to work around the memory footprint by setting hfi1 max_qp_wrs to 4095. This reduces the credits available to Lustre to 8, and will probably impact parallelism on the receive side.

