Improve LNet Statistics (LU-14040)

[LU-11817] LNet: timing statistics Created: 19/Dec/18  Updated: 09/Dec/20

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Technical task Priority: Minor
Reporter: Amir Shehata (Inactive) Assignee: Cyril Bordage
Resolution: Unresolved Votes: 0
Labels: lnet

Issue Links:
Related
is related to LU-11816 LNet Health: Correct timeout defaults Resolved

 Description   

Add statistics that measure the average and maximum time spent in LNet and the LNDs for the following cases:
1. Average/maximum time taken to receive an LNet ACK/REPLY
2. Average/maximum time messages are queued on the LND outgoing queues
3. Average/maximum time LND transmits remain active (i.e., awaiting completion, either via a response or a completion event)

Add the ability to extract these values with lnetctl.
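
For illustration only, here is a minimal sketch of how an average/maximum timing counter for the three cases above could be accumulated. It is not taken from the patches on this ticket: the class `TimingStat`, its fields, and the dictionary keys are hypothetical names, and nanosecond timestamps are an assumption.

```python
from dataclasses import dataclass


@dataclass
class TimingStat:
    """Running average/maximum of observed durations (hypothetical helper)."""
    count: int = 0      # number of samples recorded
    total_ns: int = 0   # sum of all durations, used for the average
    max_ns: int = 0     # largest single duration seen so far

    def record(self, start_ns: int, end_ns: int) -> None:
        """Add one sample given start and end timestamps in nanoseconds."""
        elapsed = end_ns - start_ns
        if elapsed < 0:
            # Ignore samples from a non-monotonic clock rather than skew stats.
            return
        self.count += 1
        self.total_ns += elapsed
        if elapsed > self.max_ns:
            self.max_ns = elapsed

    @property
    def avg_ns(self) -> float:
        """Average duration, or 0.0 when nothing has been recorded."""
        return self.total_ns / self.count if self.count else 0.0


# One counter per case listed in the description (key names are assumptions).
stats = {
    "ack_reply_wait": TimingStat(),   # case 1: waiting for ACK/REPLY
    "lnd_queue_wait": TimingStat(),   # case 2: queued on LND outgoing queue
    "lnd_tx_active": TimingStat(),    # case 3: transmit awaiting completion
}

# Example: two messages spent 3000 ns and 2000 ns on the outgoing queue.
stats["lnd_queue_wait"].record(1_000, 4_000)
stats["lnd_queue_wait"].record(10_000, 12_000)
```

Keeping only a count, a running total, and a maximum means each sample costs constant time and memory, which matters on a message fast path. In a real measurement the timestamps would come from a monotonic clock so that wall-clock adjustments cannot produce negative durations.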



 Comments   
Comment by Gerrit Updater [ 19/Jan/19 ]

Amir Shehata (ashehata@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/34066
Subject: LU-11817 lnet: add timing statistics
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 328d1d5a026fe896858db773309a511e658ebcae

Comment by Gerrit Updater [ 19/Jan/19 ]

Amir Shehata (ashehata@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/34067
Subject: LU-11817 lnet: display time stats from lnetctl
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: e1acd66542d2ffdcccb9192cb0afe9e53879de94
