Type: Bug
Resolution: Unresolved
Priority: Critical
Fix Version/s: None
Affects Version/s: Lustre 2.12.0
Labels: None
Environment: CentOS 7.6
Severity: 3
Rank (Obsolete): 9223372036854775807

I'm investigating a metadata slowdown we had tonight on Fir. A simple find was extremely slow. However, by the time I started gathering stats, performance had recovered, so it seems OK now. I can still see a lot of ldlm_cancel RPCs, though, so I wanted to report it. I have a script (which I can share if needed) that takes a 5-second sample of Lustre RPCs on the MDS, and it shows a high rate of ldlm_cancel RPCs. I also see a lot of "Prolong DOM lock" entries in the full logs.
I'm attaching the output of my script as fir-md1-s2-lrpc-sample.log, which lists the NIDs seen in a 5-second rpctrace/rpc debug sample, along with each RPC type found and its count, for example:

Total_RPC_count_for_NID NID LND# RPC_type:count,RPC_type:count,...
3718 sh-107-42-ib0.ib o2ib4 mds_close:1285,ldlm_enqueue:1213,ldlm_cancel:1220
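
For anyone reading the attached sample log, here is a minimal Python sketch for parsing one line of the format shown above and computing the share of a given RPC type for that NID. It is not the script mentioned in this report; the names `NidSample`, `parse_sample_line`, and `rpc_share` are hypothetical, and it assumes the four whitespace-separated fields of the header line, with no spaces inside the RPC list.

```python
from typing import Dict, NamedTuple


class NidSample(NamedTuple):
    """One line of the per-NID RPC sample (hypothetical helper type)."""
    total: int            # Total_RPC_count_for_NID
    nid: str              # client NID as printed in the log
    lnd: str              # LND/network name, e.g. o2ib4
    rpcs: Dict[str, int]  # RPC type -> count


def parse_sample_line(line: str) -> NidSample:
    """Parse 'total NID LND type:count,type:count,...' into a NidSample."""
    total, nid, lnd, rpc_field = line.split()
    rpcs: Dict[str, int] = {}
    for item in rpc_field.split(","):
        # Split on the last ':' so the count is always the trailing field.
        name, _, count = item.rpartition(":")
        rpcs[name] = int(count)
    return NidSample(int(total), nid, lnd, rpcs)


def rpc_share(sample: NidSample, rpc_type: str) -> float:
    """Fraction of this NID's RPCs that are of the given type."""
    if sample.total == 0:
        return 0.0
    return sample.rpcs.get(rpc_type, 0) / sample.total


# The example line from this report.
sample = parse_sample_line(
    "3718 sh-107-42-ib0.ib o2ib4 "
    "mds_close:1285,ldlm_enqueue:1213,ldlm_cancel:1220"
)
print(f"{sample.nid}: ldlm_cancel share = {rpc_share(sample, 'ldlm_cancel'):.1%}")
```

For the example line, the three per-type counts add up to the reported total, so roughly a third of the RPCs from that client are ldlm_cancel.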

I'm also attaching a full 5-second rpctrace/rpc debug log of fir-md1-s2 (MDT0001 and MDT0003) as fir-md1-s2-rpctrace-rpc-20190430-5s.log.gz. I picked this server because it is the most loaded one.

I wonder whether the patch for ~~LU-10777~~ (DoM performance is bad with FIO write), which just landed in master, could help us in this case. Or, put differently: do resends trigger such ldlm_cancel RPCs?

fir-md1-s1-htop-20190506.png
558 kB
06/May/19 5:48 PM
fir-md1-s2-htop-20190506.png
526 kB
06/May/19 5:50 PM
fir-md1-s2-lrpc-sample.log
18 kB
30/Apr/19 7:27 AM
fir-md1-s2-lrpc-sample-best.log
36 kB
30/Apr/19 7:49 AM
fir-md1-s2-lrpc-sample-with-LU-10777.log
25 kB
30/Apr/19 3:38 PM
fir-md1-s2-rpctrace-rpc-20190430-5s.log.gz
21.19 MB
30/Apr/19 7:28 AM
fir-MDS-reqstats.png
86 kB
06/May/19 5:40 PM

Assignee: Oleg Drokin

Reporter: Stephane Thiell

Votes: 0

Watchers: 7

Created: 30/Apr/19 7:28 AM

Updated: 24/Jul/20 8:56 AM