[LU-10633] Convert MDS restoring RPC message to D_WARNING Created: 07/Feb/18  Updated: 31/Jan/22  Resolved: 31/Jan/22

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: Lustre 2.15.0

Type: Improvement Priority: Minor
Reporter: Chris Horn Assignee: Chris Horn
Resolution: Fixed Votes: 0
Labels: None

Rank (Obsolete): 9223372036854775807

 Description   

Cray debugged an issue where we identified a huge number of what appear to be duplicated requests to the MDS from various clients. This is highly abnormal, and while Lustre is able to handle a certain amount of this, it seems that this code turns up a bug that occurs when many requests are being replayed/restored.

It looked very much like a network issue, and some fabric maintenance saw the problem go away.

While we were not able to root cause the issue (though we suspect LU-2827 may have been at play), Patrick Farrell observed that we would've figured out the problem much sooner if a debug message was printed to the console and dk log under the default debug settings.

Restore/replay of a request is A) relatively rare, and B) even when handled without incident, indicates a strong possibility of something wrong, and merits a warning. Having this debug in place would have led to this being solved much more quickly.

I've opened this ticket to track the change to the debug message.



 Comments   
Comment by Gerrit Updater [ 07/Feb/18 ]

Chris Horn (hornc@cray.com) uploaded a new patch: https://review.whamcloud.com/31214
Subject: LU-10633 mdt: Convert MDS restoring RPC message to D_WARNING
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: e55ea65c8d872ffe365d8ff6915aaac4fca98f83

Comment by Gerrit Updater [ 31/Jan/22 ]

"Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/31214/
Subject: LU-10633 mdt: Convert MDS restoring RPC message to D_WARNING
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 3f8fb726881ff7447817d5dc1ea840b0d9029ddf

Comment by Peter Jones [ 31/Jan/22 ]

Landed for 2.15

Generated at Sat Feb 10 02:36:50 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.