Details
Bug
Resolution: Not a Bug
Major
None
Lustre 2.6.0, Lustre 2.7.0
3
9223372036854775807
Description
ldlm_cli_cancel_req returns immediately if it gets -EWOULDBLOCK (-EAGAIN) from an LDLM_CANCEL request it tried to send, and the client is then evicted before long.
Network errors are recoverable in many cases, so eviction can often be avoided. (Eviction is really annoying in a suppress_ping environment, so I want to avoid it as far as I can.)
That's why I believe ldlm_cli_cancel_req should retry LDLM_CANCEL here.
By the way, the patch I'll upload has worked well in Lustre-1.8.8-based FEFS for a very long time, and likewise in Lustre-2.6-based FEFS. I haven't seen any serious issues so far, so I'm convinced it will work in Lustre-2.x too.
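To illustrate the retry idea described above, here is a minimal userspace sketch. It is not the actual patch and not Lustre code: `struct fake_link`, `fake_send_cancel` and `cancel_with_retry` are hypothetical names made up for illustration, and the bounded attempt count is an assumption. It only shows the control flow of retrying a send that fails with -EAGAIN instead of returning on the first failure.

```c
#include <errno.h>

/* Hypothetical stand-in for the transport; NOT a Lustre structure. */
struct fake_link {
    int transient_failures; /* how many upcoming sends fail with -EAGAIN */
    int sends;              /* number of send attempts made so far */
};

/* Hypothetical stand-in for sending one LDLM_CANCEL request. */
static int fake_send_cancel(struct fake_link *link)
{
    link->sends++;
    if (link->transient_failures > 0) {
        link->transient_failures--;
        return -EAGAIN; /* same value as -EWOULDBLOCK on Linux */
    }
    return 0;
}

/*
 * Retry the cancel while the send reports -EAGAIN, up to max_attempts
 * sends in total. Any other result (success or a different error) is
 * returned immediately. Returns -EINVAL if max_attempts is not positive.
 */
static int cancel_with_retry(struct fake_link *link, int max_attempts)
{
    int rc = -EINVAL;
    int attempt;

    for (attempt = 0; attempt < max_attempts; attempt++) {
        rc = fake_send_cancel(link);
        if (rc != -EAGAIN)
            break;
    }
    return rc;
}
```

With two transient failures and a budget of five attempts, the third send succeeds and `cancel_with_retry` returns 0. If the failures outlast the budget, the caller still sees -EAGAIN, as it does today. A real change would also have to decide whether and how long to wait between attempts, which this sketch leaves out.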