Details

    • Type: Improvement
    • Resolution: Unresolved
    • Priority: Minor

    Description

      The LDLM_FL_CANCEL_ON_BLOCK flag was used by liblustre, but isn't used now.  The comment on it sort of explains why - it's for clients that can't reply reliably to BL callbacks, which is sort of a disaster waiting to happen (or, not waiting, as the case may be).

      This flag should be removed - its continued presence is confusing (at least to me!).
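A minimal C sketch of the semantics being removed, for readers unfamiliar with the flag. This is not actual Lustre code: the `demo_` names and the flag value are invented for illustration (the real `LDLM_FL_*` definitions live in the Lustre source and differ). The point is only the branch the flag controls: a blocking callback on a CANCEL_ON_BLOCK lock cancels the lock outright instead of waiting for the client's reply.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical flag bit, for illustration only. */
#define DEMO_FL_CANCEL_ON_BLOCK 0x0000800000000000ULL

struct demo_lock {
	uint64_t l_flags; /* lock flag bits, loosely modeled on struct ldlm_lock */
};

/* Sketch of the server-side decision the flag used to control: when a
 * blocking AST would be sent, a CANCEL_ON_BLOCK lock is cancelled
 * immediately rather than waiting for the client to acknowledge the
 * callback. This was intended for clients (liblustre) that could not
 * reply reliably to BL callbacks. */
bool demo_cancel_without_reply(const struct demo_lock *lock)
{
	return (lock->l_flags & DEMO_FL_CANCEL_ON_BLOCK) != 0;
}
```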


          Activity

            [LU-16564] Remove FL_CANCEL_ON_BLOCK

            paf0186 Patrick Farrell added a comment

            "Allowing misbehaving clients to degrade into a "synchronous write" mode without eviction, where the DLM locks can be cancelled arbitrarily by the server without waiting for reply, would allow such flakey clients to remain part of the filesystem (at a reduced performance level) without compromising the stability of well-behaved clients."

            Makes sense - this idea has been bouncing around in a few places.  So this would be specifically for those degraded clients.  But I think if a client is degraded, we'd just have it NOT use DLM locks (on the client, anyway)?  So the transition to degraded mode might, for example, cancel all existing locks?  That's perhaps a bit forceful, but simple.
            adilger Andreas Dilger added a comment - edited

            There is discussion in LU-17493 about restoring the cancel-on-block functionality as a first-class citizen, in order to make large clusters more reliable in the face of unstable network connectivity to clients.

            We've seen a few cases where clients have poor network connections (nearly inevitable as cluster size grows while using commodity Ethernet network hardware), and clients will sometimes lose their connection for a second or two. If an AST is sent during this time, the client may not get it, but will eventually restore the network connection in time to get the server resend, and otherwise remain alive in the eyes of the server.

            We've made the client and server connection "too robust" in some senses, to avoid client eviction that causes application failures and is otherwise disruptive, while allowing misbehaving clients to cause problems elsewhere in the cluster.

            If the flakey client is holding an important DLM lock (e.g. on the root directory) the AST loss/delay can block the whole cluster for tens of seconds, and the lack of client eviction means this can happen over and over again.

            Allowing misbehaving clients to degrade into a "synchronous write" mode without eviction, where the DLM locks can be cancelled arbitrarily by the server without waiting for reply, would allow such flakey clients to remain part of the filesystem (at a reduced performance level) without compromising the stability of well-behaved clients.

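The degraded-mode policy discussed above can be sketched in a few lines of C. This is a hypothetical model, not Lustre code: the `demo_` names, the state enum, and the threshold of 3 missed replies are all invented for illustration. It shows the proposed trade: after repeated missed AST replies, the server demotes the client instead of evicting it, and stops waiting for callback replies before cancelling that client's locks.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical client states, for illustration only. */
enum demo_client_state { DEMO_HEALTHY, DEMO_DEGRADED };

struct demo_client {
	enum demo_client_state state;
	int missed_ast_replies; /* blocking callbacks that timed out */
};

/* Sketch of the proposed policy: after too many missed AST replies,
 * demote the client to a degraded ("synchronous write") mode instead
 * of evicting it. The threshold here is illustrative. */
void demo_note_missed_ast(struct demo_client *c)
{
	if (++c->missed_ast_replies >= 3)
		c->state = DEMO_DEGRADED;
}

/* A degraded client's locks are cancelled without waiting for its
 * reply, so one flakey client can no longer stall the whole cluster. */
bool demo_server_waits_for_reply(const struct demo_client *c)
{
	return c->state == DEMO_HEALTHY;
}
```

This mirrors the comment's suggestion that degraded clients could simply stop using client-side DLM locks: once `demo_server_waits_for_reply()` returns false, the server is free to revoke locks unilaterally.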

            gerrit Gerrit Updater added a comment

            "Patrick Farrell <pfarrell@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/50028
            Subject: LU-16564 ldlm: Remove cancel on block
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 08c22f878fd1f571d8ddffc1e6c35d2ad202a966

            People

              Assignee: paf0186 Patrick Farrell
              Reporter: paf0186 Patrick Farrell
              Votes: 0
              Watchers: 3
