Details

    • Type: New Feature
    • Resolution: Unresolved
    • Priority: Minor
    • 16881

    Description

      Because most ptlrpc messages do not have an ACK, an RPC client cannot distinguish message loss from long service time. Also, in the current implementation, message resend can only be triggered by the RPC client after a service timeout, regardless of which message was lost in the RPC lifecycle.

      To improve Lustre RAS against message loss, we should allow message resend at any step of the RPC lifecycle. However, the current RPC client already has a request-message timeout/resend protocol and adaptive timeouts; it would need fundamental changes if we want to ACK request messages and use a network timeout instead of service time to trigger request-message resend. That would require significantly more effort and resources, so it is not covered by this document.

      Reply-resend is relatively simple and more practical: the RPC server can repeatedly resend the reply at a fixed time interval (e.g. 20 seconds), which should be sufficient even for the latency of routed environments. Reply-resend stops when there is an ACK for the reply message, or when the client is evicted or disconnected.
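      The reply-resend loop described above can be sketched as a small simulation (illustrative only, not Lustre code; the function name, the per-attempt delivery list, and the lifetime cap standing in for eviction/disconnect are all assumptions made here):

```python
def reply_resend(deliveries, interval=20, lifetime=200):
    """Resend a reply every `interval` seconds until one attempt is
    delivered (i.e. the client ACKs it) or `lifetime` elapses
    (standing in for client eviction/disconnect).

    `deliveries[i]` says whether the i-th (re)send reaches the client.
    Returns elapsed seconds until the ACK, or None if never ACKed.
    """
    elapsed = 0
    for delivered in deliveries:
        if delivered:
            return elapsed  # ACK received; stop resending
        elapsed += interval
        if elapsed >= lifetime:
            return None     # client evicted/disconnected
    return None             # ran out of attempts without an ACK
```

      For example, with the default 20-second interval, if the first two sends are lost and the third gets through, the reply is ACKed 40 seconds in; if every send is lost until the lifetime cap, resending stops.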

      Attachments

      Issue Links

      Activity

            [LU-10275] ptlrpc reply acknowledgement

            ashehata Amir Shehata (Inactive) added a comment - This looks like it's covered with the LNet Health work. I'll take a look at the docs in more detail to see what he had intended.

            adilger Andreas Dilger added a comment - Amir, how does this relate to our recent discussions about LNet Health and reply timeouts? Are these patches still useful?

            liang Liang Zhen (Inactive) added a comment - I will work on CORAL soon, so have to reassign this ticket to bobijam. I will maintain patches for a while, but can't finish the landing process.

            liang Liang Zhen (Inactive) added a comment - Andreas, yes I think I can do this, I will update the patch to make it optional.

            adilger Andreas Dilger added a comment - Liang, thanks for the data. It looks like the overhead is noticeable, but not so bad that the change is unusable.

            Is it possible to make this feature optional, so that we can turn it on or off to debug?

            liang Liang Zhen (Inactive) added a comment - Andreas, do you have any concern/comment on these data?

            liang Liang Zhen (Inactive) added a comment - performance data for ACKed lnet messages.
            liang Liang Zhen (Inactive) added a comment - - edited

            Andreas, because this patch can only improve reliability in one direction so far, I did some tests with message drop enabled for reply portals only. Without this patch, I got client evictions from time to time while running a random workload. After I applied this patch (because it is a small cluster, I set the reply-resend interval to a small value like 2-4 seconds instead of the default, so it can resend within AT), there were almost no client evictions. Of course we can't expect improvement like this in the real world because message drop can happen in both directions, but I think it is a good step to start.

            This feature can be applied on a per-client basis, or, put another way, we can upgrade any node without breaking interoperability: the feature is enabled only when both ends of the ptlrpc connection have this patch. It can also be enabled/disabled at runtime.

            I will attach some data. These data are not from this patch, because I collected them before I worked on this patch, but from a simple patch that enables ACK for all ptlrpc messages, so I think they are essentially the same. From these data, we lost about 10% performance for lightweight metadata operations (0-stripe) when we have ACK for all messages (both request & reply), so I assume we may lose 5% performance with reply-ACK only.
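            Why a 2-4 second resend interval keeps clients alive in that test can be made concrete with a toy model: if reply drops are independent with probability p, the chance that the initial send and every resend inside the adaptive-timeout window are all lost shrinks geometrically with the number of attempts. The drop rate and window size below are made-up illustration values, not numbers from the test cluster:

```python
def p_all_lost(drop_rate, at_window, interval):
    """Probability that the initial send and every resend within the
    AT window are all lost, assuming independent drops (toy model)."""
    attempts = at_window // interval + 1  # initial send plus resends
    return drop_rate ** attempts
```

            With a 50% drop rate and a 30-second AT window, a 3-second interval gives 11 attempts, so the client would be evicted only if all 11 sends are lost (about 0.05% of the time), versus a 50% chance with a single send and no resend.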

            adilger Andreas Dilger added a comment - Liang, have you done any testing on this to determine how much it improves reliability, recoverability, etc? Any idea on what kind of performance impact it has on normal operation? What kind of interoperability is needed for this (i.e. can it be done on a per-client basis, or do all clients need it, or what)?
            liang Liang Zhen (Inactive) added a comment - - edited patch list:
            http://review.whamcloud.com/#/c/13203
            http://review.whamcloud.com/#/c/13204
            http://review.whamcloud.com/#/c/13219
            http://review.whamcloud.com/#/c/13220
            http://review.whamcloud.com/#/c/13227
            http://review.whamcloud.com/#/c/13228
            http://review.whamcloud.com/#/c/13489

            People

              ashehata Amir Shehata (Inactive)
              liang Liang Zhen (Inactive)
              Votes: 0
              Watchers: 8
