[LU-2713] limit HSM RPC count from client - Whamcloud Community JIRA

Details

Type: Technical task
Resolution: Fixed
Priority: Blocker
Fix Version/s: Lustre 2.4.0
Affects Version/s: Lustre 2.4.0
Labels:
- MB

Rank (Obsolete):
6608

Description

The client-side HSM coordinator patches in http://review.whamcloud.com/5029 and http://review.whamcloud.com/5030 were landed, but Oleg realized that there are no client-side limits on the number of concurrent RPCs that can be sent.

This could overwhelm the MDS service threads and block all other requests if the threads become blocked handling HSM requests, or if the HSM requests are not processed quickly.

Please institute a client-side RPC limit for HSM requests, like cl_max_rpcs_in_flight, that enforces some reasonable bound.

The ticket is assigned to Jinshan, but only because we cannot currently assign it to someone external.


Issue Links

is related to

LU-2949 ensure MDC RPCs are controlled by max_rpcs_in_flight param

Closed

Activity


Peter Jones added a comment - 13/Mar/13 8:57 AM

Landed for 2.4


John Hammond added a comment - 06/Mar/13 1:26 PM

OK, thanks for the clarification.

Please see http://review.whamcloud.com/5616.


Andreas Dilger added a comment - 06/Mar/13 10:21 AM

John, the current RPC throttling mechanism for OSC and MDC RPCs is on the client. While this is not ideal, the problem is indeed that once the server has seen the request, it is too late to throttle it.

At this stage, we're just looking for an equivalent to max_rpcs_in_flight for the HSM requests, so they do not overwhelm the server.
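The idea of a client-side cap on outstanding requests can be sketched as follows. This is only an illustration of the general technique (a counting semaphore taken before each send), not the Lustre implementation; `InFlightLimiter`, `fake_hsm_rpc`, and `run_demo` are hypothetical names invented for the example, and the sleep stands in for a real RPC round trip.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor


class InFlightLimiter:
    """Hypothetical helper: caps how many requests a client has outstanding."""

    def __init__(self, max_in_flight):
        self._slots = threading.BoundedSemaphore(max_in_flight)
        self._lock = threading.Lock()
        self.in_flight = 0
        self.peak = 0  # highest concurrency observed, for the demo only

    def send(self, rpc):
        # Block on the client until a slot is free, so the server never
        # sees more than max_in_flight requests from this client at once.
        with self._slots:
            with self._lock:
                self.in_flight += 1
                self.peak = max(self.peak, self.in_flight)
            try:
                return rpc()
            finally:
                with self._lock:
                    self.in_flight -= 1


def fake_hsm_rpc():
    """Stand-in for an HSM request; the sleep models the round trip."""
    time.sleep(0.01)
    return 0


def run_demo(max_in_flight=3, callers=12):
    """Issue `callers` concurrent requests through one limiter."""
    limiter = InFlightLimiter(max_in_flight)
    with ThreadPoolExecutor(max_workers=callers) as pool:
        futures = [pool.submit(limiter.send, fake_hsm_rpc) for _ in range(callers)]
        results = [f.result() for f in futures]
    return limiter.peak, results
```

Because the wait happens before the request leaves the client, the server's service threads are never asked to absorb the excess, which is the property the comment above is asking for.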


jacques-charles lafoucriere added a comment - 05/Mar/13 8:30 PM

HSM requests are not blocking; they just record something to do on the MDT, and the restore/archive is done asynchronously by the coordinator. With the use of EAGAIN, the only risk is that slow clients are never served because fast ones are always taking the slots. We need a way to be sure all the clients make progress through their call lists.


John Hammond added a comment - 05/Mar/13 4:51 PM

I was proposing that the MDT keep a semaphore (as with cl_max_rpcs_in_flight) but that it do a non-blocking down. If the semaphore would block, then it returns -EAGAIN to the client. Then the client must wait and retry.

I understood that processing some HSM requests would put the MDT thread to sleep until the coordinator responded. Is that correct? I have only seen the stubbed out version of mdt_hsm.c. Will any of these handlers ever have to wait for tape?

In either case (waiting on the coordinator or waiting on tape) I think it must be handled as an unbounded wait by Lustre.

I confirm that I will work on a patch.
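The non-blocking-down proposal in this comment can be sketched roughly as below. It is a toy model under stated assumptions, not MDT code: `HsmSlots` and `send_with_retry` are hypothetical names, the retry delay and retry cap are arbitrary, and the server is modelled as an in-process object rather than a network service.

```python
import errno
import threading
import time


class HsmSlots:
    """Hypothetical server-side slot counter that refuses instead of blocking."""

    def __init__(self, max_slots):
        self._sem = threading.Semaphore(max_slots)

    def handle(self, work):
        # Non-blocking "down": the service thread never sleeps waiting
        # for a slot; a full server answers -EAGAIN immediately.
        if not self._sem.acquire(blocking=False):
            return -errno.EAGAIN
        try:
            work()
            return 0
        finally:
            self._sem.release()


def send_with_retry(server, work, delay=0.001, max_tries=1000):
    """Client side: wait and resend while the server answers -EAGAIN.

    Returns (return code, number of attempts made).
    """
    for attempt in range(1, max_tries + 1):
        rc = server.handle(work)
        if rc != -errno.EAGAIN:
            return rc, attempt
        time.sleep(delay)
    return -errno.EAGAIN, max_tries
```

Note that in this scheme every refused request has still reached the server, which is the objection raised elsewhere in this thread, and nothing here guarantees that a slow client eventually wins a slot.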


jacques-charles lafoucriere added a comment - 05/Mar/13 3:23 PM

As I understand it, the limit comes from the MDT's capacity to receive RPC requests, so an MDT-side limit would be better; but if the MDT had to count the requests, it would already have received them, which is too late. A client-side limit is a simple way to limit the load.

Can you confirm that you are working on a patch (so that I do not prepare one)?


People

Assignee: John Hammond

Reporter: Andreas Dilger

Votes: 0

Watchers: 7

Dates

Created: 30/Jan/13 6:00 PM

Updated: 13/Mar/13 8:57 AM

Resolved: 13/Mar/13 8:57 AM