[LU-5319] Support multiple slots per client in last_rcvd file - Whamcloud Community JIRA

Details

Type: New Feature
Resolution: Fixed
Priority: Minor
Fix Version/s: Lustre 2.8.0
Affects Version/s: Lustre 2.8.0
Labels:
- p4b
- patch
- performance
- recovery

Rank (Obsolete):
14856

Description

While running mdtest benchmark, I have observed that file creation and unlink operations from a single Lustre client quickly saturates to around 8000 iops: maximum is reached as soon as with 4 tasks in parallel.
When using several Lustre mount points on a single client node, the file creation and unlink rate do scale with the number of tasks, up to the 16 cores of my client node.

Looking at the code, it appears that most metadata operations are serialized by a mutex in the MDC layer.
In mdc_reint() routine, request posting is protected by mdc_get_rpc_lock() and mdc_put_rpc_lock(), where the lock is :
struct client_obd -> struct mdc_rpc_lock *cl_rpc_lock -> struct mutex rpcl_mutex.

After an email discussion with Andreas Dilger, it appears that the limitation is actually on the MDS, since it cannot handle more than a single filesystem-modifying RPC at one time. There is only one slot in the MDT last_rcvd file for each client to save the state for the reply in case it is lost.

The aim of this ticket is to implement multiple slots per client in the last_rcvd file so that several filesystem-modifying RPCs can be handled in parallel.

The single client metadata performance should be significantly improved while still ensuring a safe recovery mecanism.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending
- Thumbnails
- List
- Download All

mdtest lustre 2.5.60 file creation b.png
10 kB
31/Jul/14 6:57 AM
mdtest lustre 2.5.60 file removal b.png
10 kB
31/Jul/14 6:57 AM
MDTReplyReconstructionImprovement.architecture.pdf
456 kB
26/Sep/14 7:42 AM
MDTReplyReconstructionImprovement.design.pdf
853 kB
09/Apr/15 11:51 AM
MDTReplyReconstructionImprovement.testplan.pdf
615 kB
17/Jul/15 12:05 PM

Issue Links

is related to

LU-6840 update memory reply data in DNE update replay

Resolved

LU-5951 sanity test_39k: mtime is lost on close

Resolved

LU-6981 obd_last_committed is not updated in tgt_reply_data_init()

Resolved

LU-7729 Don't return ptlrpc_error() in process_req_last_xid().

Resolved

LU-7028 racer:kernel:BUG: spinlock bad magic on CPU#0

Resolved

LU-7082 conf-sanity test_90b: MDT start failed

Resolved

LU-7408 multislot RPC support didn't declare write for reply_data object

Resolved

LU-6841 replay-single test_30: multiop 20786 failed

Closed

LU-3285 Data on MDT

Resolved

LU-14144 get and set Lustre module parameters via "lctl get_param/set_param"

Open

LU-933 allow disabling the mdc_rpc_lock for performance testing

Resolved

LU-6753 Fix several minor improvements to multislots feature

Resolved

is related to

LU-6386 lower transno may overwrite the bigger one in client last_rcvd slot

Resolved

LU-7185 restore flags on ptlrpc_connect_import failure to prevent LBUG

Resolved

LU-7410 After downgrade from 2.8 to 2.5.5, hit unsupported incompat filesystem feature(s) 400

Resolved

LU-6864 DNE3: Support multiple modify RPCs in flight for MDT-MDT connection

Resolved

LUDOC-304 Updates related to support of multiple modify RPCs in parallel

Resolved

mentioned in: Page Loading...; Page Loading...

(7 is related to, 5 is related to , 2 mentioned in)

Activity

People

Assignee:: Alex Zhuravlev

Reporter:: Gregoire Pichon

Votes:: 0 Vote for this issue

Watchers:: 34 Start watching this issue

Dates

Created:: 10/Jul/14 3:00 PM

Updated:: 19/Oct/22 12:08 AM

Resolved:: 27/Aug/15 1:58 PM