[LU-14976] Changing tbf policy induces high CPU load - Whamcloud Community JIRA

Details

Type: Bug
Resolution: Fixed
Priority: Major
Fix Version/s: Lustre 2.16.0
Affects Version/s: None
Labels:
None
Environment:
Centos 7 VMs on Lustre 2.14

Severity:
3
Rank (Obsolete):
9223372036854775807

Description

Reproducer:

Activate "tbf gid" policy:
lctl set_param mds.MDS.mdt.nrs_policies="tbf gid"
Register a rule for a group (with a small rate value):
lctl set_param mds.MDS.mdt.nrs_tbf_rule="start eaujames gid={1000} rate=10"
Start metadata operations on the MDT with the limited gid (multithreaded file creations/deletions).
Once a request is queued inside the policy, change the policy to "tbf":
lctl set_param mds.MDS.mdt.nrs_policies="tbf"
Stop the metadata operations. Lustre then consumes 100% CPU on the CPU partition (CPT) where the request is queued.
On our production filesystem, all CPTs were impacted on MDT0001 (>100 RPCs in queue, load ~300) and one CPT was impacted on MDT0000 (1 RPC in queue, load ~90).

mds.MDS.mdt.nrs_policies=
regular_requests:
  - name: fifo
    state: started
    fallback: yes
    queued: 0
    active: 0  
  
  - name: crrn
    state: stopped
    fallback: no
    queued: 0
    active: 0
  
  - name: tbf
    state: started
    fallback: no
    queued: 1
    active: 0
  
  - name: delay
    state: stopped
    fallback: no
    queued: 0
    active: 0
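
The reproducer steps can be collected into a small script. This is only a minimal sketch, not the reporter's actual script: the helper names `lctl_cmd` and `reproduce` and the `RUN` switch are hypothetical, and the metadata workload is left as a placeholder because the ticket does not say which tool was used. By default it only prints the `lctl` commands, since running the sequence on an affected version is expected to leave ptlrpc threads spinning.

```shell
# Dry-run wrapper (hypothetical helper): prints the lctl command unless RUN=1.
lctl_cmd() {
    if [ "${RUN:-0}" = "1" ]; then
        lctl "$@"
    else
        echo "lctl $*"
    fi
}

# reproduce RULE_NAME GID RATE
# Follows the reproducer order from the description:
# "tbf gid" policy -> rate rule -> (workload) -> switch to "tbf" -> inspect.
reproduce() {
    lctl_cmd set_param mds.MDS.mdt.nrs_policies="tbf gid"
    lctl_cmd set_param mds.MDS.mdt.nrs_tbf_rule="start $1 gid={$2} rate=$3"
    # Placeholder: run multithreaded file creations/deletions as gid $2
    # from a client, and wait until the tbf policy reports queued > 0.
    echo "# workload for gid $2 goes here"
    lctl_cmd set_param mds.MDS.mdt.nrs_policies="tbf"
    lctl_cmd get_param mds.MDS.mdt.nrs_policies
}

reproduce eaujames 1000 10
```

Run it with `RUN=1` only on a disposable test MDS.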

When we try to change the policy to fifo, the tbf policy stays blocked in the "stopping" state:

mds.MDS.mdt.nrs_policies=
regular_requests:
  - name: fifo
    state: started
    fallback: yes 
    queued: 0    
    active: 0   

  - name: crrn
    state: stopped
    fallback: no
    queued: 0    
    active: 0  

  - name: tbf 
    state: stopping
    fallback: no
    queued: 1    
    active: 0   
  
  - name: delay
    state: stopped
    fallback: no
    queued: 0
    active: 0
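
A policy stuck in this way can be spotted by filtering the `nrs_policies` output for entries that are in the "stopping" state while still holding queued requests. The sketch below assumes the output layout shown above (`- name:`, `state:`, `queued:` lines in that order); the function name `stuck_policies` is hypothetical.

```shell
# Print "<policy> <queued>" for every policy that is stopping
# but still has requests queued. Reads nrs_policies output on stdin.
stuck_policies() {
    awk '
        $1 == "-" && $2 == "name:" { name = $3 }
        $1 == "state:"             { state = $2 }
        $1 == "queued:" {
            if (state == "stopping" && $2 > 0)
                print name, $2
        }
    '
}

# Usage on an MDS (requires lctl):
#   lctl get_param mds.MDS.mdt.nrs_policies | stuck_policies
```

Only "stopping" is flagged because a started tbf policy with queued requests is normal while a rate limit is being enforced.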

Analysis:

It seems that when we change the tbf policy ("tbf gid" -> "tbf"), the RPCs already queued inside "tbf gid" become inaccessible to the ptlrpc threads.

ptlrpc_wait_event wakes up when an RPC is available for processing. In this case, however, the ptlrpc thread cannot retrieve the request, so it wakes up continuously (causing the CPU load).

00000100:00000001:1.0:1630509978.890060:0:4749:0:(service.c:2029:ptlrpc_server_request_get()) Process leaving (rc=0 : 0 : 0)
00000100:00000001:0.0:1630509978.890060:0:5580:0:(service.c:2008:ptlrpc_server_request_get()) Process entered
00000100:00000001:2.0:1630509978.890061:0:5653:0:(service.c:2029:ptlrpc_server_request_get()) Process leaving (rc=0 : 0 : 0)
00000100:00000001:2.0:1630509978.890061:0:5653:0:(service.c:2248:ptlrpc_server_handle_request()) Process leaving (rc=0 : 0 : 0)
00000100:00000001:1.0:1630509978.890061:0:4749:0:(service.c:2248:ptlrpc_server_handle_request()) Process leaving (rc=0 : 0 : 0)
00000100:00000001:0.0:1630509978.890061:0:5580:0:(service.c:2029:ptlrpc_server_request_get()) Process leaving (rc=0 : 0 : 0)
00000100:00000001:0.0:1630509978.890061:0:5580:0:(service.c:2248:ptlrpc_server_handle_request()) Process leaving (rc=0 : 0 : 0)
00000100:00000001:1.0:1630509978.890062:0:4749:0:(service.c:2244:ptlrpc_server_handle_request()) Process entered
00000100:00000001:1.0:1630509978.890062:0:4749:0:(service.c:2008:ptlrpc_server_request_get()) Process entered
00000100:00000001:2.0:1630509978.890063:0:5653:0:(service.c:2244:ptlrpc_server_handle_request()) Process entered
00000100:00000001:2.0:1630509978.890063:0:5653:0:(service.c:2008:ptlrpc_server_request_get()) Process entered
00000100:00000001:1.0:1630509978.890063:0:4749:0:(service.c:2029:ptlrpc_server_request_get()) Process leaving (rc=0 : 0 : 0)
00000100:00000001:0.0:1630509978.890063:0:5580:0:(service.c:2244:ptlrpc_server_handle_request()) Process entered
00000100:00000001:2.0:1630509978.890064:0:5653:0:(service.c:2029:ptlrpc_server_request_get()) Process leaving (rc=0 : 0 : 0)

On my VM, for a single MDT thread, ptlrpc_server_handle_request() is called at a frequency of 300 kHz (doing nothing).
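
A call frequency like this can be estimated from a debug log dump by counting the "Process entered" lines of ptlrpc_server_handle_request() per thread and dividing by the elapsed time. The following is a rough sketch, assuming the colon-separated log layout shown above (timestamp in field 4 with a 6-digit microsecond fraction, thread pid in field 6); the function name `handle_request_rate` is hypothetical.

```shell
# Print "<pid> <calls> <calls per second>" for each thread, reading a
# Lustre debug log on stdin. Seconds and microseconds are handled
# separately to avoid losing precision on large epoch timestamps.
handle_request_rate() {
    awk -F: '
        $10 ~ /^ptlrpc_server_handle_request\(\)\) Process entered/ {
            pid = $6
            split($4, t, ".")
            if (!(pid in n)) { s0[pid] = t[1]; u0[pid] = t[2] }
            s1[pid] = t[1]; u1[pid] = t[2]
            n[pid]++
        }
        END {
            for (pid in n) {
                dt = (s1[pid] - s0[pid]) + (u1[pid] - u0[pid]) / 1000000
                if (n[pid] > 1 && dt > 0)
                    printf "%s %d %.0f\n", pid, n[pid], (n[pid] - 1) / dt
                else
                    printf "%s %d n/a\n", pid, n[pid]
            }
        }
    '
}

# Usage: handle_request_rate < tbf_cpu_load_after_dk.log
```

The rate is computed over the span between the first and last matching line of each thread, so a short excerpt gives only a coarse estimate.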

Attachments

change_tbf_policy_dk.log
4.33 MB
01/Sep/21 5:15 PM
tbf_cpu_load_after_dk.log
6.68 MB
01/Sep/21 5:15 PM
tbf_cpu_load_after.svg
41 kB
01/Sep/21 5:15 PM

Issue Links

is related to

LU-14364 Switching QoS from tbf uid to fifo caused soft lockup

Open

LU-16846 Fix concole messages in nrs.c

Resolved

is related to

LU-9885 Huge amount of costs in ptlrpc_wait_event() at file creation

Open

Activity

Gerrit Updater added a comment - 24/May/23 10:59 AM

"Etienne AUJAMES <eaujames@ddn.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/51119
Subject: LU-14976 nrs: change nrs policies at run time
Project: fs/lustre-release
Branch: b2_12
Current Patch Set: 1
Commit: bdb237e26e903d2eb9d7fb1697965c7234a431f5

Gerrit Updater added a comment - 24/May/23 10:35 AM

"Etienne AUJAMES <eaujames@ddn.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/51118
Subject: LU-14976 nrs: change nrs policies at run time
Project: fs/lustre-release
Branch: b2_15
Current Patch Set: 1
Commit: 8292d1a744b996a43acb8d1f34210d8f9b6c7581

Peter Jones added a comment - 22/Apr/23 6:31 PM

Landed for 2.16

Gerrit Updater added a comment - 22/Apr/23 5:27 PM

"Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/48523/
Subject: LU-14976 nrs: change nrs policies at run time
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: c098c09564a125dd44ffe0c135cd1cb6359229e7

Gerrit Updater added a comment - 12/Sep/22 11:40 AM

"Etienne AUJAMES <eaujames@ddn.com>" uploaded a new patch: https://review.whamcloud.com/48523
Subject: LU-14976 nrs: change nrs policies at run time
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 37530fe5fc53a80519e4334a3a295e690f03afbc

Gerrit Updater added a comment - 27/Oct/21 12:35 AM

"Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/44817/
Subject: LU-14976 ptlrpc: align function names with param names
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 7fe49f1e7cf0586da0f389188325014a8a13b849

Etienne Aujames added a comment - 02/Sep/21 6:50 AM

This issue occurred on a filesystem in production.

Here is the context:
A user was filling the changelog at 18k open/s (changelog usage jumped from 30% to 70% in one night), so the admin wanted to limit this user to avoid an MDT crash.
The active NRS policy was "tbf gid"; the admin changed it to "tbf" to limit the user by uid.

People

Assignee: Etienne Aujames

Reporter: Etienne Aujames

Votes: 0

Watchers: 6

Dates

Created: 01/Sep/21 4:52 PM

Updated: 27/Jun/24 3:27 PM

Resolved: 22/Apr/23 6:31 PM