[LU-10216] One question about enabling NRS_TBF in Lustre Created: 09/Nov/17  Updated: 20/Jul/18  Resolved: 20/Jul/18

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Question/Request Priority: Critical
Reporter: sebg-crd-pm (Inactive) Assignee: Emoly Liu
Resolution: Fixed Votes: 0
Labels: None
Environment:

Lustre 2.10.0/ Centos 7.3


Attachments: PNG File Selection_019.png     PNG File Selection_020.png    
Issue Links:
Related
is related to LUDOC-328 documentation updates for complex TBF... Open

 Description   

Hi All,

Our Lustre filesystem has 4 OSS servers and 2 MGS/MDS servers.

We are trying to enable NRS_TBF by using the following instructions on our MGS server.

After this, we want to verify whether these rules were added successfully by using the following instructions on one OSS server. The result is shown below.

 

Is this correct? If not, please kindly provide us with instructions to do this.

Also, how can we verify that these rules were changed and stopped (deleted) successfully when we use lctl set_param ost.OSS.ost_io.nrs_tbf_rule="change ossb1 500" and lctl set_param ost.OSS.ost_io.nrs_tbf_rule="stop ossb1", respectively?

 

Thanks!



 Comments   
Comment by Peter Jones [ 09/Nov/17 ]

Emoly

Can you please advise?

Thanks

Peter

Comment by sebg-crd-pm (Inactive) [ 10/Nov/17 ]

Hi Emoly, 

Please give us some suggestions for this issue.

We really need your help.

Thanks!

 

Comment by Emoly Liu [ 13/Nov/17 ]

Hi sebg-crd-pm,

The TBF rule format changed as of Lustre 2.8.55. The new format is as follows:

  • For NRS "tbf nid" policy:
    lctl set_param ost.OSS.ost_io.nrs_tbf_rule="start $RULE_NAME nid={$NID} rate=$RATE"
  • For NRS "tbf jobid" policy:
    lctl set_param ost.OSS.ost_io.nrs_tbf_rule="start $RULE_NAME jobid={$JOBID} rate=$RATE"
  • To change the rate of an existing rule:
    lctl set_param ost.OSS.ost_io.nrs_tbf_rule="change $RULE_NAME rate=$RATE"
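
As a rough illustration of the format above, the rule strings can be assembled by small shell helpers. This is only a sketch: the function names (tbf_start_nid, tbf_start_jobid, tbf_change_rate, tbf_stop) are hypothetical and not part of Lustre, and the script only prints the lctl command instead of running it.

```shell
#!/bin/sh
# Hypothetical helpers (not part of Lustre): they only build rule strings
# in the format listed above; nothing here talks to a Lustre server.
tbf_start_nid()   { printf 'start %s nid={%s} rate=%s\n' "$1" "$2" "$3"; }
tbf_start_jobid() { printf 'start %s jobid={%s} rate=%s\n' "$1" "$2" "$3"; }
tbf_change_rate() { printf 'change %s rate=%s\n' "$1" "$2"; }
tbf_stop()        { printf 'stop %s\n' "$1"; }

# Print the full command for review rather than executing it.
rule=$(tbf_start_nid ossb1 172.20.110.209@o2ib 100)
echo "lctl set_param ost.OSS.ost_io.nrs_tbf_rule=\"$rule\""
```

The rule name, NID, and rate in the example are the ones used elsewhere in this ticket; substitute your own values.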

So you need to update your commands to the new format for them to work. Here is my output for your reference:

[root@centos7-3 tests]# lctl set_param ost.OSS.ost_io.nrs_policies="tbf nid"
ost.OSS.ost_io.nrs_policies=tbf nid

[root@centos7-3 tests]# lctl set_param ost.OSS.ost_io.nrs_tbf_rule="start ossb1 nid={172.20.110.209@o2ib} rate=100"
ost.OSS.ost_io.nrs_tbf_rule=start ossb1 nid={172.20.110.209@o2ib} rate=100
[root@centos7-3 tests]# lctl set_param ost.OSS.ost_io.nrs_tbf_rule="start robin nid={172.20.110.212@o2ib} rate=200"
ost.OSS.ost_io.nrs_tbf_rule=start robin nid={172.20.110.212@o2ib} rate=200

[root@centos7-3 tests]# lctl get_param ost.OSS.ost_io.nrs_tbf_rule
ost.OSS.ost_io.nrs_tbf_rule=
regular_requests:
CPT 0:
ossb1 {172.20.110.209@o2ib} 100, ref 0
robin {172.20.110.212@o2ib} 200, ref 0
default {*} 10000, ref 0
high_priority_requests:
CPT 0:
ossb1 {172.20.110.209@o2ib} 100, ref 0
robin {172.20.110.212@o2ib} 200, ref 0
default {*} 10000, ref 0

[root@centos7-3 tests]# lctl set_param ost.OSS.ost_io.nrs_tbf_rule="change ossb1 rate=500"
ost.OSS.ost_io.nrs_tbf_rule=change ossb1 rate=500
[root@centos7-3 tests]# lctl get_param ost.OSS.ost_io.nrs_tbf_rule
ost.OSS.ost_io.nrs_tbf_rule=
regular_requests:
CPT 0:
ossb1 {172.20.110.209@o2ib} 500, ref 0
robin {172.20.110.212@o2ib} 200, ref 0
default {*} 10000, ref 0
high_priority_requests:
CPT 0:
ossb1 {172.20.110.209@o2ib} 500, ref 0
robin {172.20.110.212@o2ib} 200, ref 0
default {*} 10000, ref 0

[root@centos7-3 tests]# lctl set_param ost.OSS.ost_io.nrs_tbf_rule="stop ossb1"
ost.OSS.ost_io.nrs_tbf_rule=stop ossb1
[root@centos7-3 tests]# lctl get_param ost.OSS.ost_io.nrs_tbf_rule
ost.OSS.ost_io.nrs_tbf_rule=
regular_requests:
CPT 0:
robin {172.20.110.212@o2ib} 200, ref 0
default {*} 10000, ref 0
high_priority_requests:
CPT 0:
robin {172.20.110.212@o2ib} 200, ref 0
default {*} 10000, ref 0
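
To script the checks shown above, the get_param output can be filtered by rule name. The following is a minimal sketch that assumes the output layout is exactly as in the transcripts above (rule name in the first field, rate in the third field followed by a comma). The function names tbf_rule_rates and tbf_rule_exists are hypothetical, and the sample text is copied from the output above rather than read from a live server.

```shell
#!/bin/sh
# Hypothetical helpers: both read `lctl get_param ost.OSS.ost_io.nrs_tbf_rule`
# output on stdin and assume the line layout shown in this ticket.

# Print the rate of every line whose rule name matches $1
# (one line per request queue and CPT in which the rule appears).
tbf_rule_rates() {
    awk -v name="$1" '$1 == name { rate = $3; sub(/,$/, "", rate); print rate }'
}

# Exit 0 if a rule named $1 appears anywhere in the output, 1 otherwise.
tbf_rule_exists() {
    awk -v name="$1" '$1 == name { found = 1 } END { exit !found }'
}

# Sample input copied from the "change" step above.
sample='regular_requests:
CPT 0:
ossb1 {172.20.110.209@o2ib} 500, ref 0
robin {172.20.110.212@o2ib} 200, ref 0
default {*} 10000, ref 0'

printf '%s\n' "$sample" | tbf_rule_rates ossb1
```

On a real OSS the input would come from lctl instead, for example: lctl get_param ost.OSS.ost_io.nrs_tbf_rule | tbf_rule_rates ossb1 | sort -u. After a successful "stop", tbf_rule_exists should return non-zero for that rule name.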

HTH.
Emoly

Comment by Andreas Dilger [ 06/Dec/17 ]

Please note that "lctl set_param" commands are local to the server they are run on, and only in effect while the server is mounted, so these commands need to be run on all of the OSS nodes.
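
One way to do this is a small loop over the OSS nodes. This is a sketch only: the hostnames oss1 to oss4, the use of passwordless ssh as root, and the function name apply_tbf_rule are all assumptions, not something from this ticket. By default it runs in dry-run mode and just prints what it would execute.

```shell
#!/bin/sh
# Hypothetical node list; replace with the real OSS hostnames.
OSS_NODES="${OSS_NODES:-oss1 oss2 oss3 oss4}"
# DRY_RUN=1 (default) only prints the commands; set DRY_RUN=0 to run them.
DRY_RUN="${DRY_RUN:-1}"

# Apply one TBF rule command (start/change/stop ...) on every OSS node.
apply_tbf_rule() {
    rule=$1
    for node in $OSS_NODES; do
        cmd="lctl set_param ost.OSS.ost_io.nrs_tbf_rule=\"$rule\""
        if [ "$DRY_RUN" = 1 ]; then
            echo "ssh $node $cmd"
        else
            ssh "$node" "$cmd"
        fi
    done
}

apply_tbf_rule "start ossb1 nid={172.20.110.209@o2ib} rate=100"
```

Because the settings do not survive a remount, the same loop would need to be rerun after the OSS targets are restarted. The TBF policy itself (nrs_policies) also has to be enabled on each node before rules can be added, as in the transcript above.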

It looks like there is an open ticket for the NRS TBF documentation to be updated for the new functionality added by DDN - LUDOC-328.

Comment by Emoly Liu [ 29/Mar/18 ]

sebg-crd-pm,
Can we close this issue?
Thanks,
Emoly

Comment by sebg-crd-pm (Inactive) [ 20/Jul/18 ]

You can close it. Thanks.

Comment by Emoly Liu [ 20/Jul/18 ]

Thanks.
