<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:05:12 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-13905] A single stream performance regression in client 4.18.0-193.14.2.el8_2 kernel</title>
                <link>https://jira.whamcloud.com/browse/LU-13905</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;There is a single stream read performance regression with 4.18.0-193.14.2.el8_2 kernel. Here is test environment and a reproducer.&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;1 x client (1 x Gold 5218 CPU @ 2.30GHz, 96GB RAM, 1 x IB-HDR100)
CentOS8.2  (Tested kernel version: 4.18.0-147.el8.x86_64 and 4.18.0-193.14.2.el8_2.x86_64)
OFED-5.0-2.1.8.0
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;&#160;&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[root@ec01 ~]# lctl set_param osc.*.max_pages_per_rpc=16M osc.*.max_rpcs_in_flight=16 llite.*.max_read_ahead_mb=2048 llite.*.max_read_ahead_per_file_mb=N
[root@ec01 ~]# clush -w es400nvx1-vm[1-4],7990e3-vm[1-2],ec01 &quot;echo 3 &amp;gt; /proc/sys/vm/drop_caches&quot;
[root@ec01 ~]# /work/tools/bin/ior -r -t 1m -b 192g -e -o /es400nv/s/file -k  
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;At least, the behaviors with max_read_ahead_per_file_mb=64 (default) are different between two kernel versions 4.18.0-147.el8.x86_64 and 4.18.0-193.14.2.el8_2.x86_64.&lt;br/&gt;
 Here is what I&apos;ve tested on NVMe OST system.&lt;/p&gt;
&lt;div class=&apos;table-wrap&apos;&gt;
&lt;table class=&apos;confluenceTable&apos;&gt;&lt;tbody&gt;
&lt;tr&gt;
&lt;th class=&apos;confluenceTh&apos;&gt;&#160;&lt;/th&gt;
&lt;th class=&apos;confluenceTh&apos;&gt;4.18.0-147.el8.x86_64&lt;/th&gt;
&lt;th class=&apos;confluenceTh&apos;&gt;4.18.0-193.14.2.el8_2.x86_64&lt;/th&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td class=&apos;confluenceTd&apos;&gt;max_read_ahead_per_file_mb=64&lt;/td&gt;
&lt;td class=&apos;confluenceTd&apos;&gt;4252(MiB/s)&lt;/td&gt;
&lt;td class=&apos;confluenceTd&apos;&gt;2943(MiB/s)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td class=&apos;confluenceTd&apos;&gt;max_read_ahead_per_file_mb=128&lt;/td&gt;
&lt;td class=&apos;confluenceTd&apos;&gt;4186(MiB/s)&lt;/td&gt;
&lt;td class=&apos;confluenceTd&apos;&gt;4287(MiB/s)&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;
&lt;/div&gt;


&lt;p&gt;It was 30% slower performance with max_read_ahead_per_file_mb=64, but when it increased to 128, both performance were close.&lt;/p&gt;

&lt;p&gt;There is another results which was tested on HDD based OSTs.&lt;/p&gt;
&lt;div class=&apos;table-wrap&apos;&gt;
&lt;table class=&apos;confluenceTable&apos;&gt;&lt;tbody&gt;
&lt;tr&gt;
&lt;th class=&apos;confluenceTh&apos;&gt;&#160;&lt;/th&gt;
&lt;th class=&apos;confluenceTh&apos;&gt;4.18.0-147.el8.x86_64&lt;/th&gt;
&lt;th class=&apos;confluenceTh&apos;&gt;4.18.0-193.14.2.el8_2.x86_64&lt;/th&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td class=&apos;confluenceTd&apos;&gt;max_read_ahead_per_file_mb=64&lt;/td&gt;
&lt;td class=&apos;confluenceTd&apos;&gt;1578(MiB/s)&lt;/td&gt;
&lt;td class=&apos;confluenceTd&apos;&gt;1326(MiB/s)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td class=&apos;confluenceTd&apos;&gt;max_read_ahead_per_file_mb=128&lt;/td&gt;
&lt;td class=&apos;confluenceTd&apos;&gt;3396(MiB/s)&lt;/td&gt;
&lt;td class=&apos;confluenceTd&apos;&gt;2827(MiB/s)&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;
&lt;/div&gt;


&lt;p&gt;In this case, there was still ~16% performrance regressions in 4.18.0-193.14.2.el8_2.x86_64 regardless max_read_ahead_per_file_mb=64 or 128.&lt;/p&gt;</description>
                <environment>RHEL8.2 (kernel kernel 4.18.0-193.14.2.el8_2)&lt;br/&gt;
lustre-commit: c54b6ca  (master) </environment>
        <key id="60391">LU-13905</key>
            <summary>A single stream performance regression in client 4.18.0-193.14.2.el8_2 kernel</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="1" iconUrl="https://jira.whamcloud.com/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="default"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="sihara">Shuichi Ihara</reporter>
                        <labels>
                    </labels>
                <created>Wed, 12 Aug 2020 09:35:27 +0000</created>
                <updated>Wed, 12 Aug 2020 19:21:30 +0000</updated>
                                            <version>Lustre 2.14.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>2</watches>
                                                                            <comments>
                            <comment id="277291" author="wshilong" created="Wed, 12 Aug 2020 11:01:08 +0000"  >&lt;p&gt;The problem is somehow 4.18.0-147.el8.x86_64 schedule kworker more often than 4.18.0-147.el8.x86_64, we might need investigate what changes has been  applied for kernel work queue between this minor version updates.&lt;/p&gt;</comment>
                            <comment id="277313" author="wshilong" created="Wed, 12 Aug 2020 13:57:41 +0000"  >&lt;p&gt;not aware of specific workqueue changes, but could this be related to some cpupower frequency changes?  cpupower frequency-info to check if we could both reach performance mode?&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                                                <inwardlinks description="is related to">
                                                        </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i017rz:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10021"><![CDATA[2]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>