<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:17:10 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-8395] Limit lnet-selftest Threads</title>
                <link>https://jira.whamcloud.com/browse/LU-8395</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;On dense cored systems with hyperthreading active, you end up with a lot of execution threads.  An example would be a system with 256 total threads.&lt;/p&gt;

&lt;p&gt;The libcfs partition code divides those up into 16 partitions of 16 threads each (based on a math algorithm...not logical grouping based on hyperthreading).  &lt;/p&gt;

&lt;p&gt;Lnet-selftest then creates a group of scheduler threads in each partition equal to the number of execution threads minus 1 (so 15 per partition for our example).  That means it creates 240 scheduler threads.  This actually reduces performance rather than improves it.&lt;/p&gt;

&lt;p&gt;We need to limit the number of lnet-selftest scheduler threads to some reasonable number per partition.&lt;/p&gt;

&lt;p&gt;Note: another bug will be created to look at how we group the execution threads into partitions.  It needs to make more sense than it does.&lt;/p&gt;</description>
                <environment></environment>
        <key id="38161">LU-8395</key>
            <summary>Limit lnet-selftest Threads</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="4">Incomplete</resolution>
                                        <assignee username="ashehata">Amir Shehata</assignee>
                                    <reporter username="doug">Doug Oucharek</reporter>
                        <labels>
                    </labels>
                <created>Wed, 13 Jul 2016 22:26:01 +0000</created>
                <updated>Wed, 11 Dec 2019 19:52:57 +0000</updated>
                            <resolved>Wed, 11 Dec 2019 19:52:57 +0000</resolved>
                                                                        <due></due>
                            <votes>0</votes>
                                    <watches>5</watches>
                                                                            <comments>
                            <comment id="158754" author="gerrit" created="Wed, 13 Jul 2016 22:51:24 +0000"  >&lt;p&gt;Doug Oucharek (doug.s.oucharek@intel.com) uploaded a new patch: &lt;a href=&quot;http://review.whamcloud.com/21299&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/21299&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-8395&quot; title=&quot;Limit lnet-selftest Threads&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-8395&quot;&gt;&lt;del&gt;LU-8395&lt;/del&gt;&lt;/a&gt; lnet: Limit total number of lnet-selftest threads&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 9a3bc67c22e9c52b64a1a2eb3fad220ace85d434&lt;/p&gt;</comment>
                            <comment id="158970" author="dmiter" created="Fri, 15 Jul 2016 17:07:00 +0000"  >&lt;p&gt;Unfortunately this patch makes performance on KNL worse.&lt;/p&gt;</comment>
                            <comment id="158990" author="adilger" created="Fri, 15 Jul 2016 17:54:57 +0000"  >&lt;p&gt;Doug, Dmitry, could you please describe the parameters of your performance testing so that you can agree on what is being measured.  &lt;/p&gt;

&lt;p&gt;Doug, was the 64-thread limit based on anything concrete, or just a guess?  It might be useful to run some testing with different thread counts to see where the threshold is hit.  It might also be possible to determine at compile time (based on CPP constant for KNL) or at runtime (not sure how) to limit the threads differently for Xeon Phi vs. other systems with many cores.&lt;/p&gt;

&lt;p&gt;Dmitry, can you run your testing similarly, to see where the threshold is for reducing thread counts.  Do we need to have one LNet thread per core, or do we have some benefit to reserve a few cores for other tasks?&lt;/p&gt;</comment>
                            <comment id="158994" author="adilger" created="Fri, 15 Jul 2016 18:00:26 +0000"  >&lt;p&gt;Doug, as for the CPT partition calculations, this is discussed in &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-5050&quot; title=&quot;cpu partitioning oddities&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-5050&quot;&gt;&lt;del&gt;LU-5050&lt;/del&gt;&lt;/a&gt; and &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-7553&quot; title=&quot;Lustre cpu_npartitions default value breaks memory allocation on clients&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-7553&quot;&gt;&lt;del&gt;LU-7553&lt;/del&gt;&lt;/a&gt;, so this ticket should be focused on the LST thread count.&lt;/p&gt;

&lt;p&gt;There was one patch &lt;a href=&quot;http://review.whamcloud.com/17824&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/17824&lt;/a&gt; &quot;&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-5050&quot; title=&quot;cpu partitioning oddities&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-5050&quot;&gt;&lt;del&gt;LU-5050&lt;/del&gt;&lt;/a&gt; libcfs: default CPT matches NUMA topology&quot; that tried to change this, but it had to be reverted for a relatively minor reason, and should probably be revived.&lt;/p&gt;</comment>
                            <comment id="159067" author="dmiter" created="Mon, 18 Jul 2016 10:20:41 +0000"  >&lt;p&gt;This is results with different partitions count. The current settings looks much better. &lt;/p&gt;</comment>
                            <comment id="259639" author="adilger" created="Wed, 11 Dec 2019 19:52:57 +0000"  >&lt;p&gt;This configuration is not a development target today, and there is no real information in this ticket.  Maybe revisit this in the future.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="33696">LU-7553</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="24686">LU-5050</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                            <attachment id="22271" name="knl.png" size="13744" author="dmiter" created="Mon, 18 Jul 2016 10:20:41 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzyhfr:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>