<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:23:21 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-9114] Make MDS (And other server threads?) hog CPU less</title>
                <link>https://jira.whamcloud.com/browse/LU-9114</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;It&apos;s somewhat widely seen in various logs that pacemaker complaints its thread was not scheduled for tens of seconds which is way too excessive.&lt;br/&gt;
Indeed MDS is pretty cpu hungry, but we need to ensure we insert enough of schedule points so that other processes get a shot at CPU too.&lt;/p&gt;

&lt;p&gt;There are also some bandaids discussed like using numa settings to cordon off one cpu from use by Lustre, but those are just that - bandaids.&lt;/p&gt;

&lt;p&gt;We probably can play with various debug settings that warn about this and make the timeouts lower to try and catch more of the offenders. Likely have a bunch in flock code with its double loops&lt;/p&gt;</description>
                <environment></environment>
        <key id="43799">LU-9114</key>
            <summary>Make MDS (And other server threads?) hog CPU less</summary>
                <type id="4" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11310&amp;avatarType=issuetype">Improvement</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="adilger">Andreas Dilger</assignee>
                                    <reporter username="green">Oleg Drokin</reporter>
                        <labels>
                    </labels>
                <created>Tue, 14 Feb 2017 00:28:25 +0000</created>
                <updated>Thu, 17 Dec 2020 18:29:58 +0000</updated>
                            <resolved>Thu, 17 Dec 2020 18:29:58 +0000</resolved>
                                                    <fixVersion>Lustre 2.14.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>4</watches>
                                                                            <comments>
                            <comment id="192896" author="adilger" created="Thu, 20 Apr 2017 17:25:11 +0000"  >&lt;p&gt;In addition to checking if any one MDS thread running too long without scheduling, it may also be that the many MDS kernel threads are scheduled with a higher priority and prevent the userspace threads from being run.  I think for pacemaker and such, it makes sense to &lt;tt&gt;mlock()&lt;/tt&gt; the heartbeat daemons into memory (so they aren&apos;t swapped) and run them with realtime priority (or something like &lt;tt&gt;nice -15&lt;/tt&gt;) so that they can always get CPU time even when all of the MDS threads are running.&lt;/p&gt;</comment>
                            <comment id="216347" author="pjones" created="Thu, 14 Dec 2017 18:55:55 +0000"  >&lt;p&gt;Dmitry&lt;/p&gt;

&lt;p&gt;Can you please investigate this area as a longer term task for 2018&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="275637" author="gerrit" created="Fri, 17 Jul 2020 17:47:40 +0000"  >&lt;p&gt;Andreas Dilger (adilger@whamcloud.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/39435&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/39435&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9114&quot; title=&quot;Make MDS (And other server threads?) hog CPU less&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9114&quot;&gt;&lt;del&gt;LU-9114&lt;/del&gt;&lt;/a&gt; ldlm: don&apos;t compute sumsq for pool stats&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: bbe08711a531af70404ef1ba5ffe17815cede034&lt;/p&gt;</comment>
                            <comment id="287885" author="gerrit" created="Thu, 17 Dec 2020 17:00:48 +0000"  >&lt;p&gt;Oleg Drokin (green@whamcloud.com) merged in patch &lt;a href=&quot;https://review.whamcloud.com/39435/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/39435/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9114&quot; title=&quot;Make MDS (And other server threads?) hog CPU less&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9114&quot;&gt;&lt;del&gt;LU-9114&lt;/del&gt;&lt;/a&gt; ldlm: don&apos;t compute sumsq for pool stats&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 966f6bb550be52e4bf5dd7fd38a0d707fe2a5072&lt;/p&gt;</comment>
                            <comment id="287917" author="pjones" created="Thu, 17 Dec 2020 18:29:58 +0000"  >&lt;p&gt;Landed for 2.14&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="22595">LU-4423</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzz3pz:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>