<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:27:32 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-2708] MDS thrashing in ptlrpc_alloc_rqbd</title>
                <link>https://jira.whamcloud.com/browse/LU-2708</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;We have had some problems in recent weeks with the MDS on grove (sequoia&apos;s filesystem cluster) thrashing for anywhere from minutes to many hours while under load.  While it does so, it is not appear to be handling traffic very quickly, and the node load is so high that login is nearly impossible.&lt;/p&gt;

&lt;p&gt;I caught it doing that for a while today during testing and dumped some SysRq info to the console.&lt;/p&gt;

&lt;p&gt;It looks to me like the active tasks may be spending too much time under ptlrpc_alloc_rqbd() doing vmallocs.&lt;/p&gt;

&lt;p&gt;Prakash had a patch to move those allocations to a slab.  But it became time consuming to keep moving forward.  We may need to look at reviving that.&lt;/p&gt;

&lt;p&gt;See attached file &quot;console.grove-mds1.txt.bz2&quot;.&lt;/p&gt;</description>
                <environment>Lustre 2.3.58-6chaos (github.com/chaos/lustre) on MDS.</environment>
        <key id="17361">LU-2708</key>
            <summary>MDS thrashing in ptlrpc_alloc_rqbd</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="1" iconUrl="https://jira.whamcloud.com/images/icons/priorities/blocker.svg">Blocker</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="liang">Liang Zhen</assignee>
                                    <reporter username="morrone">Christopher Morrone</reporter>
                        <labels>
                            <label>HB</label>
                            <label>sequoia</label>
                            <label>topsequoia</label>
                    </labels>
                <created>Tue, 29 Jan 2013 20:48:58 +0000</created>
                <updated>Wed, 6 Mar 2013 20:42:07 +0000</updated>
                            <resolved>Wed, 6 Mar 2013 20:42:07 +0000</resolved>
                                    <version>Lustre 2.4.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>5</watches>
                                                                            <comments>
                            <comment id="51431" author="morrone" created="Tue, 29 Jan 2013 21:01:24 +0000"  >&lt;p&gt;Related to &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-2432&quot; title=&quot;ptlrpc_alloc_rqbd spinning on vmap_area_lock on MDS&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-2432&quot;&gt;&lt;del&gt;LU-2432&lt;/del&gt;&lt;/a&gt;.  However, I verified that we are running with the kernel patch that fixes the kernel vmalloc problem for the problem noted in this ticket.&lt;/p&gt;</comment>
                            <comment id="51439" author="liang" created="Tue, 29 Jan 2013 23:23:16 +0000"  >&lt;p&gt;I have posted another patch : &lt;a href=&quot;http://review.whamcloud.com/#change,4940&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#change,4940&lt;/a&gt;&lt;br/&gt;
it will not fundamentally resolve the issue, but it will largely reduce request buffer size/count, and save a lot of memory. &lt;br/&gt;
But I think it has to be reviewed by Andreas first.&lt;/p&gt;</comment>
                            <comment id="53488" author="liang" created="Wed, 6 Mar 2013 20:42:07 +0000"  >&lt;p&gt;we have landed two patches for this:&lt;br/&gt;
&lt;a href=&quot;http://review.whamcloud.com/#change,4939&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#change,4939&lt;/a&gt; can largely decrease threads number contending on vmalloc.&lt;br/&gt;
&lt;a href=&quot;http://review.whamcloud.com/#change,4940&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#change,4940&lt;/a&gt; can improve buffer utilization rate, and decrease chance to call vmalloc.&lt;br/&gt;
So I think we can close this ticket for now.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="16860">LU-2432</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                            <attachment id="12218" name="console.grove-mds1.txt.bz2" size="404062" author="morrone" created="Tue, 29 Jan 2013 20:48:58 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzvgg7:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>6314</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>