<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:36:26 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-10588] lfsck generates &quot;kernel: list passed to list_sort() too long for efficiency&quot;</title>
                <link>https://jira.whamcloud.com/browse/LU-10588</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Just wanted to report these kernel messages on the MDS seen&#160;shortly after having started lfsck_namespace and oi_scrub. I know they clearly lack some information but seem be related to lfsck.&lt;/p&gt;

&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;Jan 30 14:11:36 oak-md1-s2 kernel: list passed to list_sort() too &lt;span class=&quot;code-object&quot;&gt;long&lt;/span&gt; &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; efficiency
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Stephane&lt;/p&gt;</description>
                <environment>3.10.0-693.2.2.el7_lustre.pl1.x86_64</environment>
        <key id="50480">LU-10588</key>
            <summary>lfsck generates &quot;kernel: list passed to list_sort() too long for efficiency&quot;</summary>
                <type id="4" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11310&amp;avatarType=issuetype">Improvement</type>
                                            <priority id="5" iconUrl="https://jira.whamcloud.com/images/icons/priorities/trivial.svg">Trivial</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="5">Cannot Reproduce</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="sthiell">Stephane Thiell</reporter>
                        <labels>
                    </labels>
                <created>Tue, 30 Jan 2018 22:37:06 +0000</created>
                <updated>Wed, 13 Feb 2019 08:03:13 +0000</updated>
                            <resolved>Wed, 13 Feb 2019 08:03:08 +0000</resolved>
                                                                        <due></due>
                            <votes>0</votes>
                                    <watches>5</watches>
                                                                            <comments>
                            <comment id="219579" author="adilger" created="Wed, 31 Jan 2018 18:30:37 +0000"  >&lt;p&gt;Stephane,&lt;br/&gt;
There is no direct caller for &lt;tt&gt;list_sort&lt;/tt&gt; in the Lustre tree, not any use of &quot;sort&quot; in LFSCK at all, so there is no way to know what is causing this message. &lt;/p&gt;

&lt;p&gt;One option would be to change, &lt;tt&gt;list_sort()&lt;/tt&gt; to call &lt;tt&gt;WARN_ONCE()&lt;/tt&gt;, like:&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;        &lt;span class=&quot;code-keyword&quot;&gt;if&lt;/span&gt; (lev &amp;gt; max_lev) {
                &lt;span class=&quot;code-keyword&quot;&gt;if&lt;/span&gt; (WARN_ONCE(lev &amp;gt;= AREAY_SIZE(part) - 1,
                              &lt;span class=&quot;code-quote&quot;&gt;&quot;list too &lt;span class=&quot;code-object&quot;&gt;long&lt;/span&gt; &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; efficiency (%d &amp;gt;= %d)\n&quot;&lt;/span&gt;, lev,
                              MAX_LIST_LENGTH_BITS))
                         lev--;
                max_lev = lev;
        }
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;This will dump the stack trace of the thread that hits this problem. &lt;/p&gt;</comment>
                            <comment id="219601" author="sthiell" created="Wed, 31 Jan 2018 22:55:30 +0000"  >&lt;p&gt;Ah thanks much Andreas! Will do when possible and update the ticket if I got something. This might take some time though.&lt;/p&gt;

&lt;p&gt;Stephane&lt;/p&gt;</comment>
                            <comment id="224200" author="pjones" created="Wed, 21 Mar 2018 16:46:30 +0000"  >&lt;p&gt;Fan Yong&lt;/p&gt;

&lt;p&gt;Anything else to suggest here?&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="224381" author="yong.fan" created="Fri, 23 Mar 2018 10:02:50 +0000"  >&lt;p&gt;In Lustre, the unique caller of list_sort() is __ldiskfs_es_shrink() that will be called by system under the case of system RAM pressure to release some RAM from extent status tree.&lt;/p&gt;

&lt;p&gt;Currently, only with the message &quot;Jan 30 14:11:36 oak-md1-s2 kernel: list passed to list_sort() too long for efficiency&quot;, I can NOT say whether it is the __ldiskfs_es_shrink() triggered such message or not.&lt;/p&gt;

&lt;p&gt;But by default, Lustre only enables file extents for OST, NOT for MDT. But in this case, the message was printed on the MDS. That is strange. Means even if LFSCK caused too many inodes cached (that is controlled by system), the extent status tree on the MDT would be almost empty and should not cause list_sort() warning.&lt;/p&gt;</comment>
                            <comment id="224384" author="yong.fan" created="Fri, 23 Mar 2018 10:35:46 +0000"  >&lt;p&gt;Anyway, we need the stack trace to know who triggered the list_sort() warning.&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/ViewProfile.jspa?name=sthiell&quot; class=&quot;user-hover&quot; rel=&quot;sthiell&quot;&gt;sthiell&lt;/a&gt;,&lt;/p&gt;

&lt;p&gt;Have you got the stack trace for the list_sort() sponsor? Have you checked the RAM usage on MDT when list_sort() trouble happened?&lt;/p&gt;</comment>
                            <comment id="224409" author="sthiell" created="Fri, 23 Mar 2018 15:30:51 +0000"  >&lt;p&gt;Hi nasf,&lt;/p&gt;

&lt;p&gt;Thanks for this useful information! We&apos;re indeed using ldiskfs. But this is currently low priority for us and as we&apos;re testing a patched kernel already for another important issue, we decided not to include any other change for now. Reproducibility, you know... &lt;img class=&quot;emoticon&quot; src=&quot;https://jira.whamcloud.com/images/icons/emoticons/smile.png&quot; height=&quot;16&quot; width=&quot;16&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/p&gt;

&lt;p&gt;When things get settled for us with 2.10.x, I&apos;ll patch the kernel to get more debugging from this warning as suggested by Andreas.&lt;/p&gt;

&lt;p&gt;Thanks again.&lt;/p&gt;

&lt;p&gt;Stephane&lt;/p&gt;</comment>
                            <comment id="241845" author="adilger" created="Wed, 13 Feb 2019 08:03:08 +0000"  >&lt;p&gt;Closing this old issue since there is no information, please re-open if more information becomes available.&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzzrxz:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>