<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:59:59 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-13285] multiple DF returns bad info</title>
                <link>https://jira.whamcloud.com/browse/LU-13285</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Hey all,&lt;br/&gt;
Ive been struggling with a problem with our newly updated lustre 2.12 cluster, and I don&apos;t really know if its a bug, or configuration problem, or what.&lt;br/&gt;
So here&apos;s the setup: I&apos;ve recently set up a small 2-OST single MDT 2.10 cluster to emulate our production cluster, and test the process of upgrading to 2.12.4. The upgrade went fine, however there is a problem with how df reports space on the lustre filesystem that is causing problems with our processing software. The software includes a df check to make sure the filesystem isn&apos;t too full before beginning a job. The problem is, that when multiple df commands are run against the lustre filesystem from the same client, occasionally the command will return a 0 in the available field, which in turn makes the software think the filesystem is full, then drop jobs. I can test this by running &apos;while [ true ];do /bin/df -TP /performance;done&apos; on two sessions on the same client. As soon as I start the second while loop, the outputs go from:&lt;br/&gt;
Filesystem &#160; &#160; &#160; &#160; &#160; &#160; &#160; &#160; Type &#160; 1024-blocks &#160; Used Available Capacity Mounted on&lt;br/&gt;
 192.168.0.181@tcp:/perform lustre &#160; &#160;71467728 100416 &#160;67664944 &#160; &#160; &#160; 1% /performance&lt;br/&gt;
&#160;&lt;br/&gt;
to:&lt;br/&gt;
Filesystem &#160; &#160; &#160; &#160; &#160; &#160; &#160; &#160; Type &#160; 1024-blocks &#160;Used Available Capacity Mounted on&lt;br/&gt;
 192.168.0.181@tcp:/perform lustre &#160; &#160; &#160; &#160; &#160; 0 &#160; &#160;-0 &#160; &#160; &#160; &#160;-0 &#160; &#160; &#160;50% /performance&lt;br/&gt;
I am using lustre 2.12.4 on the client as well, so Ive ruled out version mismatch issues at least.&lt;br/&gt;
&#160;&lt;br/&gt;
I&apos;ve checked all the mount settings between the prod 2.10 cluster and the dev 2.12 cluster, and everything I can find looks the same. The 2.10 prod cluster does not have this problem, and the dev cluster did not have the problem before upgrading from 2.10.&lt;br/&gt;
&#160;&lt;br/&gt;
I have posted this in the lustre-discuss mailing list and Nathan Dauchy suggested I open a Jira issue so I could upload an strace of the failure.&lt;/p&gt;</description>
                <environment>CentOS7.7.1908</environment>
        <key id="58166">LU-13285</key>
            <summary>multiple DF returns bad info</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="3">Duplicate</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="kkonzem">Kevin Konzem</reporter>
                        <labels>
                    </labels>
                <created>Fri, 21 Feb 2020 16:02:37 +0000</created>
                <updated>Mon, 6 Apr 2020 23:36:06 +0000</updated>
                            <resolved>Mon, 6 Apr 2020 23:36:06 +0000</resolved>
                                    <version>Lustre 2.12.4</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>9</watches>
                                                                            <comments>
                            <comment id="263930" author="dauchy" created="Mon, 24 Feb 2020 22:39:47 +0000"  >&lt;p&gt;Kevin,&#160; it looks like you did the strace on the bash process, not on &apos;df&apos; itself, so the data may not be terribly useful to developers.&lt;/p&gt;

&lt;p&gt;I was able to catch a similar problem on our system, and the strace shows that the statfs() call is returning incorrect data.&#160; Here is a &quot;good&quot; and a &quot;bad&quot; run for comparison:&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;
statfs(&lt;span class=&quot;code-quote&quot;&gt;&quot;/mnt/lfs1&quot;&lt;/span&gt;, {f_type=0xbd00bd0, f_bsize=4096, f_blocks=949817358228, f_bfree=378438906913, f_bavail=368840556468, f_files=3795357040, f_ffree=3500647829, f_fsid={1050737646, 0}, f_namelen=255, f_frsize=4096, f_flags=ST_VALID}) = 0


statfs(&lt;span class=&quot;code-quote&quot;&gt;&quot;/mnt/lfs1&quot;&lt;/span&gt;, {f_type=0xbd00bd0, f_bsize=0, f_blocks=0, f_bfree=0, f_bavail=0, f_files=18446618905756391232, f_ffree=132349083419, f_fsid={1050737646, 0}, f_namelen=0, f_frsize=0, f_flags=ST_VALID}) = 0
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;&#160;&lt;br/&gt;
 The recent change that (to me anyway) seems related is &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-12368&quot; title=&quot;concurrent statfs() calls on the client should be blocked&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-12368&quot;&gt;&lt;del&gt;LU-12368&lt;/del&gt;&lt;/a&gt;. Do you have that in your client build?&lt;/p&gt;</comment>
                            <comment id="264085" author="kkonzem" created="Wed, 26 Feb 2020 14:42:32 +0000"  >&lt;p&gt;My bad, sorry about that. I found a better way to strace the actual df command, attached is a good example&#160;&lt;span class=&quot;nobr&quot;&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/attachment/34351/34351_df.out&quot; title=&quot;df.out attached to LU-13285&quot;&gt;df.out&lt;sup&gt;&lt;img class=&quot;rendericon&quot; src=&quot;https://jira.whamcloud.com/images/icons/link_attachment_7.gif&quot; height=&quot;7&quot; width=&quot;7&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/sup&gt;&lt;/a&gt;&lt;/span&gt; and a bad example &lt;span class=&quot;nobr&quot;&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/attachment/34352/34352_df2.out&quot; title=&quot;df2.out attached to LU-13285&quot;&gt;df2.out&lt;sup&gt;&lt;img class=&quot;rendericon&quot; src=&quot;https://jira.whamcloud.com/images/icons/link_attachment_7.gif&quot; height=&quot;7&quot; width=&quot;7&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/sup&gt;&lt;/a&gt;&lt;/span&gt;.&lt;/p&gt;

&lt;p&gt;I looked at &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-12368&quot; title=&quot;concurrent statfs() calls on the client should be blocked&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-12368&quot;&gt;&lt;del&gt;LU-12368&lt;/del&gt;&lt;/a&gt;, and while that did look promising, I installed 2.13 on a client to try it out, but the bug remained. Should I try installing 2.13 on the server as well, or is that part of the code only handled by the client?&lt;/p&gt;

&lt;p&gt;Also, I tried running &apos;lfs df&apos; instead of &apos;df&apos;, but I got the same response. When ran in a loop on two sessions on the same client, it worked fine on 2.10, but intermittently failed on 2.12/2.13.&lt;/p&gt;</comment>
                            <comment id="264107" author="dauchy" created="Wed, 26 Feb 2020 18:22:36 +0000"  >&lt;p&gt;Another possibly related ticket is &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-13296&quot; title=&quot;statfs isn&amp;#39;t work properly with MDT statfs proxy&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-13296&quot;&gt;&lt;del&gt;LU-13296&lt;/del&gt;&lt;/a&gt; (statfs isn&apos;t work properly with MDT statfs proxy), which tracks a regression introduced by &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10018&quot; title=&quot;MDT as a statfs proxy&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10018&quot;&gt;&lt;del&gt;LU-10018&lt;/del&gt;&lt;/a&gt; (MDT as a statfs proxy).&lt;/p&gt;</comment>
                            <comment id="264191" author="spitzcor" created="Thu, 27 Feb 2020 22:56:53 +0000"  >&lt;p&gt;Yes, I was just popping in here after following your conversation on lustre-discuss.  &lt;a href=&quot;https://jira.whamcloud.com/secure/ViewProfile.jspa?name=kkonzem&quot; class=&quot;user-hover&quot; rel=&quot;kkonzem&quot;&gt;kkonzem&lt;/a&gt;, I think you should try your reproducer against the patch in &lt;a href=&quot;https://review.whamcloud.com/37753&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/37753&lt;/a&gt;.  I hope it will work for you.  There is a simplified reproducer as a part of that patch too.&lt;/p&gt;</comment>
                            <comment id="266987" author="adilger" created="Mon, 6 Apr 2020 23:36:00 +0000"  >&lt;p&gt;The patch &lt;a href=&quot;https://review.whamcloud.com/37753&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/37753&lt;/a&gt; &quot;&lt;tt&gt;&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-13296&quot; title=&quot;statfs isn&amp;#39;t work properly with MDT statfs proxy&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-13296&quot;&gt;&lt;del&gt;LU-13296&lt;/del&gt;&lt;/a&gt; obd: make statfs cache working again&lt;/tt&gt;&quot; was landed to master for 2.14 and backported to b2_12 for 2.12.5.&lt;/p&gt;
</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="58185">LU-13296</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                            <attachment id="34351" name="df.out" size="5123" author="kkonzem" created="Wed, 26 Feb 2020 14:11:40 +0000"/>
                            <attachment id="34352" name="df2.out" size="5182" author="kkonzem" created="Wed, 26 Feb 2020 14:12:25 +0000"/>
                            <attachment id="34326" name="dftest.txt" size="5367415" author="kkonzem" created="Fri, 21 Feb 2020 16:02:34 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i00u87:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>