<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:54:20 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-5765] sanity test_123a test_123b: rm: no such file or directory</title>
                <link>https://jira.whamcloud.com/browse/LU-5765</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;This issue was created by maloo for Andreas Dilger &amp;lt;andreas.dilger@intel.com&amp;gt;&lt;/p&gt;

&lt;p&gt;In sanity.sh test_123a and test_123b a large number of errors are being reported when &quot;rm -r&quot; is running:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;rm: cannot remove `/mnt/lustre/d123a.sanity/f123a.sanity2&apos;: No such file or directory
rm: cannot remove `/mnt/lustre/d123a.sanity/f123a.sanity5&apos;: No such file or directory
rm: cannot remove `/mnt/lustre/d123a.sanity/f123a.sanity8&apos;: No such file or directory
rm: cannot remove `/mnt/lustre/d123a.sanity/f123a.sanity11&apos;: No such file or directory
rm: cannot remove `/mnt/lustre/d123a.sanity/f123a.sanity12&apos;: No such file or directory
rm: cannot remove `/mnt/lustre/d123a.sanity/f123a.sanity16&apos;: No such file or directory
rm: cannot remove `/mnt/lustre/d123a.sanity/f123a.sanity18&apos;: No such file or directory
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;This issue relates to the following test suite run: &lt;a href=&quot;https://testing.hpdd.intel.com/test_sets/186592e2-5577-11e4-8542-5254006e85c2&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.hpdd.intel.com/test_sets/186592e2-5577-11e4-8542-5254006e85c2&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Info required for matching: sanity 123a&lt;br/&gt;
Info required for matching: sanity 123b&lt;/p&gt;</description>
                <environment></environment>
        <key id="27089">LU-5765</key>
            <summary>sanity test_123a test_123b: rm: no such file or directory</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="2" iconUrl="https://jira.whamcloud.com/images/icons/priorities/critical.svg">Critical</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="isaac">Isaac Huang</assignee>
                                    <reporter username="maloo">Maloo</reporter>
                        <labels>
                            <label>HB</label>
                            <label>zfs</label>
                    </labels>
                <created>Fri, 17 Oct 2014 20:26:17 +0000</created>
                <updated>Wed, 14 Jan 2015 18:32:17 +0000</updated>
                            <resolved>Wed, 14 Jan 2015 18:32:17 +0000</resolved>
                                    <version>Lustre 2.7.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>8</watches>
                                                                            <comments>
                            <comment id="96817" author="utopiabound" created="Tue, 21 Oct 2014 12:28:32 +0000"  >&lt;p&gt;Happened on lustre-rsync-test test_2b on review-dne-part-1 on master:&lt;br/&gt;
&lt;a href=&quot;https://testing.hpdd.intel.com/test_sets/d99c68e6-5653-11e4-b972-5254006e85c2&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.hpdd.intel.com/test_sets/d99c68e6-5653-11e4-b972-5254006e85c2&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="97360" author="yong.fan" created="Fri, 24 Oct 2014 02:59:02 +0000"  >&lt;p&gt;Another failure instance:&lt;br/&gt;
&lt;a href=&quot;https://testing.hpdd.intel.com/test_sets/e28d061c-5b13-11e4-9c62-5254006e85c2&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.hpdd.intel.com/test_sets/e28d061c-5b13-11e4-9c62-5254006e85c2&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="97487" author="adilger" created="Fri, 24 Oct 2014 23:38:37 +0000"  >&lt;p&gt;Another failure &lt;a href=&quot;https://testing.hpdd.intel.com/test_sets/729ab070-5b5e-11e4-95e9-5254006e85c2&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.hpdd.intel.com/test_sets/729ab070-5b5e-11e4-95e9-5254006e85c2&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Di, Fan Yong,&lt;br/&gt;
it looks like readdir is failing for some reason (e.g. looping and returning the same entry hundreds of times?) and that is why &quot;rm&quot; is returning an error for all of the later entries.  Is this statahead, or DNE going wrong?  Unfortunately, we don&apos;t have any debug logs to tell us what is going on.&lt;/p&gt;</comment>
                            <comment id="97488" author="adilger" created="Fri, 24 Oct 2014 23:48:03 +0000"  >&lt;p&gt;It looks like these failures are all happening on ZFS.  I don&apos;t think the lustre-rsync-test test_2b looks like the same symptom AFAICS.&lt;/p&gt;

&lt;p&gt;Isn&apos;t there another bug open with ZFS directory iteration broken?  Maybe they are related?&lt;/p&gt;</comment>
                            <comment id="97956" author="isaac" created="Thu, 30 Oct 2014 17:20:18 +0000"  >&lt;p&gt;I ran &quot;sanity --only 123a,123b&quot; 100 times and couldn&apos;t reproduce it. I was using build lustre-b2_5/96/ with 1 OSS (2 OSTs), 1 MDS, and 1 client. Is there any other build/configuration I should try?&lt;/p&gt;</comment>
                            <comment id="97991" author="isaac" created="Thu, 30 Oct 2014 20:50:21 +0000"  >&lt;p&gt;Another 50 repetitions, and I still can&apos;t reproduce it, although the eagle VMs I used all had just 1 CPU.&lt;/p&gt;</comment>
                            <comment id="97992" author="adilger" created="Thu, 30 Oct 2014 21:11:09 +0000"  >&lt;p&gt;Isaac, I think you should be testing with master and ZFS 0.6.3, and not b2_5, since all of the failures I&apos;ve seen have been on master so far.&lt;/p&gt;

&lt;p&gt;Maybe a patch should be landed to sanity.sh test_123&lt;span class=&quot;error&quot;&gt;&amp;#91;ab&amp;#93;&lt;/span&gt; to help diagnose the problem if it happens again under review testing?  For example, doing an &quot;ls&quot; of the directory after the test is done, always running &quot;rm&quot; under strace and logging it to a file that is attached to Maloo so that we can see if the problem is in readdir() data returned to userspace, etc.  There have been 8 failures in the last 165 review-zfs tests in the past week, so I think if a patch is landed to sanity.sh on master it should be possible to get more information within a day or two.  I don&apos;t think just running review testing on the patch itself is likely to see problems, unless we run sanity.sh 20x in a loop with &lt;tt&gt;Test-Parameters:&lt;/tt&gt;.&lt;/p&gt;</comment>
                            <comment id="98009" author="yong.fan" created="Thu, 30 Oct 2014 23:25:03 +0000"  >&lt;p&gt;Isaac, another possible factor is that our Maloo test clusters run a lot of VMs on a single node, so the system load on the Maloo clusters should be much higher than in our personal test environments. If we can simulate a test environment similar to the Maloo clusters, it may help to reproduce the issue locally.&lt;/p&gt;</comment>
                            <comment id="100364" author="isaac" created="Mon, 1 Dec 2014 22:31:44 +0000"  >&lt;p&gt;With a build on master, I&apos;m now able to reproduce it almost 100%. I&apos;ll look into it.&lt;/p&gt;</comment>
                            <comment id="100472" author="isaac" created="Tue, 2 Dec 2014 19:32:00 +0000"  >&lt;p&gt;In one test today, test_123a took 18127 seconds to complete (more than 100x the usual time on the same hardware), and there were 11575754 &quot;no such file&quot; errors in the test log.&lt;/p&gt;</comment>
                            <comment id="101757" author="adilger" created="Tue, 16 Dec 2014 21:00:34 +0000"  >&lt;p&gt;Is it possible that this problem is the same as &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-3573&quot; title=&quot;lustre-rsync-test test_8: @@@@@@ FAIL: Failure in replication; differences found. &quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-3573&quot;&gt;&lt;del&gt;LU-3573&lt;/del&gt;&lt;/a&gt; and was fixed by &lt;a href=&quot;http://review.whamcloud.com/12904&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/12904&lt;/a&gt; ?&lt;/p&gt;

&lt;p&gt;The most recent failure &lt;a href=&quot;https://testing.hpdd.intel.com/test_sets/97ea0fd0-84f6-11e4-a60f-5254006e85c2&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.hpdd.intel.com/test_sets/97ea0fd0-84f6-11e4-a60f-5254006e85c2&lt;/a&gt; was on a patch based on a tree that doesn&apos;t contain the &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-3573&quot; title=&quot;lustre-rsync-test test_8: @@@@@@ FAIL: Failure in replication; differences found. &quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-3573&quot;&gt;&lt;del&gt;LU-3573&lt;/del&gt;&lt;/a&gt; fix.&lt;/p&gt;

&lt;p&gt;Before that, the most recent test failure was &lt;a href=&quot;https://testing.hpdd.intel.com/test_sets/863b9434-fcb2-11e2-9222-52540035b04c&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.hpdd.intel.com/test_sets/863b9434-fcb2-11e2-9222-52540035b04c&lt;/a&gt; on 2014-08-03.&lt;/p&gt;</comment>
                            <comment id="102155" author="adilger" created="Sun, 21 Dec 2014 04:10:46 +0000"  >&lt;p&gt;Saw this again on a recent patch run:&lt;br/&gt;
&lt;a href=&quot;https://testing.hpdd.intel.com/test_sets/e55f7412-881b-11e4-aa28-5254006e85c2&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.hpdd.intel.com/test_sets/e55f7412-881b-11e4-aa28-5254006e85c2&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://testing.hpdd.intel.com/test_sets/c69c83ca-87f9-11e4-a70f-5254006e85c2&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.hpdd.intel.com/test_sets/c69c83ca-87f9-11e4-a70f-5254006e85c2&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="103500" author="adilger" created="Wed, 14 Jan 2015 18:31:07 +0000"  >&lt;p&gt;This might also be related to &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-6101&quot; title=&quot;sanity test_24A: Can not delete directories&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-6101&quot;&gt;&lt;del&gt;LU-6101&lt;/del&gt;&lt;/a&gt;, which also has a patch. &lt;/p&gt;</comment>
                            <comment id="103501" author="adilger" created="Wed, 14 Jan 2015 18:32:17 +0000"  >&lt;p&gt;Haven&apos;t seen this again for the past 4 weeks. &lt;/p&gt;

&lt;p&gt;Closing it again. Maybe &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-6101&quot; title=&quot;sanity test_24A: Can not delete directories&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-6101&quot;&gt;&lt;del&gt;LU-6101&lt;/del&gt;&lt;/a&gt; will fix the final trigger.  &lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                            <outwardlinks description="duplicates">
                                        <issuelink>
            <issuekey id="19751">LU-3573</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="28106">LU-6101</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzwyx3:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>16184</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>