<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:15:04 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-1262] mkdir followed by rmdir on a different client fails -- Object doesn&apos;t exist!</title>
                <link>https://jira.whamcloud.com/browse/LU-1262</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Customer creates a directory on one node, puts a file in that directory.  Then, on a different client, tries to recursively remove the directory.  Then, back on the first client, tries making the directory again.  This fails. &lt;/p&gt;

&lt;p&gt;Here are the exact steps:&lt;/p&gt;

&lt;p&gt;usrs400 $ mkdir /mnt/lustre/foo&lt;br/&gt;
usrs400 $ touch /mnt/lustre/foo/bar&lt;/p&gt;

&lt;p&gt;usrs399 $ rm -rf /mnt/lustre/foo&lt;/p&gt;

&lt;p&gt;usrs400 $ mkdir /mnt/lustre/foo&lt;br/&gt;
mkdir: cannot create directory `foo&apos;: File exists&lt;br/&gt;
usrs400 $ ls /mnt/lustre/foo&lt;br/&gt;
ls: cannot access foo: No such file or directory&lt;br/&gt;
usrs400 $ mkdir /mnt/lustre/foo&lt;br/&gt;
mkdir: cannot create directory `foo&apos;: File exists&lt;br/&gt;
usrs400 $ rmdir /mnt/lustre/foo&lt;br/&gt;
rmdir: failed to remove `foo&apos;: No such file or directory&lt;br/&gt;
usrs400 $ mkdir /mnt/lustre/foo&lt;/p&gt;


&lt;p&gt;The customer has waited 10 minutes for this to complete.  For the following output from /var/log/messages, the customer only waited a second or two.  Also, the customer unmounted and remounted the clients, to keep things simpler.&lt;/p&gt;


&lt;p&gt;usrs400:/var/log/messages:&lt;/p&gt;

&lt;p&gt;Mar 27 12:26:57 usrs400 kernel: &lt;span class=&quot;error&quot;&gt;&amp;#91;677137.616534&amp;#93;&lt;/span&gt; Lustre: Lustre: Build Version: ../lustre/scripts-20120222220600-PRISTINE-../lustre/scripts&lt;br/&gt;
Mar 27 12:26:57 usrs400 kernel: &lt;span class=&quot;error&quot;&gt;&amp;#91;677137.758165&amp;#93;&lt;/span&gt; Lustre: Added LNI 192.168.185.6@tcp &lt;span class=&quot;error&quot;&gt;&amp;#91;8/256/0/180&amp;#93;&lt;/span&gt;&lt;br/&gt;
Mar 27 12:26:57 usrs400 kernel: &lt;span class=&quot;error&quot;&gt;&amp;#91;677137.758237&amp;#93;&lt;/span&gt; Lustre: Accept secure, port 988&lt;br/&gt;
Mar 27 12:27:01 usrs400 kernel: &lt;span class=&quot;error&quot;&gt;&amp;#91;677141.529389&amp;#93;&lt;/span&gt; Lustre: MGC192.168.185.35@tcp: Reactivating import&lt;br/&gt;
Mar 27 12:27:01 usrs400 kernel: &lt;span class=&quot;error&quot;&gt;&amp;#91;677141.568728&amp;#93;&lt;/span&gt; LustreError: 31684:0:(obd_config.c:1147:class_process_proc_param()) writing proc entry checksum_pages err -11&lt;br/&gt;
Mar 27 12:27:01 usrs400 kernel: &lt;span class=&quot;error&quot;&gt;&amp;#91;677141.774591&amp;#93;&lt;/span&gt; Lustre: Client xxxxxx-client has started&lt;br/&gt;
Mar 27 12:27:32 usrs400 kernel: &lt;span class=&quot;error&quot;&gt;&amp;#91;677173.081643&amp;#93;&lt;/span&gt; LustreError: 31698:0:(file.c:2228:ll_inode_revalidate_fini()) failure -2 inode 144115239078592513&lt;br/&gt;
Mar 27 12:27:32 usrs400 kernel: &lt;span class=&quot;error&quot;&gt;&amp;#91;677173.208438&amp;#93;&lt;/span&gt; LustreError: 31698:0:(file.c:2228:ll_inode_revalidate_fini()) failure -2 inode 144115239078592513&lt;/p&gt;


&lt;p&gt;usrs399:/var/log/messages:&lt;/p&gt;

&lt;p&gt;Mar 27 12:26:20 usrs399 kernel: &lt;span class=&quot;error&quot;&gt;&amp;#91;677221.853364&amp;#93;&lt;/span&gt; Lustre: Lustre: Build Version: ../lustre/scripts-20120222220600-PRISTINE-../lustre/scripts&lt;br/&gt;
Mar 27 12:26:20 usrs399 kernel: &lt;span class=&quot;error&quot;&gt;&amp;#91;677221.993703&amp;#93;&lt;/span&gt; Lustre: Added LNI 192.168.185.7@tcp &lt;span class=&quot;error&quot;&gt;&amp;#91;8/256/0/180&amp;#93;&lt;/span&gt;&lt;br/&gt;
Mar 27 12:26:20 usrs399 kernel: &lt;span class=&quot;error&quot;&gt;&amp;#91;677221.993777&amp;#93;&lt;/span&gt; Lustre: Accept secure, port 988&lt;br/&gt;
Mar 27 12:26:39 usrs399 kernel: &lt;span class=&quot;error&quot;&gt;&amp;#91;677240.559658&amp;#93;&lt;/span&gt; Lustre: MGC192.168.185.35@tcp: Reactivating import&lt;br/&gt;
Mar 27 12:26:39 usrs399 kernel: &lt;span class=&quot;error&quot;&gt;&amp;#91;677240.592294&amp;#93;&lt;/span&gt; LustreError: 621:0:(obd_config.c:1147:class_process_proc_param()) writing proc entry checksum_pages err -11&lt;br/&gt;
Mar 27 12:26:39 usrs399 kernel: &lt;span class=&quot;error&quot;&gt;&amp;#91;677240.825955&amp;#93;&lt;/span&gt; Lustre: Client xxxxxx-client has started&lt;/p&gt;


&lt;p&gt;On the MDS (I think that there is some clock skew):&lt;/p&gt;

&lt;p&gt;Mar 27 12:26:39 ts-xxxxxxxx-01 kernel: Lustre: 2523:0:(ldlm_lib.c:877:target_handle_connect()) MGS: connection from b92afcf0-1504-ed4b-819e-d31039236758@192.168.185.7@tcp t0 exp (null) cur 1332851199 last 0&lt;br/&gt;
Mar 27 12:26:39 ts-xxxxxxxx-01 kernel: Lustre: 2523:0:(sec.c:1474:sptlrpc_import_sec_adapt()) import MGS-&amp;gt;NET_0x20000c0a8b907_UUID netid 20000: select flavor null&lt;br/&gt;
Mar 27 12:26:39 ts-xxxxxxxx-01 kernel: Lustre: 10205:0:(ldlm_lib.c:877:target_handle_connect()) xxxxxx-MDT0000: connection from 11909088-d4a2-77fa-030f-1c9e2a493436@192.168.185.7@tcp t0 exp (null) cur 1332851199 last 0&lt;br/&gt;
Mar 27 12:26:39 ts-xxxxxxxx-01 kernel: Lustre: 10205:0:(sec.c:1474:sptlrpc_import_sec_adapt()) import xxxxxx-MDT0000-&amp;gt;NET_0x20000c0a8b907_UUID netid 20000: select flavor null&lt;br/&gt;
Mar 27 12:27:01 ts-xxxxxxxx-01 kernel: Lustre: 2523:0:(ldlm_lib.c:877:target_handle_connect()) MGS: connection from 8927fc00-6820-4bea-63ad-35146f43bb3e@192.168.185.6@tcp t0 exp (null) cur 1332851221 last 0&lt;br/&gt;
Mar 27 12:27:01 ts-xxxxxxxx-01 kernel: Lustre: 2523:0:(sec.c:1474:sptlrpc_import_sec_adapt()) import MGS-&amp;gt;NET_0x20000c0a8b906_UUID netid 20000: select flavor null&lt;br/&gt;
Mar 27 12:27:32 ts-xxxxxxxx-01 kernel: Lustre: 2541:0:(mdt_handler.c:1010:mdt_getattr_name_lock()) header@ffff88061fac1ec0[0x0, 1, &lt;span class=&quot;error&quot;&gt;&amp;#91;0x200000be0:0x1:0x0&amp;#93;&lt;/span&gt; hash lru]&lt;/p&gt;
{
Mar 27 12:27:32 ts-xxxxxxxx-01 kernel: Lustre: 2541:0:(mdt_handler.c:1010:mdt_getattr_name_lock()) ....mdt@ffff88061fac1f18mdt-object@ffff88061fac1ec0(ioepoch=0 flags=0x0, epochcount=0, writecount=0)
Mar 27 12:27:32 ts-xxxxxxxx-01 kernel: Lustre: 2541:0:(mdt_handler.c:1010:mdt_getattr_name_lock()) ....cmm@ffff88061fbc83c0[local]
Mar 27 12:27:32 ts-xxxxxxxx-01 kernel: Lustre: 2541:0:(mdt_handler.c:1010:mdt_getattr_name_lock()) ....mdd@ffff88061fd80380mdd-object@ffff88061fd80380(open_count=0, valid=0, cltime=0, flags=0)
Mar 27 12:27:32 ts-xxxxxxxx-01 kernel: Lustre: 2541:0:(mdt_handler.c:1010:mdt_getattr_name_lock()) ....osd-ldiskfs@ffff88061e88b180osd-ldiskfs-object@ffff88061e88b180(i:(null):0/0)[plain]
Mar 27 12:27:32 ts-xxxxxxxx-01 kernel: Lustre: 2541:0:(mdt_handler.c:1010:mdt_getattr_name_lock()) }
&lt;p&gt; header@ffff88061fac1ec0&lt;br/&gt;
Mar 27 12:27:32 ts-xxxxxxxx-01 kernel: Lustre: 2541:0:(mdt_handler.c:1010:mdt_getattr_name_lock()) Object doesn&apos;t exist!&lt;br/&gt;
Mar 27 12:27:32 ts-xxxxxxxx-01 kernel: Lustre: 2541:0:(mdt_handler.c:1010:mdt_getattr_name_lock()) header@ffff88061fac1ec0[0x0, 1, &lt;span class=&quot;error&quot;&gt;&amp;#91;0x200000be0:0x1:0x0&amp;#93;&lt;/span&gt; hash lru]&lt;/p&gt;
{
Mar 27 12:27:32 ts-xxxxxxxx-01 kernel: Lustre: 2541:0:(mdt_handler.c:1010:mdt_getattr_name_lock()) ....mdt@ffff88061fac1f18mdt-object@ffff88061fac1ec0(ioepoch=0 flags=0x0, epochcount=0, writecount=0)
Mar 27 12:27:32 ts-xxxxxxxx-01 kernel: Lustre: 2541:0:(mdt_handler.c:1010:mdt_getattr_name_lock()) ....cmm@ffff88061fbc83c0[local]
Mar 27 12:27:32 ts-xxxxxxxx-01 kernel: Lustre: 2541:0:(mdt_handler.c:1010:mdt_getattr_name_lock()) ....mdd@ffff88061fd80380mdd-object@ffff88061fd80380(open_count=0, valid=0, cltime=0, flags=0)
Mar 27 12:27:32 ts-xxxxxxxx-01 kernel: Lustre: 2541:0:(mdt_handler.c:1010:mdt_getattr_name_lock()) ....osd-ldiskfs@ffff88061e88b180osd-ldiskfs-object@ffff88061e88b180(i:(null):0/0)[plain]
Mar 27 12:27:32 ts-xxxxxxxx-01 kernel: Lustre: 2541:0:(mdt_handler.c:1010:mdt_getattr_name_lock()) }
&lt;p&gt; header@ffff88061fac1ec0&lt;br/&gt;
Mar 27 12:27:32 ts-xxxxxxxx-01 kernel: Lustre: 2541:0:(mdt_handler.c:1010:mdt_getattr_name_lock()) Object doesn&apos;t exist!&lt;/p&gt;




</description>
                <environment>Lustre servers are running 2.6.32-220.el6, with Lustre 2.1.1.rc4.&lt;br/&gt;
Lustre clients are running 2.6.38.2, with special code created for this release, with &lt;a href=&quot;http://review.whamcloud.com/#change,2170&quot;&gt;http://review.whamcloud.com/#change,2170&lt;/a&gt;.&lt;br/&gt;
</environment>
        <key id="13728">LU-1262</key>
            <summary>mkdir followed by rmdir on a different client fails -- Object doesn&apos;t exist!</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="3">Duplicate</resolution>
                                        <assignee username="laisiyao">Lai Siyao</assignee>
                                    <reporter username="rspellman">Roger Spellman</reporter>
                        <labels>
                    </labels>
                <created>Tue, 27 Mar 2012 13:08:57 +0000</created>
                <updated>Mon, 9 Apr 2012 20:49:36 +0000</updated>
                            <resolved>Mon, 9 Apr 2012 20:49:36 +0000</resolved>
                                    <version>Lustre 2.1.1</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>2</watches>
                                                                            <comments>
                            <comment id="32207" author="pjones" created="Tue, 27 Mar 2012 14:35:32 +0000"  >&lt;p&gt;Lai&lt;/p&gt;

&lt;p&gt;Could you please comment on this one?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="32931" author="laisiyao" created="Fri, 30 Mar 2012 12:07:46 +0000"  >&lt;p&gt;This can&apos;t reproduce in master branch, I&apos;ll check the changes made after 2.1.&lt;/p&gt;</comment>
                            <comment id="32932" author="rspellman" created="Fri, 30 Mar 2012 12:18:02 +0000"  >&lt;p&gt;What are you using for a client?&lt;/p&gt;

&lt;p&gt;As stated in the &quot;Environment&quot; section:&lt;/p&gt;

&lt;p&gt;Lustre clients are running 2.6.38.2, with special code created for this release, with &lt;a href=&quot;http://review.whamcloud.com/#change,2170&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#change,2170&lt;/a&gt;. &lt;/p&gt;</comment>
                            <comment id="32936" author="rspellman" created="Fri, 30 Mar 2012 13:16:15 +0000"  >&lt;p&gt;Customer reports the following.  &lt;/p&gt;

&lt;p&gt;So a little more playing around and I found out you don&apos;t need to have a file in the directory to have a problem. However, the problem only manifests itself if you use &quot;rm -r&quot; to remove the directory. The difference is that &quot;rmdir&quot; uses the &quot;rmdir()&quot; system call and &quot;rm -r&quot; uses the &quot;unlinkat()&quot; system call.&lt;/p&gt;

&lt;p&gt;I also have a new fun bug for you:&lt;/p&gt;

&lt;p&gt;client1 $ mkdir /mnt/lustre/test&lt;/p&gt;

&lt;p&gt;client2 $ echo foo &amp;gt; /mnt/lustre/test/foo&lt;/p&gt;

&lt;p&gt;client1 $ mv /mnt/lustre/test /mnt/lustre/test2&lt;/p&gt;

&lt;p&gt;client2 $ echo bar &amp;gt; /mnt/lustre/test/bar    #### SUCCEEDS!!!&lt;br/&gt;
client2 $ ls /mnt/lustre/test&lt;br/&gt;
ls: cannot open directory /mnt/lustre/test: No such file or directory&lt;br/&gt;
client2 $ echo baz &amp;gt; /mnt/lustre/test/baz   #### SUCCEEDS!!!&lt;/p&gt;

&lt;p&gt;client1 $ ls /mnt/lustre/test2&lt;br/&gt;
bar  baz  foo&lt;/p&gt;

&lt;p&gt;client2 $ mkdir /mnt/lustre/test&lt;br/&gt;
mkdir: cannot create directory `/mnt/lustre/test&apos;: File exists&lt;/p&gt;

&lt;p&gt;It seems that client2 cached the name to the directory node and used the cache when writing a file, but using ls did not invalidate the cache entry nor did trying to make the directory.&lt;/p&gt;

&lt;p&gt;/var/log/messages in both clients and xxxxxxx-01 (the MDS) have no messages caused by this test.&lt;/p&gt;

&lt;p&gt;I would actually consider this a more serious bug than the first one because our users are likely to have multiple runs that write to the same directory path and they may move the directory for subsequent runs but the old directory will receive the data.&lt;/p&gt;</comment>
                            <comment id="33023" author="laisiyao" created="Fri, 30 Mar 2012 22:57:48 +0000"  >&lt;p&gt;Hmm, kernel &amp;gt;= 2.6.38 uses d_set_d_op() to set dentry operations, and .d_delete is called before refcount is decreased, I&apos;ll update &lt;a href=&quot;http://review.whamcloud.com/#change,2170&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#change,2170&lt;/a&gt; later.&lt;/p&gt;</comment>
                            <comment id="33027" author="laisiyao" created="Fri, 30 Mar 2012 23:46:36 +0000"  >&lt;p&gt;&lt;a href=&quot;http://review.whamcloud.com/#change,2170&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#change,2170&lt;/a&gt; updated, could you try again?&lt;/p&gt;</comment>
                            <comment id="34305" author="pjones" created="Mon, 9 Apr 2012 17:01:16 +0000"  >&lt;p&gt;As per Terascala the latest code fixes this issue&lt;/p&gt;</comment>
                            <comment id="34308" author="rspellman" created="Mon, 9 Apr 2012 17:10:06 +0000"  >&lt;p&gt;Agreed.  This can be closed.&lt;/p&gt;</comment>
                            <comment id="34344" author="pjones" created="Mon, 9 Apr 2012 20:49:36 +0000"  >&lt;p&gt;Landing the patch for this work is tracked under &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-506&quot; title=&quot;FC15  patchless client support.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-506&quot;&gt;&lt;del&gt;LU-506&lt;/del&gt;&lt;/a&gt;&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzvh5b:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>6427</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10021"><![CDATA[2]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>