<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:04:28 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-6924] remote regular file are missing after recovery.</title>
                <link>https://jira.whamcloud.com/browse/LU-6924</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;In 24 hours DNE failover test. I found this on one of the MDT, &lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;LustreError: 2758:0:(client.c:2869:ptlrpc_replay_interpret()) @@@ status -110, old was 0  req@ffff880feb148cc0 x1507974808149044/t25771723485(25771723485) o1000-&amp;gt;lustre-MDT0003-osp-MDT0001@192.168.2.128@o2ib:24/4 lens 248/16576 e 1 to 0 dl 1438129486 ref 2 fl Interpret:R/4/0 rc -110/-110
Lustre: lustre-MDT0003-osp-MDT0001: Connection restored to lustre-MDT0003 (at 192.168.2.128@o2ib)
LustreError: 3117:0:(mdt_open.c:1171:mdt_cross_open()) lustre-MDT0001: [0x240000406:0x167f1:0x0] doesn&apos;t exist!: rc = -14
Lustre: DEBUG MARKER: ==== Checking the clients loads BEFORE failover -- failure NOT OK ELAPSED=27221 DURATION=86400 PERIOD=1800
Lustre: DEBUG MARKER: Client load failed on node c05, rc=1
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Then on the client side, which cause dbench fails&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;   2      7136     0.00 MB/sec  execute 191 sec  latency 272510.369 ms
   2      7136     0.00 MB/sec  execute 192 sec  latency 273510.512 ms
   2      7136     0.00 MB/sec  execute 193 sec  latency 274510.637 ms
   2      7136     0.00 MB/sec  execute 194 sec  latency 275510.799 ms
   2      7136     0.00 MB/sec  execute 195 sec  latency 276510.916 ms
   2      7136     0.00 MB/sec  execute 196 sec  latency 277511.069 ms
   2      7136     0.00 MB/sec  execute 197 sec  latency 278511.229 ms
   2      7136     0.00 MB/sec  execute 198 sec  latency 279511.387 ms
   2      7330     0.00 MB/sec  execute 199 sec  latency 280182.929 ms
[9431] open ./clients/client1/~dmtmp/EXCEL/RESULTS.XLS failed for handle 11887 (Bad address)
(9432) ERROR: handle 11887 was not found
Child failed with status 1
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Then the test fails.&lt;/p&gt;</description>
                <environment></environment>
        <key id="31262">LU-6924</key>
            <summary>remote regular file are missing after recovery.</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="di.wang">Di Wang</assignee>
                                    <reporter username="di.wang">Di Wang</reporter>
                        <labels>
                    </labels>
                <created>Wed, 29 Jul 2015 07:31:09 +0000</created>
                <updated>Wed, 26 Aug 2015 07:47:10 +0000</updated>
                            <resolved>Thu, 13 Aug 2015 00:07:52 +0000</resolved>
                                    <version>Lustre 2.8.0</version>
                                    <fixVersion>Lustre 2.8.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                            <comments>
                            <comment id="122614" author="di.wang" created="Wed, 29 Jul 2015 21:36:01 +0000"  >&lt;p&gt;Hmm, I do not have enough debug to know what happens, but it mostly like &lt;/p&gt;

&lt;p&gt;1. MDS02 do remote unlink, so it destroy local object, then delete the remote name entry on MDS04. &lt;br/&gt;
2. But MDS04 restarts at the moment, after it restarts, it will wait all clients connected, then collecting the debug log.&lt;br/&gt;
3. After MDS02 reconnects to MDS04, it will send replay unlink to MDS04, MDS04 got the unlink request and wait for the BULK.&lt;br/&gt;
4. At the same time, MDS02 evict MDS04&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Lustre: lustre-MDT0001: already connected client lustre-MDT0003-mdtlov_UUID (at 192.168.2.128@o2ib) with handle 0x2e45787e4dd12a1. Rejecting client with the same UUID trying to reconnect with handle 0xf0284dfd774c7787
Lustre: lustre-MDT0001: haven&apos;t heard from client lustre-MDT0003-mdtlov_UUID (at 192.168.2.128@o2ib) in 228 seconds. I think it&apos;s dead, and I am evicting it. exp ffff880ffa181c00, cur 1438129440 expire 1438129290 last 1438129212
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt; 
&lt;p&gt;5. MDS04 failed on waiting bulk transfer&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;LustreError: 2792:0:(ldlm_lib.c:3041:target_bulk_io()) @@@ network error on bulk WRITE  req@ffff880827864850 x1507974808149044/t0(25771723485) o1000-&amp;gt;lustre-MDT0001-mdtlov_UUID@192.168.2.126@o2ib:219/0 lens 248/16608 e 1 to 0 dl 1438129504 ref 1 fl Complete:/4/0 rc 0/0
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;6. MDS02 failed on this unlink replay&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;LustreError: 2758:0:(client.c:2869:ptlrpc_replay_interpret()) @@@ status -110, old was 0  req@ffff880feb148cc0 x1507974808149044/t25771723485(25771723485) o1000-&amp;gt;lustre-MDT0003-osp-MDT0001@192.168.2.128@o2ib:24/4 lens 248/16576 e 1 to 0 dl 1438129486 ref 2 fl Interpret:R/4/0 rc -110/-110
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;7 Because MDS02 already got reply of this replay (note: this is bulk replay), so it will not replay this request again. (see ptlrpc_replay_interpret()).&lt;/p&gt;</comment>
                            <comment id="122616" author="di.wang" created="Wed, 29 Jul 2015 21:49:49 +0000"  >&lt;p&gt;So the easiest fix might be in step 7.  If it is bulk replay, and even though the server get the request, but the bulk transfer timeout, then we will still resend the replay request.&lt;/p&gt;</comment>
                            <comment id="122618" author="gerrit" created="Wed, 29 Jul 2015 22:04:04 +0000"  >&lt;p&gt;wangdi (di.wang@intel.com) uploaded a new patch: &lt;a href=&quot;http://review.whamcloud.com/15793&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/15793&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-6924&quot; title=&quot;remote regular file are missing after recovery.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-6924&quot;&gt;&lt;del&gt;LU-6924&lt;/del&gt;&lt;/a&gt; ptlrpc: replay bulk request&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 722bbb86fa5479bdc16b62b43863eff39a61df56&lt;/p&gt;</comment>
                            <comment id="124006" author="gerrit" created="Thu, 13 Aug 2015 00:05:34 +0000"  >&lt;p&gt;Oleg Drokin (oleg.drokin@intel.com) merged in patch &lt;a href=&quot;http://review.whamcloud.com/15793/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/15793/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-6924&quot; title=&quot;remote regular file are missing after recovery.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-6924&quot;&gt;&lt;del&gt;LU-6924&lt;/del&gt;&lt;/a&gt; ptlrpc: replay bulk request&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 0addfa9fa1d48cc9fa5eb05026848e55382f81a8&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                                                <inwardlinks description="is related to">
                                        <issuelink>
            <issuekey id="31033">LU-6831</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="31147">LU-6883</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzxj8v:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>