<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:38:09 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-3929] 2.1.6-&gt;2.4.1 rolling upgrade: lustre-MDT0000: recovery is timed out, evict stale exports</title>
                <link>https://jira.whamcloud.com/browse/LU-3929</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;While performing rolling upgrade from Lustre 2.1.6 to 2.4.1 RC2 with the path of OSS-&amp;gt;MDS-&amp;gt;Client one by one, the test failed after upgrading MDS:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Starting the MDS service on fat-amd-3...
----------------
fat-amd-3
----------------
debug=-1
subsystem_debug=all -lnet -lnd -pinger
debug_mb=100
pdsh -l root -t 100 -S -w fat-amd-3 &quot;mkdir -p /mnt/mds1 &amp;amp;&amp;amp; mount -t lustre -o user_xattr /dev/sdc1 /mnt/mds1&quot;
Waiting 895 secs for fat-amd-3 recovery done. status: RECOVERING
&amp;lt;~snip~&amp;gt;
Waiting 5 secs for fat-amd-3 recovery done. status: RECOVERING
Waiting 0 secs for fat-amd-3 recovery done. status: RECOVERING
fat-amd-3 recovery not done in 900 sec. status: RECOVERING
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;On MDS fat-amd-3, &quot;lctl get_param -n &lt;b&gt;.&lt;/b&gt;.recovery_status&quot; showed that:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;----------------
fat-amd-3
----------------
status: RECOVERING
recovery_start: 1378874775
time_remaining: 0
connected_clients: 2/4
req_replay_clients: 0
lock_repay_clients: 0
completed_clients: 2
evicted_clients: 0
replayed_requests: 0
queued_requests: 0
next_transno: 4294967297
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Console log on MDS showed that:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 4 clients reconnect
Lustre: lustre-MDT0000: recovery is timed out, evict stale exports
Lustre: lustre-MDT0000: disconnecting 2 stale clients
Lustre: lustre-MDT0000: recovery is timed out, evict stale exports
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Maloo reports:&lt;br/&gt;
&lt;a href=&quot;https://maloo.whamcloud.com/test_sets/d91a2b68-1aa1-11e3-88ff-52540035b04c&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://maloo.whamcloud.com/test_sets/d91a2b68-1aa1-11e3-88ff-52540035b04c&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://maloo.whamcloud.com/test_sets/dae9450c-1a86-11e3-8ceb-52540035b04c&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://maloo.whamcloud.com/test_sets/dae9450c-1a86-11e3-8ceb-52540035b04c&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The same failure also occurred while rolling upgrade from Lustre 2.1.6 to 2.4.0:&lt;br/&gt;
&lt;a href=&quot;https://maloo.whamcloud.com/test_sets/c70af506-1ab5-11e3-8898-52540035b04c&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://maloo.whamcloud.com/test_sets/c70af506-1ab5-11e3-8898-52540035b04c&lt;/a&gt;&lt;/p&gt;</description>
                <environment></environment>
        <key id="20901">LU-3929</key>
            <summary>2.1.6-&gt;2.4.1 rolling upgrade: lustre-MDT0000: recovery is timed out, evict stale exports</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="hongchao.zhang">Hongchao Zhang</assignee>
                                    <reporter username="yujian">Jian Yu</reporter>
                        <labels>
                            <label>mn4</label>
                    </labels>
                <created>Wed, 11 Sep 2013 08:44:30 +0000</created>
                <updated>Mon, 7 Jul 2014 17:06:47 +0000</updated>
                            <resolved>Mon, 6 Jan 2014 14:17:31 +0000</resolved>
                                    <version>Lustre 2.4.0</version>
                    <version>Lustre 2.4.1</version>
                    <version>Lustre 2.5.0</version>
                                    <fixVersion>Lustre 2.5.1</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>7</watches>
                                                                            <comments>
                            <comment id="66542" author="pjones" created="Thu, 12 Sep 2013 19:43:11 +0000"  >&lt;p&gt;Hongchao&lt;/p&gt;

&lt;p&gt;Could you please make an assessment of this issue?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="66578" author="hongchao.zhang" created="Fri, 13 Sep 2013 11:24:18 +0000"  >&lt;p&gt;this issue is related to the LWP(Light Weight Proxy) connection.&lt;br/&gt;
in b2_1_*, the LWP connection is treated as normal clients and one client data &quot;tg_export_data&quot; is allocated for it, then there will be one more export&lt;br/&gt;
to be recover during recovery at MDT, but in b2_4_*, MDT won&apos;t allocate client data for LWP connection, which doesn&apos;t need recover.&lt;/p&gt;</comment>
                            <comment id="69399" author="sebastien.buisson" created="Mon, 21 Oct 2013 14:41:28 +0000"  >&lt;p&gt;Hi,&lt;/p&gt;

&lt;p&gt;We are suffering from this error, which is very annoying in case a customer wants to upgrade its OSSes first and then its MDSes and clients.&lt;/p&gt;

&lt;p&gt;This ticket was opened a month ago, but did not make any progress since then. This is surprising, as I would tend to consider it a major issue on the upgrade path from 2.1 to 2.4 (and 2.5 too). Am I missing something?&lt;/p&gt;

&lt;p&gt;Sebastien.&lt;/p&gt;</comment>
                            <comment id="69756" author="hongchao.zhang" created="Thu, 24 Oct 2013 14:56:19 +0000"  >&lt;p&gt;status update:&lt;br/&gt;
the patch is under testing and will be pushed to Gerrit soon. Thanks&lt;/p&gt;</comment>
                            <comment id="70093" author="hongchao.zhang" created="Tue, 29 Oct 2013 03:58:51 +0000"  >&lt;p&gt;the patch is against b2_1, and is tracked at &lt;a href=&quot;http://review.whamcloud.com/#/c/8086/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#/c/8086/&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="71867" author="hongchao.zhang" created="Tue, 19 Nov 2013 11:05:30 +0000"  >&lt;p&gt;the patch against master is tracked at &lt;a href=&quot;http://review.whamcloud.com/#/c/8328/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#/c/8328/&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="71887" author="sebastien.buisson" created="Tue, 19 Nov 2013 14:48:54 +0000"  >&lt;p&gt;Hi, I have just tested patch &lt;a href=&quot;http://review.whamcloud.com/#/c/8086/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#/c/8086/&lt;/a&gt; for b2_1, and it works fine. I mean rolling upgrade from Lustre 2.1.6 plus this patch to 2.4.1 went off smoothly.&lt;/p&gt;

&lt;p&gt;So now I am wondering what is the purpose of this new patch &lt;a href=&quot;http://review.whamcloud.com/#/c/8328/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#/c/8328/&lt;/a&gt; for master.&lt;/p&gt;

&lt;p&gt;Sebastien.&lt;/p&gt;</comment>
                            <comment id="72117" author="hongchao.zhang" created="Fri, 22 Nov 2013 10:02:05 +0000"  >&lt;p&gt;it could allow more previous Lustre version to upgrade to new version with the patch against master.&lt;/p&gt;</comment>
                            <comment id="72118" author="sebastien.buisson" created="Fri, 22 Nov 2013 10:16:48 +0000"  >&lt;p&gt;Do you mean the master patch &lt;b&gt;alone&lt;/b&gt; would be enough to be able to successfully upgrade from 2.1 with the path OSS-&amp;gt;MDS-&amp;gt;Client?&lt;/p&gt;</comment>
                            <comment id="72186" author="green" created="Sat, 23 Nov 2013 14:23:38 +0000"  >&lt;p&gt;Yes, the master patch alone should be enough to allow upgrades from unpatched 2.1 OSTS (i.e. those that do not have 8086 patch present). Can you give such a combination a try, please?&lt;/p&gt;

&lt;p&gt;We believe it&apos;s a better way since it saves you one extra step of upgrading all your OSTS to 2.1.6+patch before you can update your MDS to 2.4+ and then update your OSTs again to 2.4+ too (which is kind of overkill).&lt;/p&gt;</comment>
                            <comment id="72302" author="sebastien.buisson" created="Tue, 26 Nov 2013 14:37:16 +0000"  >&lt;p&gt;Hi,&lt;/p&gt;

&lt;p&gt;Here is the test I carried out:&lt;/p&gt;
&lt;ul class=&quot;alternate&quot; type=&quot;square&quot;&gt;
	&lt;li&gt;full file system installed with stock 2.1.6&lt;/li&gt;
	&lt;li&gt;upgrade to 2.4.1 + patch &lt;a href=&quot;http://review.whamcloud.com/8328&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/8328&lt;/a&gt; with the path OSS-&amp;gt;MDS-&amp;gt;Client&lt;/li&gt;
&lt;/ul&gt;


&lt;p&gt;It went off smoothly. So I confirm that the master patch is enough. And, as Oleg explained, having the patch in the target version simplifies upgrade.&lt;/p&gt;

&lt;p&gt;Cheers,&lt;br/&gt;
Sebastien.&lt;/p&gt;</comment>
                            <comment id="74396" author="pjones" created="Mon, 6 Jan 2014 14:17:31 +0000"  >&lt;p&gt;Landed for 2.5.1&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                                                <inwardlinks description="is related to">
                                        <issuelink>
            <issuekey id="25444">LU-5298</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzw1u7:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>10379</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>