<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:43:00 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LUDOC-446] Lustre server rolling upgrades</title>
                <link>https://jira.whamcloud.com/browse/LUDOC-446</link>
                <project id="10070" key="LUDOC">Lustre Documentation</project>
                    <description>&lt;p&gt;The lustre operations manual currently says this:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Lustre software release 2.x.y release (minor) upgrade:
&#8226;   All servers must be upgraded at the same time, while some or all clients may be upgraded.
&#8226;   Rolling upgrades are supported for minor releases allowing individual servers and clients to be upgraded
without stopping the Lustre file system.
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;The first sentence sounds like another way of saying &quot;all the servers must be the same&quot;.  The second sentence sounds like another way of saying &quot;the servers can be different versions, but only temporarily&quot;.&lt;/p&gt;

&lt;p&gt;As long as:&lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;&quot;2.x&quot; part of the version is staying the same&lt;/li&gt;
	&lt;li&gt;&quot;y&quot; part is changing to &quot;y+1&quot; or &quot;y+2&quot;&lt;/li&gt;
	&lt;li&gt;assuming an actual tagged releases&lt;/li&gt;
&lt;/ul&gt;


&lt;p&gt;Can we update update the targets within a single file system one at a time?  If so, I&apos;ll propose a patch with what I think is better wording.&lt;/p&gt;</description>
                <environment></environment>
        <key id="56006">LUDOC-446</key>
            <summary>Lustre server rolling upgrades</summary>
                <type id="9" iconUrl="https://jira.whamcloud.com/images/icons/issuetypes/undefined.png">Question/Request</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="1" iconUrl="https://jira.whamcloud.com/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="default"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="ofaaland">Olaf Faaland</assignee>
                                    <reporter username="ofaaland">Olaf Faaland</reporter>
                        <labels>
                            <label>llnl</label>
                    </labels>
                <created>Thu, 20 Jun 2019 22:13:18 +0000</created>
                <updated>Mon, 26 Aug 2019 03:30:16 +0000</updated>
                                                                                <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                            <comments>
                            <comment id="249933" author="adilger" created="Tue, 25 Jun 2019 03:36:36 +0000"  >&lt;p&gt;Olaf, indeed it is possible to upgrade servers one-at-a-time, but I can&apos;t imagine why you&apos;d want to?  If one server is down temporarily for an upgrade, that means files on that server are temporarily inaccessible and clients will block until the server is restarted and recovery completes.  The client recovery time will be pretty much the same whether a single server is upgraded or half/all of the servers are upgraded, but you would have many more recovery periods if you only upgrade one OSS at a time.&lt;/p&gt;

&lt;p&gt;The recommended process for rolling upgrades is to failover half of the MDT+OST targets to to their backup nodes, upgrade the now-idle MDS+OSS nodes to the new release, then failover &lt;b&gt;all&lt;/b&gt; of the targets to the just-upgraded nodes, upgrade the other half of the MDS+OSS nodes, and fail back half of the targets to their original nodes.  This involves 3 recovery periods, or correspondingly more if you have e.g. N-way HA failover clusters and are failing 1/N of the targets at a time.&lt;/p&gt;

&lt;p&gt;With FLR mirror/EC we will eventually be able to do this without any outage, but that would need all of the files (or at least all of the in-use files) to be redundant in some way.  Conceivably, we could mirror all of the files using OSTs on a particular OSS before doing the upgrade, then un-mirror them afterward, but that would be relatively slow.&lt;/p&gt;

&lt;p&gt;In any case, I&apos;m all for improving the readability of the manual, so feel free to suggest better wording.&lt;/p&gt;</comment>
                            <comment id="253555" author="ofaaland" created="Mon, 26 Aug 2019 00:43:15 +0000"  >&lt;p&gt;Hi Andreas,&lt;br/&gt;
Thanks in arrears.  That makes sense, and I haven&apos;t gotten around to my proposed wording, but I will. &lt;/p&gt;

&lt;p&gt;Peter,&lt;br/&gt;
 I can&apos;t remove the topllnl label in this project, it appears, and can&apos;t assign this to myself.  Can you make those changes, or make it so I can (either is fine with me)?&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                    <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i00ijz:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>