<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:39:04 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-10888] &apos;lctl abort_recovery&apos; allow aborting recovery between MDTs</title>
                <link>https://jira.whamcloud.com/browse/LU-10888</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;&apos;lctl abort_recovery&apos; doesn&apos;t abort recovery between MDTs, because unlike abort recovery on single MDT system which only fail unfinished operations, this may break system consistency, so as a tradeoff, Lustre chose consistency over availability. But there are two major causes if recovery between MDTs doesn&apos;t finish, the first is network issue, for this type, we can wait indefinitely for network to recover, while the second is software bug, which is difficult for user to fix  manually on backend filesystem.&lt;/p&gt;

&lt;p&gt;Now lfsck is ready, which can fix inconsistency in the system. So we should provide an option to allow user to abort recovery between MDTs, and then fix inconsistencies.&lt;/p&gt;
</description>
                <environment></environment>
        <key id="51708">LU-10888</key>
            <summary>&apos;lctl abort_recovery&apos; allow aborting recovery between MDTs</summary>
                <type id="2" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11311&amp;avatarType=issuetype">New Feature</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="6">Not a Bug</resolution>
                                        <assignee username="hongchao.zhang">Hongchao Zhang</assignee>
                                    <reporter username="laisiyao">Lai Siyao</reporter>
                        <labels>
                    </labels>
                <created>Mon, 9 Apr 2018 07:28:24 +0000</created>
                <updated>Mon, 25 Nov 2019 20:07:50 +0000</updated>
                            <resolved>Mon, 15 Jul 2019 12:26:54 +0000</resolved>
                                    <version>Lustre 2.10.0</version>
                    <version>Lustre 2.11.0</version>
                                                        <due>Tue, 9 Jul 2019 00:00:00 +0000</due>
                            <votes>0</votes>
                                    <watches>5</watches>
                                                                            <comments>
                            <comment id="241315" author="pfarrell" created="Mon, 4 Feb 2019 18:22:01 +0000"  >&lt;p&gt;I think it&apos;s also possible for this lack of MDS-MDS abort_recovery to cause hangs in certain situations.&#160; During some testing at Cray, we had an MDS LBUG that happened during replay on MDS restart, so we tried abort_recovery.&#160; That hung in some complex scenario related to cross MDT communication.&#160; We weren&apos;t aware at the time that cross-MDT ops weren&apos;t handled by abort recovery, but it seems likely to be related.&lt;/p&gt;</comment>
                            <comment id="251392" author="hongchao.zhang" created="Mon, 15 Jul 2019 12:26:54 +0000"  >&lt;p&gt;abort_recovery has been enabled between MDTS, the new requirement of aborting&lt;br/&gt;
the recovery between MDTs but not aborting between client/MDT will be implemented&lt;br/&gt;
in a new ticket&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                                                <inwardlinks description="is related to">
                                        <issuelink>
            <issuekey id="52628">LU-11111</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="53393">LU-11419</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="56404">LU-12546</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                    <customfield id="customfield_10030" key="com.atlassian.jira.plugin.system.customfieldtypes:labels">
                        <customfieldname>Epic/Theme</customfieldname>
                        <customfieldvalues>
                                        <label>DNE</label>
            <label>DNE2</label>
            <label>dne</label>
    
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzzvgn:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>