<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:33:23 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-10251] MDS hangs in recovery cannot abort, recovery timer is bogus</title>
                <link>https://jira.whamcloud.com/browse/LU-10251</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;MDS is rebooted (single MDS, no DNE)&lt;br/&gt;
MDS goes into recovery, with bogus values for recovery timer. &lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;soak-8 login: [ 1393.056450] Lustre: soaked-MDT0000: Denying connection &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; &lt;span class=&quot;code-keyword&quot;&gt;new&lt;/span&gt; client 7af6eae0-3527-5481-d01d-161d271e4510(at 192.168.1.142@o2ib), waiting &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; 29 known clients (6 recovered, 21 in progress, and 2 evicted) to recover in 71565:2
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;MDS never exits recovery, clients get -EBUSY. &lt;br/&gt;
Attempting to abort_recovery causes timeouts, system still wedged. &lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;1681.193209] INFO: task lctl:2555 blocked &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; more than 120 seconds.^M
[ 1681.271617] &lt;span class=&quot;code-quote&quot;&gt;&quot;echo 0 &amp;gt; /proc/sys/kernel/hung_task_timeout_secs&quot;&lt;/span&gt; disables &lt;span class=&quot;code-keyword&quot;&gt;this&lt;/span&gt; message.^M
[ 1681.368730] lctl            D ffff8803f1ecd400     0  2555   2526 0x00000084^M
[ 1681.456456]  ffff880413a5bc10 0000000000000082 ffff8803f1826eb0 ffff880413a5bfd8^M
[ 1681.548186]  ffff880413a5bfd8 ffff880413a5bfd8 ffff8803f1826eb0 ffff8808195014d0^M
[ 1681.639847]  7fffffffffffffff ffff8808195014c8 ffff8803f1826eb0 ffff8803f1ecd400^M
[ 1681.731520] Call Trace:^M
[ 1681.763370]  [&amp;lt;ffffffff816a9589&amp;gt;] schedule+0x29/0x70^M
[ 1681.826052]  [&amp;lt;ffffffff816a7099&amp;gt;] schedule_timeout+0x239/0x2c0^M
[ 1681.899089]  [&amp;lt;ffffffff816a993d&amp;gt;] wait_for_completion+0xfd/0x140^M
[ 1681.974192]  [&amp;lt;ffffffff810c4820&amp;gt;] ? wake_up_state+0x20/0x20^M
[ 1682.044159]  [&amp;lt;ffffffffc10f5a5d&amp;gt;] target_stop_recovery_thread.part.16+0x3d/0xd0 [ptlrpc]^M
[ 1682.144235]  [&amp;lt;ffffffffc10f5b08&amp;gt;] target_stop_recovery_thread+0x18/0x20 [ptlrpc]^M
[ 1682.235915]  [&amp;lt;ffffffffc15935d0&amp;gt;] mdt_iocontrol+0x550/0xaf0 [mdt]^M
[ 1682.312024]  [&amp;lt;ffffffffc0ef3bd9&amp;gt;] ? lprocfs_counter_add+0xf9/0x160 [obdclass]^M
[ 1682.400553]  [&amp;lt;ffffffffc0edebb3&amp;gt;] class_handle_ioctl+0x1913/0x1da0 [obdclass]^M
[ 1682.488997]  [&amp;lt;ffffffff812b1a98&amp;gt;] ? security_capable+0x18/0x20^M
[ 1682.561806]  [&amp;lt;ffffffffc0ec4602&amp;gt;] obd_class_ioctl+0xd2/0x170 [obdclass]^M
[ 1682.643909]  [&amp;lt;ffffffff812151bd&amp;gt;] do_vfs_ioctl+0x33d/0x540^M
[ 1682.712431]  [&amp;lt;ffffffff816b0091&amp;gt;] ? __do_page_fault+0x171/0x450^M
[ 1682.786103]  [&amp;lt;ffffffff81215461&amp;gt;] SyS_ioctl+0xa1/0xc0^M
[ 1682.849308]  [&amp;lt;ffffffff816b5089&amp;gt;] system_call_fastpath+0x16/0x1b^M
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;Lustre-log, stack traces attached, we are currently forcing a kernel dump&lt;/p&gt;</description>
                <environment>soak performance cluster</environment>
        <key id="49352">LU-10251</key>
            <summary>MDS hangs in recovery cannot abort, recovery timer is bogus</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="2" iconUrl="https://jira.whamcloud.com/images/icons/priorities/critical.svg">Critical</priority>
                        <status id="1" iconUrl="https://jira.whamcloud.com/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="default"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="laisiyao">Lai Siyao</assignee>
                                    <reporter username="cliffw">Cliff White</reporter>
                        <labels>
                            <label>soak</label>
                    </labels>
                <created>Thu, 16 Nov 2017 19:49:40 +0000</created>
                <updated>Fri, 17 Nov 2017 23:26:07 +0000</updated>
                                            <version>Lustre 2.11.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                            <comments>
                            <comment id="214045" author="jgmitter" created="Fri, 17 Nov 2017 18:50:14 +0000"  >&lt;p&gt;Hi Lai,&lt;/p&gt;

&lt;p&gt;Can you please investigate this issue?&lt;/p&gt;

&lt;p&gt;Thanks.&lt;br/&gt;
Joe&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                            <attachment id="28743" name="soak-8.crash.txt" size="764091" author="cliffw" created="Thu, 16 Nov 2017 19:49:33 +0000"/>
                            <attachment id="28742" name="soak-8.recovery.wedge.txt" size="6842304" author="cliffw" created="Thu, 16 Nov 2017 19:49:39 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzznyv:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>