<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:50:16 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-12171] sanity test_133g: Timeout occurred after 161 mins</title>
                <link>https://jira.whamcloud.com/browse/LU-12171</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;This issue was created by maloo for S Buisson &amp;lt;sbuisson@ddn.com&amp;gt;&lt;/p&gt;

&lt;p&gt;This issue relates to the following test suite run: &lt;a href=&quot;https://testing.whamcloud.com/test_sets/43fba6f6-5a10-11e9-92fe-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/43fba6f6-5a10-11e9-92fe-52540065bddc&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;test_133g failed with the following error:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Timeout occurred after 161 mins, last suite running was sanity, restarting cluster to continue tests
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;trevis-14vm9, which hosts MDT0000, becomes unreachable and never recovers. From the MDS standpoint, the client was evicted, but the client did not manage to reconnect.&lt;/p&gt;





&lt;p&gt;VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV&lt;br/&gt;
sanity test_133g - Timeout occurred after 161 mins, last suite running was sanity, restarting cluster to continue tests&lt;/p&gt;</description>
                <environment></environment>
        <key id="55373">LU-12171</key>
            <summary>sanity test_133g: Timeout occurred after 161 mins</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="3">Duplicate</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="maloo">Maloo</reporter>
                        <labels>
                    </labels>
                <created>Mon, 8 Apr 2019 15:45:47 +0000</created>
                <updated>Fri, 19 Apr 2019 18:11:15 +0000</updated>
                            <resolved>Thu, 18 Apr 2019 22:37:32 +0000</resolved>
                                                                        <due></due>
                            <votes>0</votes>
                                    <watches>4</watches>
                                                                            <comments>
                            <comment id="245750" author="guzheng" created="Mon, 15 Apr 2019 01:19:12 +0000"  >&lt;p&gt;Another instance:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://testing.whamcloud.com/test_sessions/45917329-c33d-4929-9393-69b342c5f0eb&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sessions/45917329-c33d-4929-9393-69b342c5f0eb&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="245948" author="pfarrell" created="Wed, 17 Apr 2019 19:17:11 +0000"  >&lt;p&gt;&lt;a href=&quot;https://testing.whamcloud.com/test_sessions/d70e90ee-cbfd-426e-b0b7-e82cad8bcfd4&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sessions/d70e90ee-cbfd-426e-b0b7-e82cad8bcfd4&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="245956" author="pfarrell" created="Wed, 17 Apr 2019 19:57:47 +0000"  >&lt;p&gt;So I took a look here...&lt;/p&gt;

&lt;p&gt;mds1 is getting failed over in all of the failure cases.&lt;/p&gt;


&lt;p&gt;In the cleanup for this, we check if mds1 is active on the expected server, and if it&apos;s not, we fail it.&#160; That&apos;s fine...&lt;/p&gt;


&lt;p&gt;In the timeout cases, mds1 is not showing as active, and so it&apos;s getting failed over - but it&apos;s getting failed over to the same VM it started from:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;cln..Failing mds1 on onyx-43vm9
CMD: onyx-43vm9 grep -c /mnt/lustre-mds1&apos; &apos; /proc/mounts || true
Stopping /mnt/lustre-mds1 (opts:) on onyx-43vm9
CMD: onyx-43vm9 umount -d /mnt/lustre-mds1
CMD: onyx-43vm9 lsmod | grep lnet &amp;gt; /dev/null &amp;amp;&amp;amp;
lctl dl | grep &apos; ST &apos; || true
CMD: onyx-43vm9 ! zpool list -H lustre-mdt1 &amp;gt;/dev/null 2&amp;gt;&amp;amp;1 ||
			grep -q ^lustre-mdt1/ /proc/mounts ||
			zpool export  lustre-mdt1
reboot facets: mds1
Failover mds1 to onyx-43vm9 
13:57:29 (1554731849) waiting for trevis-14vm9 network 900 secs ...
13:57:29 (1554731849) network interface is UP
CMD: trevis-14vm9 hostname
mount facets: mds1&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;This is quite odd.&#160; I can&apos;t see why this would happen, but it seems plausible that an unintended self-failover like this might confuse something.&#160; (I can&apos;t figure out what yet.)&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/ViewProfile.jspa?name=jamesanunez&quot; class=&quot;user-hover&quot; rel=&quot;jamesanunez&quot;&gt;jamesanunez&lt;/a&gt;:&lt;/p&gt;

&lt;p&gt;This started on April 8th, and is limited to DNE testing.&#160; Given what we&apos;re seeing elsewhere, I&apos;d lay money this is &lt;b&gt;also&lt;/b&gt;&#160;fallout from &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-11636&quot; title=&quot;t-f test_mkdir() does not support interop with non DNEII servers&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-11636&quot;&gt;&lt;del&gt;LU-11636&lt;/del&gt;&lt;/a&gt;.&lt;/p&gt;</comment>
                            <comment id="246047" author="pfarrell" created="Thu, 18 Apr 2019 22:37:32 +0000"  >&lt;p&gt;Dupe of &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-12175&quot; title=&quot;sanity test 208 fails with &amp;#39;lease broken over recovery&amp;#39;&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-12175&quot;&gt;LU-12175&lt;/a&gt;&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                            <outwardlinks description="duplicates">
                                        <issuelink>
            <issuekey id="55382">LU-12175</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                                                <inwardlinks description="is related to">
                                        <issuelink>
            <issuekey id="55455">LU-12210</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                    <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                    <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i00en3:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                    <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                    <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                </customfields>
    </item>
</channel>
</rss>