<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:50:54 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-12245] replay-vbr test 5b fails with &apos;Restart of mds1 failed!&apos;</title>
                <link>https://jira.whamcloud.com/browse/LU-12245</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;replay-vbr test_5b fails with &apos;Restart of mds1 failed!&apos;, so far, only for SLES12 SP4.&lt;/p&gt;

&lt;p&gt;Looking the suite_log for a recent failure with logs at &lt;a href=&quot;https://testing.whamcloud.com/test_sets/be1853f6-6692-11e9-8bb1-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/be1853f6-6692-11e9-8bb1-52540065bddc&lt;/a&gt; , we see that mounting the failed over MDS does not work&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Failover mds1 to trevis-35vm7
00:10:41 (1556089841) waiting for trevis-35vm7 network 900 secs ...
00:10:41 (1556089841) network interface is UP
CMD: trevis-35vm7 hostname
mount facets: mds1
CMD: trevis-35vm7 dmsetup status /dev/mapper/mds1_flakey &amp;gt;/dev/null 2&amp;gt;&amp;amp;1
CMD: trevis-35vm7 test -b /dev/lvm-Role_MDS/P1
CMD: trevis-35vm7 loop_dev=\$(losetup -j /dev/lvm-Role_MDS/P1 | cut -d : -f 1);
			 if [[ -z \$loop_dev ]]; then
				loop_dev=\$(losetup -f);
				losetup \$loop_dev /dev/lvm-Role_MDS/P1 || loop_dev=;
			 fi;
			 echo -n \$loop_dev
trevis-35vm7: losetup: /dev/lvm-Role_MDS/P1: failed to set up loop device: No such file or directory
CMD: trevis-35vm7 test -b /dev/lvm-Role_MDS/P1
CMD: trevis-35vm7 e2label /dev/lvm-Role_MDS/P1
trevis-35vm7: e2label: No such file or directory while trying to open /dev/lvm-Role_MDS/P1
trevis-35vm7: Couldn&apos;t find valid filesystem superblock.
Starting mds1:   -o loop /dev/lvm-Role_MDS/P1 /mnt/lustre-mds1
CMD: trevis-35vm7 mkdir -p /mnt/lustre-mds1; mount -t lustre   -o loop /dev/lvm-Role_MDS/P1 /mnt/lustre-mds1
trevis-35vm7: mount: /dev/lvm-Role_MDS/P1: failed to setup loop device: No such file or directory
Start of /dev/lvm-Role_MDS/P1 on mds1 failed 32
 replay-vbr test_5b: @@@@@@ FAIL: Restart of mds1 failed! 
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Looking at the console log for MDS1 (vm8), we see the MDS failover&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt; [  266.800298] Lustre: DEBUG MARKER: /usr/sbin/lctl mark == replay-vbr test 5b: link checks version of target parent ========================================== 00:10:22 \(1556089822\)
[  266.982167] Lustre: DEBUG MARKER: == replay-vbr test 5b: link checks version of target parent ========================================== 00:10:22 (1556089822)
[  267.088184] Lustre: lustre-MDT0000: Connection restored to 144cd783-70f4-6475-93c8-b9b0a8f6fd6b (at 10.9.5.120@tcp)
[  267.089958] Lustre: Skipped 6 previous similar messages
[  267.890140] Lustre: DEBUG MARKER: /usr/sbin/lctl set_param mdd.lustre-MDT0000.sync_permission=0
[  268.211897] Lustre: DEBUG MARKER: /usr/sbin/lctl set_param mdt.lustre-MDT0000.commit_on_sharing=0
[  268.641380] Lustre: DEBUG MARKER: sync; sync; sync
[  269.842265] Lustre: DEBUG MARKER: /usr/sbin/lctl --device lustre-MDT0000 notransno
[  270.163063] Lustre: DEBUG MARKER: modprobe dm-flakey;
[  270.163063] 			 dmsetup targets | grep -q flakey
[  270.487834] Lustre: DEBUG MARKER: dmsetup table /dev/mapper/mds1_flakey
[  270.820089] Lustre: DEBUG MARKER: dmsetup suspend --nolockfs --noflush /dev/mapper/mds1_flakey
[  271.139890] Lustre: DEBUG MARKER: dmsetup load /dev/mapper/mds1_flakey --table &quot;0 20971520 flakey 252:0 0 0 1800 1 drop_writes&quot;
[  271.459528] Lustre: DEBUG MARKER: dmsetup resume /dev/mapper/mds1_flakey
[  271.812452] Lustre: DEBUG MARKER: /usr/sbin/lctl mark mds1 REPLAY BARRIER on lustre-MDT0000
[  271.975373] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000
[  272.633700] Lustre: DEBUG MARKER: /usr/sbin/lctl dl
[  272.964835] Lustre: DEBUG MARKER: modprobe dm-flakey;
[  272.964835] 			 dmsetup targets | grep -q flakey
[  273.353504] Lustre: DEBUG MARKER: /usr/sbin/lctl dl

&amp;lt;ConMan&amp;gt; Console [trevis-35vm8] disconnected from &amp;lt;trevis-35:6007&amp;gt; at 04-24 07:10.
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Looking at the failover MDS (vm7), we don&#8217;t see an indication that the MDS failed over and we don&#8217;t see replay-vbr test 5c start on either MDS.&lt;/p&gt;

&lt;p&gt;There are no other reply-vbr test 5b failures like this in the past four months.&lt;/p&gt;</description>
                <environment>SLES12 SP4</environment>
        <key id="55530">LU-12245</key>
            <summary>replay-vbr test 5b fails with &apos;Restart of mds1 failed!&apos;</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="1" iconUrl="https://jira.whamcloud.com/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="default"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="jamesanunez">James Nunez</reporter>
                        <labels>
                            <label>failover</label>
                    </labels>
                <created>Mon, 29 Apr 2019 20:14:27 +0000</created>
                <updated>Fri, 15 Nov 2019 08:41:13 +0000</updated>
                                            <version>Lustre 2.12.1</version>
                    <version>Lustre 2.12.3</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>2</watches>
                                                                            <comments>
                            <comment id="256411" author="jamesanunez" created="Tue, 15 Oct 2019 14:26:52 +0000"  >&lt;p&gt;We see a similar test hang for replay-single test 101 at &lt;a href=&quot;https://testing.whamcloud.com/test_sets/6c0fb4d6-ea6e-11e9-be86-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/6c0fb4d6-ea6e-11e9-be86-52540065bddc&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="258354" author="sebastien" created="Fri, 15 Nov 2019 08:41:13 +0000"  >&lt;p&gt;Possibly a new occurence via recovery-small test_136:&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/f00a801a-0722-11ea-b934-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/f00a801a-0722-11ea-b934-52540065bddc&lt;/a&gt;&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i00flz:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>