<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:11:50 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-7778] mount of MDT(==MGS) failed after MDS restart</title>
                <link>https://jira.whamcloud.com/browse/LU-7778</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Error happened during soak testing of build &apos;20160215&apos; (see &lt;a href=&quot;https://wiki.hpdd.intel.com/display/Releases/Soak+Testing+on+Lola#SoakTestingonLola-20150215&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://wiki.hpdd.intel.com/display/Releases/Soak+Testing+on+Lola#SoakTestingonLola-20150215&lt;/a&gt;). DNE is enabled.&lt;br/&gt;
MDT had been formatted using &lt;em&gt;ldiskfs&lt;/em&gt;, OSTs using &lt;em&gt;zfs&lt;/em&gt;. MDS nodes are configured in active-active HA failover configuration.&lt;/p&gt;

&lt;p&gt;Please note that build 20150215 is a vanilla build of the master brunch. &lt;br/&gt;
This issue might be addressed by the changes included in build &apos;20160210&apos; as we didn&apos;t observe this issue in a two day test session.&lt;/p&gt;

&lt;p&gt;Sequence of events:&lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;2016-02-15 16:25:21,179:fsmgmt.fsmgmt:INFO     triggering fault mds_restart&lt;/li&gt;
	&lt;li&gt;2016-02-15 16:31:41,282:fsmgmt.fsmgmt:INFO     lola-8 is up&lt;/li&gt;
	&lt;li&gt;2016-02-15 16:36:50,594:fsmgmt.fsmgmt:INFO     ... soaked-MDT0001 mounted successfully on lola-8&lt;/li&gt;
	&lt;li&gt;2016-02-15 16:38:20,      mount of MDT0000 (== MGS) fails&lt;br/&gt;
Error message reads as:
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Feb 15 16:38:20 lola-8 kernel: LustreError: 15c-8: MGC192.168.1.108@o2ib10: The configuration from log &apos;soaked-MDT0000&apos; failed (-5). This may be the result of communication errors between this node and the MGS, a bad configuration, or other errors. See the syslog for more information.
Feb 15 16:38:20 lola-8 kernel: LustreError: 4538:0:(obd_mount_server.c:1309:server_start_targets()) failed to start server soaked-MDT0000: -5
Feb 15 16:38:20 lola-8 kernel: LustreError: 4538:0:(obd_mount_server.c:1798:server_fill_super()) Unable to start targets: -5
Feb 15 16:38:20 lola-8 kernel: LustreError: 4538:0:(obd_mount_server.c:1512:server_put_super()) no obd soaked-MDT0000
Feb 15 16:38:20 lola-8 kernel: Lustre: server umount soaked-MDT0000 complete
Feb 15 16:38:20 lola-8 kernel: LustreError: 4538:0:(obd_mount.c:1426:lustre_fill_super()) Unable to mount  (-5)
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;&lt;/li&gt;
	&lt;li&gt;I checked the HW and cluster configuration: no problem with IB HCA, LNet is working, routers are up; Disk device file of MDT-0000 can be read and accessed.&lt;/li&gt;
&lt;/ul&gt;


&lt;p&gt;Attached messages, console and manual forced debug log of node &lt;tt&gt;lola-8&lt;/tt&gt;.&lt;/p&gt;</description>
                <environment>lola&lt;br/&gt;
build: 2.8.50-6-gf9ca359 ; commit f9ca359284357d145819beb08b316e932f7a3060</environment>
        <key id="34685">LU-7778</key>
            <summary>mount of MDT(==MGS) failed after MDS restart</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="1" iconUrl="https://jira.whamcloud.com/images/icons/priorities/blocker.svg">Blocker</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="di.wang">Di Wang</assignee>
                                    <reporter username="heckes">Frank Heckes</reporter>
                        <labels>
                            <label>soak</label>
                    </labels>
                <created>Tue, 16 Feb 2016 09:08:44 +0000</created>
                <updated>Wed, 24 Feb 2016 06:16:02 +0000</updated>
                            <resolved>Wed, 24 Feb 2016 06:16:02 +0000</resolved>
                                    <version>Lustre 2.8.0</version>
                                    <fixVersion>Lustre 2.8.0</fixVersion>
                    <fixVersion>Lustre 2.9.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>4</watches>
                                                                            <comments>
                            <comment id="142326" author="di.wang" created="Tue, 16 Feb 2016 21:05:52 +0000"  >&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Feb 15 16:37:47 lola-8 kernel: LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. quota=on. Opts:
Feb 15 16:37:48 lola-8 kernel: LustreError: 11-0: soaked-MDT0006-osp-MDT0001: operation mds_connect to node 192.168.1.111@o2ib10 failed: rc = -16
Feb 15 16:37:48 lola-8 kernel: LustreError: Skipped 3 previous similar messages
Feb 15 16:37:48 lola-8 kernel: Lustre: 4320:0:(client.c:2063:ptlrpc_expire_one_request()) @@@ Request sent has failed due to network error: [sent 1455583068/real 1455583068]  req@ffff8804037909c0 x1526289292853684/t0(0) o38-&amp;gt;soaked-MDT0000-osp-MDT0001@192.168.1.109@o2ib10:24/4 lens 520/544 e 0 to 1 dl 1455583079 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
Feb 15 16:37:48 lola-8 kernel: Lustre: 4320:0:(client.c:2063:ptlrpc_expire_one_request()) Skipped 2 previous similar messages
Feb 15 16:37:54 lola-8 kernel: LustreError: 137-5: soaked-MDT0003_UUID: not available for connect from 192.168.1.104@o2ib10 (no target). If you are running an HA pair check that the target is mounted on the other server.
Feb 15 16:37:54 lola-8 kernel: LustreError: Skipped 58 previous similar messages
Feb 15 16:38:03 lola-8 kernel: Lustre: soaked-MDT0001: Client d26c53bc-3d10-5c53-0c35-f189140fc2e8 (at 192.168.1.131@o2ib100) reconnecting, waiting for 14 clients in recovery for 3:53
Feb 15 16:38:03 lola-8 kernel: Lustre: Skipped 180 previous similar messages
Feb 15 16:38:20 lola-8 kernel: LustreError: 15c-8: MGC192.168.1.108@o2ib10: The configuration from log &apos;soaked-MDT0000&apos; failed (-5). This may be the result of communication errors between this node and the MGS, a bad configuration, or other errors. See the syslog for more information.
Feb 15 16:38:20 lola-8 kernel: LustreError: 4538:0:(obd_mount_server.c:1309:server_start_targets()) failed to start server soaked-MDT0000: -5
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;It looks like MDT0 has trouble to communicate with MGS. But unfortunately, there are no logs to indicate what happens. I guess I need monitor the &quot;run&quot;.&lt;/p&gt;</comment>
                            <comment id="142890" author="gerrit" created="Thu, 18 Feb 2016 22:09:02 +0000"  >&lt;p&gt;wangdi (di.wang@intel.com) uploaded a new patch: &lt;a href=&quot;http://review.whamcloud.com/18509&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/18509&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-7778&quot; title=&quot;mount of MDT(==MGS) failed after MDS restart&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-7778&quot;&gt;&lt;del&gt;LU-7778&lt;/del&gt;&lt;/a&gt; osd: check if the object is destroyed&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 3096d9dbeae6bafccf10104b8221b91fac05a08f&lt;/p&gt;</comment>
                            <comment id="143484" author="gerrit" created="Wed, 24 Feb 2016 06:07:59 +0000"  >&lt;p&gt;Oleg Drokin (oleg.drokin@intel.com) merged in patch &lt;a href=&quot;http://review.whamcloud.com/18509/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/18509/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-7778&quot; title=&quot;mount of MDT(==MGS) failed after MDS restart&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-7778&quot;&gt;&lt;del&gt;LU-7778&lt;/del&gt;&lt;/a&gt; osd: check if the object is destroyed&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: a19c1ea92fb8d9909ec9fb98f22a8a9e4835c572&lt;/p&gt;</comment>
                            <comment id="143486" author="pjones" created="Wed, 24 Feb 2016 06:16:02 +0000"  >&lt;p&gt;Landed for 2.8 and 2.9&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                            <attachment id="20384" name="console-lola-8.log.bz2" size="74600" author="heckes" created="Tue, 16 Feb 2016 09:25:05 +0000"/>
                            <attachment id="20385" name="lustre-log-mount-non-operational-20160216-0044-lola-8.bz2" size="5106056" author="heckes" created="Tue, 16 Feb 2016 09:25:06 +0000"/>
                            <attachment id="20386" name="messages-lola-8.log.bz2" size="88135" author="heckes" created="Tue, 16 Feb 2016 09:25:06 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzy1bz:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>