<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:19:52 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-1809] Clients unable to mount (-108)</title>
                <link>https://jira.whamcloud.com/browse/LU-1809</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;NOAA hit a problem that looks a lot like &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-441&quot; title=&quot;ll_fill_super()) Unable to process log: -108&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-441&quot;&gt;&lt;del&gt;LU-441&lt;/del&gt;&lt;/a&gt;. The clients were unable to mount the filesystem for a while after rebooting. &lt;/p&gt;

&lt;p&gt;Here&apos;s the client&apos;s syslog:&lt;br/&gt;
Aug 30 15:00:09 s1 kernel: Lustre:&lt;br/&gt;
2524:0:(client.c:1487:ptlrpc_expire_one_request()) @@@ Request&lt;br/&gt;
x1410486709891636 sent from MGC10.179.16.120@o2ib to NID&lt;br/&gt;
10.179.16.121@o2ib 0s ago has failed due to network error (35s prior to&lt;br/&gt;
deadline).&lt;br/&gt;
Aug 30 15:00:09 s1 kernel:  req@ffff8805fc06e400 x1410486709891636/t0&lt;br/&gt;
o250-&amp;gt;MGS@MGC10.179.16.120@o2ib_1:26/25 lens 368/584 e 0 to 1 dl&lt;br/&gt;
1346338844 ref 1 fl Rpc:N/0/0 rc 0/0&lt;br/&gt;
Aug 30 15:00:09 s1 kernel: LustreError:&lt;br/&gt;
112398:0:(client.c:858:ptlrpc_import_delay_req()) @@@ IMP_INVALID&lt;br/&gt;
req@ffff88041c05e800 x1410486709891637/t0&lt;br/&gt;
o501-&amp;gt;MGS@MGC10.179.16.120@o2ib_1:26/25 lens 264/432 e 0 to 1 dl 0 ref 1&lt;br/&gt;
fl Rpc:/0/0 rc 0/0&lt;br/&gt;
Aug 30 15:00:09 s1 kernel: LustreError: 15c-8: MGC10.179.16.120@o2ib:&lt;br/&gt;
The configuration from log &apos;lfs2-client&apos; failed (-108). This may be the&lt;br/&gt;
result of communication errors between this node and the MGS, a bad&lt;br/&gt;
configuration, or other errors. See the syslog for more information.&lt;br/&gt;
Aug 30 15:00:09 s1 kernel: LustreError:&lt;br/&gt;
112398:0:(llite_lib.c:1095:ll_fill_super()) Unable to process log: -108&lt;br/&gt;
Aug 30 15:00:09 s1 kernel: Lustre: client lfs2-client(ffff88041c2ea400)&lt;br/&gt;
umount complete&lt;br/&gt;
Aug 30 15:00:09 s1 kernel: LustreError:&lt;br/&gt;
112398:0:(obd_mount.c:2065:lustre_fill_super()) Unable to mount  (-108)&lt;/p&gt;

&lt;p&gt;MDS logs to come. &lt;/p&gt;</description>
                <environment></environment>
        <key id="15639">LU-1809</key>
            <summary>Clients unable to mount (-108)</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="3">Duplicate</resolution>
                                        <assignee username="keith">Keith Mannthey</assignee>
                                    <reporter username="kitwestneat">Kit Westneat</reporter>
                        <labels>
                    </labels>
                <created>Fri, 31 Aug 2012 09:33:08 +0000</created>
                <updated>Fri, 19 Oct 2012 08:50:47 +0000</updated>
                            <resolved>Fri, 19 Oct 2012 08:50:47 +0000</resolved>
                                    <version>Lustre 1.8.8</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>7</watches>
                                                                            <comments>
                            <comment id="44054" author="isaac" created="Fri, 31 Aug 2012 15:11:58 +0000"  >&lt;p&gt;Likely this is a dup of &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-630&quot; title=&quot;mount failure after MGS connection lost and file system is unmounted&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-630&quot;&gt;&lt;del&gt;LU-630&lt;/del&gt;&lt;/a&gt;. The message below said an outgoing message sent 0 second ago failed with error:&lt;br/&gt;
2524:0:(client.c:1487:ptlrpc_expire_one_request()) @@@ Request x1410486709891636 sent from MGC10.179.16.120@o2ib to NID 10.179.16.121@o2ib 0s ago has failed due to network error (35s prior to deadline).&lt;/p&gt;

&lt;p&gt;This looked like a local error, i.e. the message did not go out on wire. Please:&lt;br/&gt;
1. On all clients and servers: options ko2iblnd peer_timeout=0&lt;br/&gt;
2. On some clients where mount failed: echo +neterror &amp;gt; /proc/sys/lnet/printk&lt;br/&gt;
   This must be done after each client reboot.&lt;/p&gt;

&lt;p&gt;If this is a dup of &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-630&quot; title=&quot;mount failure after MGS connection lost and file system is unmounted&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-630&quot;&gt;&lt;del&gt;LU-630&lt;/del&gt;&lt;/a&gt;, then step 1 should fix it. If it still persists, step 2 would allow more debug data to go to syslog; /proc/sys/lnet/peers would also provide useful data in this case.&lt;/p&gt;</comment>
                            <comment id="44236" author="kitwestneat" created="Wed, 5 Sep 2012 18:27:16 +0000"  >&lt;p&gt;It looks like the patch is fairly simple, can we get it landed on b1_8?&lt;/p&gt;

&lt;p&gt;In the meantime I will communicate the workaround to the customer. I think it is pretty rare though. &lt;/p&gt;

&lt;p&gt;Thanks,&lt;br/&gt;
Kit&lt;/p&gt;</comment>
                            <comment id="44306" author="kitwestneat" created="Thu, 6 Sep 2012 13:54:16 +0000"  >&lt;p&gt;Hi Isaac,&lt;/p&gt;

&lt;p&gt;What are the implications of peer_timeout=0? That is to say, what exactly does it do? &lt;/p&gt;

&lt;p&gt;Also, does it have to be on all the servers and clients? or can it be just the servers or just the clients?&lt;/p&gt;

&lt;p&gt;Thanks,&lt;br/&gt;
Kit&lt;/p&gt;</comment>
                            <comment id="44309" author="isaac" created="Thu, 6 Sep 2012 14:11:21 +0000"  >&lt;p&gt;peer_timeout=0 disables a feature that should only be turned on for routers - it was a bug to be able to enable it anywhere but the routers. In other words, peer_timeout=0 fixes it without any code changes.&lt;/p&gt;

&lt;p&gt;The feature does not work on clients and servers and will cause messages to be dropped, so &quot;peer_timeout=0&quot; must be set on all clients and servers.&lt;/p&gt;</comment>
                            <comment id="44311" author="kitwestneat" created="Thu, 6 Sep 2012 14:14:55 +0000"  >&lt;p&gt;Is it ok to do &quot;peer_timeout=0&quot; on the clients before the servers? Or does it need to be set at the same time everywhere?&lt;/p&gt;</comment>
                            <comment id="44312" author="isaac" created="Thu, 6 Sep 2012 14:18:54 +0000"  >&lt;p&gt;There&apos;s no requirement on order. You can do it in any order that&apos;s most convenient.&lt;/p&gt;</comment>
                            <comment id="45516" author="kitwestneat" created="Tue, 25 Sep 2012 10:52:04 +0000"  >&lt;p&gt;could we get this landed to b1_8? It appears to have fixed the issue.&lt;/p&gt;</comment>
                            <comment id="46680" author="isaac" created="Wed, 17 Oct 2012 14:36:39 +0000"  >&lt;p&gt;I likely missed some notifications when JIRA was upgraded a while back. I agree that it&apos;s a simple patch that fixes a class of problems hard to diagnose when they manifest themselves at upper layers. I&apos;d defer to Peter whether to land it to b1_8.&lt;/p&gt;</comment>
                            <comment id="46681" author="pjones" created="Wed, 17 Oct 2012 14:45:01 +0000"  >&lt;p&gt;Thanks Isaac. Keith can you please backport the patch from master to b1_8?&lt;/p&gt;</comment>
                            <comment id="46682" author="isaac" created="Wed, 17 Oct 2012 14:54:36 +0000"  >&lt;p&gt;Quite likely the patch would apply to b1_8 without any changes, just ignore white space changes with patch --ignore-whitespace. &lt;/p&gt;</comment>
                            <comment id="46688" author="keith" created="Wed, 17 Oct 2012 17:05:56 +0000"  >&lt;p&gt;I was able to cherry-pick the patch from &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-630&quot; title=&quot;mount failure after MGS connection lost and file system is unmounted&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-630&quot;&gt;&lt;del&gt;LU-630&lt;/del&gt;&lt;/a&gt; into b1_8.  &lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;http://review.whamcloud.com/4287&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/4287&lt;/a&gt; is the b1_8 patch. &lt;/p&gt;</comment>
                            <comment id="46772" author="pjones" created="Fri, 19 Oct 2012 08:50:47 +0000"  >&lt;p&gt;duplicate of &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-630&quot; title=&quot;mount failure after MGS connection lost and file system is unmounted&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-630&quot;&gt;&lt;del&gt;LU-630&lt;/del&gt;&lt;/a&gt;&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                            <attachment id="11812" name="mds-2-1.log" size="82482" author="kitwestneat" created="Fri, 31 Aug 2012 09:33:08 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzvgmf:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>6342</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>