<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:05:46 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-13972] kiblnd can continue attempting to reconnect indefinitely.</title>
                <link>https://jira.whamcloud.com/browse/LU-13972</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;As shown in the log below&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;
00000800:00000100:6.0:1600464412.044215:0:40:0:(o2iblnd_cb.c:2906:kiblnd_rejected()) 172.16.0.44@o2ib rejected: no listener at 987
00000800:00000100:6.0:1600464412.045753:0:40:0:(o2iblnd_cb.c:2880:kiblnd_check_reconnect()) 172.16.0.44@o2ib: reconnect (invalid service id), 12, 12, msg_size: 4096, queue_depth: 8/-1, max_frags: 256/-1
00000800:00000100:6.0:1600464412.045755:0:40:0:(o2iblnd_cb.c:2906:kiblnd_rejected()) 172.16.0.44@o2ib rejected: no listener at 987
00000800:00000100:6.0:1600464412.047336:0:40:0:(o2iblnd_cb.c:2880:kiblnd_check_reconnect()) 172.16.0.44@o2ib: reconnect (invalid service id), 12, 12, msg_size: 4096, queue_depth: 8/-1, max_frags: 256/-1
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;The o2iblnd can get into a loop attemptint to reconnect to a node which is not up until the connection timeout kicks in.&lt;/p&gt;

&lt;p&gt;There are two potential solutions. Add a new module parameter to control the number of times to attempt a reconnect before we fail.&lt;/p&gt;

&lt;p&gt;Another option, which I prefer, is to use the existing retry_count o2iblnd module parameter to limit the number of connection retries.&lt;/p&gt;

&lt;p&gt;It&apos;s currently used for:&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;
retry_count
The maximum number of times that a data transfer operation 
should be retried on the connection when an error occurs. This setting 
controls the number of
times to retry send, RDMA, and atomic operations when timeouts occur. 
Applies only to RDMA_PS_TCP. &lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;I believe it&apos;s can be used for reconnection attempts performed by the iblnd as well.&lt;/p&gt;</description>
                <environment></environment>
        <key id="60861">LU-13972</key>
            <summary>kiblnd can continue attempting to reconnect indefinitely.</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="ashehata">Amir Shehata</assignee>
                                    <reporter username="ashehata">Amir Shehata</reporter>
                        <labels>
                    </labels>
                <created>Fri, 18 Sep 2020 22:08:36 +0000</created>
                <updated>Fri, 19 Mar 2021 14:04:45 +0000</updated>
                            <resolved>Mon, 19 Oct 2020 04:07:57 +0000</resolved>
                                                    <fixVersion>Lustre 2.14.0</fixVersion>
                    <fixVersion>Lustre 2.12.7</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>8</watches>
                                                                            <comments>
                            <comment id="280024" author="gerrit" created="Sat, 19 Sep 2020 08:35:22 +0000"  >&lt;p&gt;Amir Shehata (ashehata@whamcloud.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/39981&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/39981&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-13972&quot; title=&quot;kiblnd can continue attempting to reconnect indefinitely.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-13972&quot;&gt;&lt;del&gt;LU-13972&lt;/del&gt;&lt;/a&gt; o2iblnd: Don&apos;t retry indefinitely&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: c0ce7179f98ab2bef0566afdbb9a8f58db6a9c7c&lt;/p&gt;</comment>
                            <comment id="280135" author="chunteraa" created="Mon, 21 Sep 2020 14:55:14 +0000"  >&lt;p&gt;Currently when a client loses connection to a server, it will retry indefinitely. &lt;br/&gt;
With this change, it appears the client will eventually fail the connection.&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;</comment>
                            <comment id="282533" author="gerrit" created="Mon, 19 Oct 2020 03:13:35 +0000"  >&lt;p&gt;Oleg Drokin (green@whamcloud.com) merged in patch &lt;a href=&quot;https://review.whamcloud.com/39981/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/39981/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-13972&quot; title=&quot;kiblnd can continue attempting to reconnect indefinitely.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-13972&quot;&gt;&lt;del&gt;LU-13972&lt;/del&gt;&lt;/a&gt; o2iblnd: Don&apos;t retry indefinitely&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 7c8ad11ef08f0f2f886004ae4a56f67722c16d5c&lt;/p&gt;</comment>
                            <comment id="282551" author="pjones" created="Mon, 19 Oct 2020 04:07:57 +0000"  >&lt;p&gt;Landed for 2.14&lt;/p&gt;</comment>
                            <comment id="294742" author="gerrit" created="Thu, 11 Mar 2021 19:51:21 +0000"  >&lt;p&gt;Serguei Smirnov (ssmirnov@whamcloud.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/42011&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/42011&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-13972&quot; title=&quot;kiblnd can continue attempting to reconnect indefinitely.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-13972&quot;&gt;&lt;del&gt;LU-13972&lt;/del&gt;&lt;/a&gt; o2iblnd: Don&apos;t retry indefinitely&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: b2_12&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 30ce9d9700e82e8317709c8ffa5a4ab754e6544e&lt;/p&gt;</comment>
                            <comment id="295281" author="gerrit" created="Wed, 17 Mar 2021 23:21:25 +0000"  >&lt;p&gt;Oleg Drokin (green@whamcloud.com) merged in patch &lt;a href=&quot;https://review.whamcloud.com/42011/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/42011/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-13972&quot; title=&quot;kiblnd can continue attempting to reconnect indefinitely.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-13972&quot;&gt;&lt;del&gt;LU-13972&lt;/del&gt;&lt;/a&gt; o2iblnd: Don&apos;t retry indefinitely&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: b2_12&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 6d2aae7396cfcc37873effa137f8e0cc437132ff&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                                                <inwardlinks description="is related to">
                                                        </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i01a9z:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>