<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:25:39 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-2491] Spurious LNet Error: Can&apos;t accept connection on &quot;bad dst nid&quot;</title>
                <link>https://jira.whamcloud.com/browse/LU-2491</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;I see the following LNet Error in the logs:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;2012-12-13 13:09:08 LNetError: 21195:0:(o2iblnd_cb.c:2261:kiblnd_passive_connect()) Can&apos;t accept 172.20.13.43@o2ib500 on NA (ib0:0:172.20.5.2): bad dst nid 172.20.5.2@o2ib500
2012-12-13 13:09:08 LNetError: 21180:0:(o2iblnd_cb.c:2261:kiblnd_passive_connect()) Can&apos;t accept 172.20.14.153@o2ib500 on NA (ib0:0:172.20.5.2): bad dst nid 172.20.5.2@o2ib500
2012-12-13 13:09:08 LNet: Added LNI 172.20.5.2@o2ib500 [8/1024/0/180]
2012-12-13 13:09:09 LNET configured
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Without looking at the code, it seems like LNet is denying the incoming connection because it is not yet configured. If that is the case, I don&apos;t think that warrants a console message. It should just silently refuse the connection until it is fully configured.&lt;/p&gt;</description>
                <environment></environment>
        <key id="16928">LU-2491</key>
            <summary>Spurious LNet Error: Can&apos;t accept connection on &quot;bad dst nid&quot;</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="1" iconUrl="https://jira.whamcloud.com/images/icons/priorities/blocker.svg">Blocker</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="2">Won&apos;t Fix</resolution>
                                        <assignee username="isaac">Isaac Huang</assignee>
                                    <reporter username="prakash">Prakash Surya</reporter>
                        <labels>
                            <label>LB</label>
                            <label>sequoia</label>
                            <label>shh</label>
                    </labels>
                <created>Thu, 13 Dec 2012 16:18:33 +0000</created>
                <updated>Tue, 8 Jan 2013 18:12:00 +0000</updated>
                            <resolved>Tue, 8 Jan 2013 18:12:00 +0000</resolved>
                                    <version>Lustre 2.4.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>5</watches>
                                                                            <comments>
                            <comment id="49213" author="isaac" created="Thu, 13 Dec 2012 16:53:59 +0000"  >&lt;p&gt;Yes there&apos;s a small window after the o2iblnd has created a listening CMID but before the lnd_startup() call completes. Console errors shouldn&apos;t be used in such cases.&lt;/p&gt;</comment>
                            <comment id="49737" author="isaac" created="Thu, 27 Dec 2012 23:34:14 +0000"  >&lt;p&gt;I looked at the code, and it appeared not easy to do it properly. The CERROR() is shared by several similar error cases, and if simply changed to a CDEBUG() some important error cases would be muted too which actually deserves immediate attention. It&apos;s hard to single out the exact case here, i.e. an incoming connection comes while a &lt;b&gt;matching&lt;/b&gt; interface is still being initialized, taking into consideration the upcoming lnet dynamic config project.&lt;/p&gt;

&lt;p&gt;In short, a correct fix would involve quite some complexity, making the code harder to maintain in the long run. So I have to ask how many of these have been seen at LLNL to make it a concern for you guys?&lt;/p&gt;

&lt;p&gt;I&apos;d tend to think it&apos;s not a problem because:&lt;/p&gt;
&lt;ul class=&quot;alternate&quot; type=&quot;square&quot;&gt;
	&lt;li&gt;The window during which it could happen shouldn&apos;t be longer than a couple of milliseconds.&lt;/li&gt;
	&lt;li&gt;When it does happen on a node, LNet is still initializing itself, so Lustre isn&apos;t running yet. The console messages can&apos;t cause other Lustre debug messages to go unnoticed, because Lustre isn&apos;t running yet.&lt;/li&gt;
&lt;/ul&gt;


&lt;p&gt;Please let me know if I&apos;ve missed something that makes it more problematic than I thought. Otherwise I&apos;d prefer to leave it there and keep the code simple.&lt;/p&gt;</comment>
                            <comment id="49850" author="prakash" created="Wed, 2 Jan 2013 16:34:55 +0000"  >&lt;p&gt;Well, I&apos;d like to see it fixed, but if it would cause a lot of added complexity to code which will be reworked with the upcoming LNET changes, I&apos;m OK leaving it as is. It&apos;s more of an annoyance than what I&apos;d call a problem.&lt;/p&gt;</comment>
                            <comment id="49858" author="isaac" created="Wed, 2 Jan 2013 19:19:08 +0000"  >&lt;p&gt;It&apos;s difficult to filter out exactly the spurious case only. Do you guys enable neterror console logging by default? If not, it&apos;d be a good simple trade-off to just change the CERROR() into a CNETERR().&lt;/p&gt;</comment>
                            <comment id="50158" author="prakash" created="Tue, 8 Jan 2013 16:12:22 +0000"  >&lt;p&gt;IIRC, we do enable neterror by default. If it isn&apos;t worth the effort to filter the spurious case, lets just close this as &quot;wont fix&quot;. This definitely shouldn&apos;t be a blocker, IMO.&lt;/p&gt;</comment>
                            <comment id="50168" author="isaac" created="Tue, 8 Jan 2013 18:12:00 +0000"  >&lt;p&gt;It&apos;s hard to filter out the exact spurious case without adding lots of complexity elsewhere.&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzvdtb:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>5845</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>