<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:36:03 UTC 2024
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-17513] how does &apos;conns_per_peer&apos; apply with multiple NIDs?</title>
                <link>https://jira.whamcloud.com/browse/LU-17513</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;On a system with multiple NIDs on both peers in a connection, there is an issue with &lt;tt&gt;conns_per_peer&lt;/tt&gt; connections being created for &lt;b&gt;each&lt;/b&gt; local/remote NID combination.  AFAIK, that means a single pair of nodes with &lt;tt&gt;conns_per_peer=4&lt;/tt&gt; and 2 interfaces on each side (e.g. DGX + server) would create &lt;tt&gt;(4 conns * &amp;lt;read,write&amp;gt; + 1 control) * 2 local NIDs * 2 remote NIDs&lt;/tt&gt; = 36 TCP sockets &lt;b&gt;PER PEER&lt;/b&gt;, and this increases with the &lt;b&gt;square&lt;/b&gt; of the number of NIDs on each peer (= 144 sockets &lt;b&gt;PER PEER&lt;/b&gt; with 4 NIDs each).&lt;/p&gt;

&lt;p&gt;With 1200 2-NID clients that means 43200 sockets (1200 * 36) on a 2-NID server, which is a real-world problem today, and a slightly larger Ethernet cluster could become unusable due to connection storms exceeding the total number of available ports on the server.&lt;/p&gt;
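
&lt;p&gt;A minimal sketch of the arithmetic above, assuming the all-to-all behaviour is as described. The function names are purely illustrative and are not part of LNet or socklnd; the sketch only reproduces the socket counts and does not model how socklnd actually sets up connections.&lt;/p&gt;

```python
def sockets_per_peer(conns_per_peer, local_nids, remote_nids):
    """Socket count between one pair of nodes under the all-to-all model.

    Hypothetical helper: it encodes only the formula quoted in this issue,
    (conns_per_peer * 2 + 1) * local NIDs * remote NIDs.
    """
    # Each local/remote NID combination gets conns_per_peer read sockets,
    # conns_per_peer write sockets, and one control socket.
    per_nid_pair = conns_per_peer * 2 + 1
    return per_nid_pair * local_nids * remote_nids


def sockets_on_server(clients, conns_per_peer, client_nids, server_nids):
    """Total sockets on one server when every client connects all-to-all."""
    return clients * sockets_per_peer(conns_per_peer, client_nids, server_nids)


print(sockets_per_peer(4, 2, 2))         # 36
print(sockets_per_peer(4, 4, 4))         # 144
print(sockets_on_server(1200, 4, 2, 2))  # 43200
```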

&lt;p&gt;It seems to me (although this is speculation on my part) that socklnd should distribute the &lt;tt&gt;conns_per_peer&lt;/tt&gt; connections across the available NID combinations instead of establishing an all-to-all set of connections, and then send messages to the peer knowing which peer NIDs actually have established connections.  In most cases, the physical network topology (switches) would likely prefer a 1:1 pairing of the local and remote interfaces.  If an interface fails on the local or remote side, those connections could be redistributed to other local or remote NIDs as needed.&lt;/p&gt;</description>
                <environment></environment>
        <key id="80709">LU-17513</key>
            <summary>how does &apos;conns_per_peer&apos; apply with multiple NIDs?</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="1" iconUrl="https://jira.whamcloud.com/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="default"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="adilger">Andreas Dilger</reporter>
                        <labels>
                            <label>LNet</label>
                            <label>Multi-Rail</label>
                    </labels>
                <created>Wed, 7 Feb 2024 22:04:26 +0000</created>
                <updated>Thu, 8 Feb 2024 00:05:12 +0000</updated>
                                                                                <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                            <comments>
                            <comment id="403091" author="adilger" created="Wed, 7 Feb 2024 22:29:46 +0000"  >&lt;p&gt;&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-17515&quot; title=&quot;dynamically shrink &amp;#39;conns_per_peer&amp;#39; as needed&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-17515&quot;&gt;LU-17515&lt;/a&gt; is also very closely related to this issue, but may need significantly different changes and should be tracked separately.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="80711">LU-17515</issuekey>
        </issuelink>
                            </outwardlinks>
                                                                <inwardlinks description="is related to">
                                        <issuelink>
            <issuekey id="78758">LU-17258</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="80710">LU-17514</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i04aqf:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>