<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:21:56 UTC 2024
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-15862] How to set preferred server NID</title>
                <link>https://jira.whamcloud.com/browse/LU-15862</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;We have a critical Mellanox firmware issue causing HCA soft lockups. As a possible workaround, we are considering using tcp NIDs until we have a fix from Mellanox.&lt;/p&gt;

&lt;p&gt;We want to configure the servers with both tcp and o2ib NIDs; doing so will avoid future downtime from switching NIDs on the servers.&lt;/p&gt;

&lt;p&gt;Is there a way to select/prefer NIDs on the client per filesystem?&lt;/p&gt;

&lt;p&gt;Here is an example config:&lt;/p&gt;

&lt;p&gt;client_a: [o2ib(ib0)&#160;tcp(ib0)] (mount fs1 using tcp0) (mount fs2 using o2ib) (mount fs3 using o2ib)&lt;/p&gt;

&lt;p&gt;client_b: [o2ib414(ib0)] (mount fs1 using tcp0) (mount fs2 using o2ib) (mount fs3 using o2ib)&lt;/p&gt;

&lt;p&gt;lrouter: &#160;o2ib41(ib1) :: o2ib(ib0) tcp(ib0)&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;

&lt;p&gt;fs1-srv1:&#160; &#160;o2ib(ib0)&#160;tcp(ib0)&lt;/p&gt;

&lt;p&gt;fs2-srv1: &#160;o2ib(ib0)&#160;tcp(ib0)&lt;/p&gt;

&lt;p&gt;fs3-srv1: o2ib(ib0)&lt;/p&gt;</description>
                <environment></environment>
        <key id="70331">LU-15862</key>
            <summary>How to set preferred server NID</summary>
                <type id="9" iconUrl="https://jira.whamcloud.com/images/icons/issuetypes/undefined.png">Question/Request</type>
                                            <priority id="2" iconUrl="https://jira.whamcloud.com/images/icons/priorities/critical.svg">Critical</priority>
                        <status id="1" iconUrl="https://jira.whamcloud.com/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="default"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="ssmirnov">Serguei Smirnov</assignee>
                                    <reporter username="mhanafi">Mahmoud Hanafi</reporter>
                        <labels>
                    </labels>
                <created>Tue, 17 May 2022 00:09:50 +0000</created>
                <updated>Fri, 10 Jun 2022 22:20:59 +0000</updated>
                                            <version>Lustre 2.12.8</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>8</watches>
                                                                            <comments>
                            <comment id="335117" author="adilger" created="Tue, 17 May 2022 16:06:09 +0000"  >&lt;p&gt;Serguei, can you please comment and/or sub-assign?&lt;/p&gt;</comment>
                            <comment id="335129" author="JIRAUSER17312" created="Tue, 17 May 2022 18:03:33 +0000"  >&lt;p&gt;Hi &lt;a href=&quot;https://jira.whamcloud.com/secure/ViewProfile.jspa?name=mhanafi&quot; class=&quot;user-hover&quot; rel=&quot;mhanafi&quot;&gt;mhanafi&lt;/a&gt;,&lt;/p&gt;

&lt;p&gt;We&apos;re actively looking into this. May I ask which critical Mellanox firmware issue you are dealing with?&lt;/p&gt;

&lt;p&gt;-cf&lt;/p&gt;

</comment>
                            <comment id="337343" author="ashehata" created="Fri, 10 Jun 2022 22:19:21 +0000"  >&lt;p&gt;Hi Mahmoud,&lt;/p&gt;

&lt;p&gt;We implemented the User Defined Selection Policy (UDSP) feature, which lets you add rules to do what you&apos;re looking for. It was added in Lustre 2.15.&lt;/p&gt;

&lt;p&gt;Here is how it would work in your case:&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;
lnetctl udsp add o2ib0 --priority 0 
lnetctl udsp add tcp0 --priority 1&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;0 is the highest priority. These rules always prefer o2ib0 unless it becomes unreachable, in which case traffic falls back to tcp0.&lt;/p&gt;

&lt;p&gt;If you add both of these rules on all nodes, o2ib0 will always be preferred. Through the health feature, LNet detects when this network, or peers on it, are unreachable and starts using tcp0.&lt;/p&gt;

&lt;p&gt;This feature is built on the Multi-Rail discovery feature, so discovery should be enabled to allow the nodes to associate both interfaces with the same peer.&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i02q5b:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>