<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:38:31 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-10825] Configuring multi-rail with a large number of nodes</title>
                <link>https://jira.whamcloud.com/browse/LU-10825</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Recently, we prepare to deployment a lustre with multi-rail, but i don&apos;t known how to enable dynamic discovery.&lt;/p&gt;

&lt;p&gt;We use lustre-2.10.3, it seems dynamic discovery is implementd in version 2.11.&lt;/p&gt;

&lt;p&gt;We have about 2 mgs/mds, 6 oss and 512 client nodes, how to configure static multi-rail with a large number of nodes ?&lt;/p&gt;</description>
                <environment></environment>
        <key id="51411">LU-10825</key>
            <summary>Configuring multi-rail with a large number of nodes</summary>
                <type id="4" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11310&amp;avatarType=issuetype">Improvement</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="ashehata">Amir Shehata</assignee>
                                    <reporter username="wutz">Taizeng Wu</reporter>
                        <labels>
                    </labels>
                <created>Mon, 19 Mar 2018 13:01:48 +0000</created>
                <updated>Mon, 20 Aug 2018 15:40:37 +0000</updated>
                            <resolved>Wed, 2 May 2018 04:06:25 +0000</resolved>
                                    <version>Lustre 2.10.3</version>
                                    <fixVersion>Lustre 2.12.0</fixVersion>
                    <fixVersion>Lustre 2.10.5</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>6</watches>
                                                                            <comments>
                            <comment id="223953" author="ashehata" created="Mon, 19 Mar 2018 16:55:56 +0000"  >&lt;p&gt;Can you explain to me your MR deployment? Do all nodes have multiple interfaces? or the clients only? servers only?&lt;/p&gt;</comment>
                            <comment id="224004" author="wutz" created="Tue, 20 Mar 2018 01:26:03 +0000"  >&lt;p&gt;All nodes have two interfaces (1 Mellanox FDR card with two port).&lt;/p&gt;

&lt;p&gt;I am trying to configure remote peer to include oss and clients on the mds/mgs nodes, remote peer to include mds/mgs and client on the oss nodes, remote peer to include mds/mgs and oss on the client nodes. Is this configuration correct ?&lt;/p&gt;

&lt;p&gt;Then i mkfs or mount lustre to use MR&apos;s primary nid.&lt;/p&gt;

&lt;p&gt;When i turn down a interface on server node, i found lustre filesystem hung sometimes.&#160;&lt;/p&gt;

&lt;p&gt;This issue may be caused by ARP (&#160;&lt;a href=&quot;http://wiki.lustre.org/LNet_Router_Config_Guide#ARP_flux_issue_for_MR_node&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://wiki.lustre.org/LNet_Router_Config_Guide#ARP_flux_issue_for_MR_node&lt;/a&gt;).&lt;/p&gt;

&lt;p&gt;I follow the ARP flux guide to configure, but lustre filesystem still hung sometimes when turn down a interface (dmesg report &quot;Request set has failed due to network error&quot; about node which turn down a interface).&lt;/p&gt;

&lt;p&gt;&#8212;&lt;/p&gt;

&lt;p&gt;Servers OS Version: RHEL 7.4&lt;/p&gt;

&lt;p&gt;Clients OS Version: RHEL 6.8&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;</comment>
                            <comment id="224055" author="ashehata" created="Tue, 20 Mar 2018 17:38:51 +0000"  >&lt;p&gt;I&apos;m currently working on a patch to make it easier to configure large systems without Dynamic Discovery.&lt;/p&gt;

&lt;p&gt;But for now you&apos;ll need to configure the servers to know about the client&apos;s interfaces and you&apos;ll need to configure the clients to know about the server&apos;s interfaces. And since the OSS/MGS communicate you&apos;ll need to configure these to know about each other&apos;s interfaces.&lt;/p&gt;

&lt;p&gt;MR doesn&apos;t handle interface down cases. If you intentionally (or unintentionally) bring down an interface, it will interfere with the file system operations as you&apos;ve seen. We&apos;re currently working on a feature, LNet Health, which will be able to handle this particular interface failures.&lt;/p&gt;</comment>
                            <comment id="224590" author="gerrit" created="Tue, 27 Mar 2018 01:07:20 +0000"  >&lt;p&gt;Amir Shehata (amir.shehata@intel.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/31785&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/31785&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10825&quot; title=&quot;Configuring multi-rail with a large number of nodes&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10825&quot;&gt;&lt;del&gt;LU-10825&lt;/del&gt;&lt;/a&gt; libcfs: generate ip addresses&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: eb88bfc56451ad790445daf1d5be303915b596b9&lt;/p&gt;</comment>
                            <comment id="224591" author="gerrit" created="Tue, 27 Mar 2018 01:07:21 +0000"  >&lt;p&gt;Amir Shehata (amir.shehata@intel.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/31786&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/31786&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10825&quot; title=&quot;Configuring multi-rail with a large number of nodes&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10825&quot;&gt;&lt;del&gt;LU-10825&lt;/del&gt;&lt;/a&gt; lnet: add ip2nets syntax handling for peer&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 3edcd387af4028e93f5d4df9caed9a36539ccbf6&lt;/p&gt;</comment>
                            <comment id="227037" author="gerrit" created="Wed, 2 May 2018 02:22:46 +0000"  >&lt;p&gt;Oleg Drokin (oleg.drokin@intel.com) merged in patch &lt;a href=&quot;https://review.whamcloud.com/31785/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/31785/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10825&quot; title=&quot;Configuring multi-rail with a large number of nodes&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10825&quot;&gt;&lt;del&gt;LU-10825&lt;/del&gt;&lt;/a&gt; libcfs: generate ip addresses&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 4c5f788397213aa41356df1f96f7ade58653973a&lt;/p&gt;</comment>
                            <comment id="227038" author="gerrit" created="Wed, 2 May 2018 02:22:53 +0000"  >&lt;p&gt;Oleg Drokin (oleg.drokin@intel.com) merged in patch &lt;a href=&quot;https://review.whamcloud.com/31786/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/31786/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10825&quot; title=&quot;Configuring multi-rail with a large number of nodes&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10825&quot;&gt;&lt;del&gt;LU-10825&lt;/del&gt;&lt;/a&gt; lnet: add ip2nets syntax handling for peer&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 70c95457f6836a9c0a9e95ae0c4bdd20f99a8747&lt;/p&gt;</comment>
                            <comment id="227064" author="pjones" created="Wed, 2 May 2018 04:06:25 +0000"  >&lt;p&gt;Landed for 2.12&lt;/p&gt;</comment>
                            <comment id="227113" author="gerrit" created="Wed, 2 May 2018 15:12:47 +0000"  >&lt;p&gt;Minh Diep (minh.diep@intel.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/32249&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/32249&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10825&quot; title=&quot;Configuring multi-rail with a large number of nodes&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10825&quot;&gt;&lt;del&gt;LU-10825&lt;/del&gt;&lt;/a&gt; libcfs: generate ip addresses&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: b2_10&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: a22011c6d2cd804413b5f7e8353b687fb742a495&lt;/p&gt;</comment>
                            <comment id="227115" author="gerrit" created="Wed, 2 May 2018 15:56:30 +0000"  >&lt;p&gt;Minh Diep (minh.diep@intel.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/32250&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/32250&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10825&quot; title=&quot;Configuring multi-rail with a large number of nodes&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10825&quot;&gt;&lt;del&gt;LU-10825&lt;/del&gt;&lt;/a&gt; lnet: add ip2nets syntax handling for peer&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: b2_10&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: fc78b0ee95a2ee85121e84e5d104f5c268aae26f&lt;/p&gt;</comment>
                            <comment id="231246" author="gerrit" created="Wed, 1 Aug 2018 16:35:03 +0000"  >&lt;p&gt;John L. Hammond (jhammond@whamcloud.com) merged in patch &lt;a href=&quot;https://review.whamcloud.com/32249/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/32249/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10825&quot; title=&quot;Configuring multi-rail with a large number of nodes&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10825&quot;&gt;&lt;del&gt;LU-10825&lt;/del&gt;&lt;/a&gt; libcfs: generate ip addresses&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: b2_10&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: a48dc3fd0f738b545571a6d2cfdeb337f2d3243b&lt;/p&gt;</comment>
                            <comment id="231247" author="gerrit" created="Wed, 1 Aug 2018 16:35:17 +0000"  >&lt;p&gt;John L. Hammond (jhammond@whamcloud.com) merged in patch &lt;a href=&quot;https://review.whamcloud.com/32250/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/32250/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10825&quot; title=&quot;Configuring multi-rail with a large number of nodes&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10825&quot;&gt;&lt;del&gt;LU-10825&lt;/del&gt;&lt;/a&gt; lnet: add ip2nets syntax handling for peer&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: b2_10&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: e124f39b6b4dd56780ba4490b81dca32ab08575c&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzzuj3:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>