<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:26:30 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-9472] FastReg (MLX5) support breaks when map_on_demand &gt; 0</title>
                <link>https://jira.whamcloud.com/browse/LU-9472</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;When building against MODED 4, the default for map_on_demand switches from 0 to 256. &#160;This is breaking MLX5-based cards which make use of the FastReg support in ko2iblnd. &#160;There are three problems with FastReg which need to be fixed:&lt;/p&gt;
&lt;ol&gt;
	&lt;li&gt;In kiblnd_fmr_pool_map() when using elements from the fpo_pool_list, if&#160;the list runs out, the current code is setting rc to -EBUSY when it should be -EAGAIN. &#160;EAGAIN triggers the pool to be made bigger. &#160;EBUSY just fails the transfer and connection (not what we want).&lt;/li&gt;
	&lt;li&gt;Even after I fix the setting of rc in number 1, bringing down the network via &quot;lctl network down&quot; trips this assert:&#160;&lt;br/&gt;
[ 1172.255552] LNetError: 10176:0:(o2iblnd.c:1421:kiblnd_destroy_fmr_pool()) ASSERTION( fpo-&amp;gt;fpo_map_count == 0 ) failed:&#160;&lt;/li&gt;
&lt;/ol&gt;


&lt;ol&gt;
	&lt;li&gt;Every time the pool size is increased, I keep seeing this annoying log (with neterror on):&#160;&lt;br/&gt;
May&#160; 9 00:22:26 trevis-407 kernel: LNet: Using FastReg for registration&lt;/li&gt;
&lt;/ol&gt;


&lt;p&gt;The first 2 items are blockers and must be fixed ASAP. &#160;The 3rd might as well be addressed at the same time.&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;</description>
                <environment></environment>
        <key id="45954">LU-9472</key>
            <summary>FastReg (MLX5) support breaks when map_on_demand &gt; 0</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="2" iconUrl="https://jira.whamcloud.com/images/icons/priorities/critical.svg">Critical</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="doug">Doug Oucharek</assignee>
                                    <reporter username="doug">Doug Oucharek</reporter>
                        <labels>
                            <label>lnet</label>
                    </labels>
                <created>Tue, 9 May 2017 05:18:39 +0000</created>
                <updated>Fri, 8 Sep 2017 15:11:44 +0000</updated>
                            <resolved>Sat, 20 May 2017 19:23:41 +0000</resolved>
                                    <version>Lustre 2.10.0</version>
                    <version>Upstream</version>
                                    <fixVersion>Lustre 2.10.0</fixVersion>
                    <fixVersion>Upstream</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>13</watches>
                                                                            <comments>
                            <comment id="195154" author="doug" created="Tue, 9 May 2017 18:03:36 +0000"  >&lt;p&gt;The main problem, it turns out, is that the unmap routine is never being called for FastReg. &#160;As such, we have to keep growing the pool and assert when trying to shut down networking (because pool items are leaking).&lt;/p&gt;</comment>
                            <comment id="195155" author="simmonsja" created="Tue, 9 May 2017 18:10:25 +0000"  >&lt;p&gt;Thanks for finding this.&lt;/p&gt;</comment>
                            <comment id="195156" author="gerrit" created="Tue, 9 May 2017 18:12:37 +0000"  >&lt;p&gt;Doug Oucharek (doug.s.oucharek@intel.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/27015&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/27015&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9472&quot; title=&quot;FastReg (MLX5) support breaks when map_on_demand &amp;gt; 0&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9472&quot;&gt;&lt;del&gt;LU-9472&lt;/del&gt;&lt;/a&gt; lnd: Fix FastReg map/unmap for MLX5&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: a6a1d45a72360b5cc7e9e3a65c7456fa62c19192&lt;/p&gt;</comment>
                            <comment id="195797" author="shadow" created="Mon, 15 May 2017 12:10:43 +0000"  >&lt;p&gt;Tested fix and it work for me.&lt;/p&gt;</comment>
                            <comment id="196545" author="gerrit" created="Sat, 20 May 2017 18:44:06 +0000"  >&lt;p&gt;Oleg Drokin (oleg.drokin@intel.com) merged in patch &lt;a href=&quot;https://review.whamcloud.com/27015/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/27015/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9472&quot; title=&quot;FastReg (MLX5) support breaks when map_on_demand &amp;gt; 0&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9472&quot;&gt;&lt;del&gt;LU-9472&lt;/del&gt;&lt;/a&gt; lnd: Fix FastReg map/unmap for MLX5&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: b436c75d9488222190de8b30f56d720f8ec63d6f&lt;/p&gt;</comment>
                            <comment id="196555" author="pjones" created="Sat, 20 May 2017 19:23:41 +0000"  >&lt;p&gt;Landed for 2.10&lt;/p&gt;</comment>
                            <comment id="205694" author="dougo" created="Thu, 17 Aug 2017 23:04:11 +0000"  >&lt;p&gt;Has this been pushed upstream yet?&lt;/p&gt;</comment>
                            <comment id="205699" author="simmonsja" created="Thu, 17 Aug 2017 23:25:39 +0000"  >&lt;p&gt;Not yet.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                                                <inwardlinks description="is duplicated by">
                                        <issuelink>
            <issuekey id="48069">LU-9932</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="46078">LU-9500</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="28577">LU-6215</issuekey>
        </issuelink>
                            </outwardlinks>
                                                                <inwardlinks description="is related to">
                                                        </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzzc67:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>