<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:56:38 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-12901] Failing to create a properly sized IB queue pair</title>
                <link>https://jira.whamcloud.com/browse/LU-12901</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Attempting to bring up a file system in our test bed with the latest lustre version (2.13) I saw this new error on LNet bring up.&lt;/p&gt;

&lt;p&gt;[ 472.738363] LNet: 8481:0:(o2iblnd_cb.c:3395:kiblnd_check_conns()) Timed out tx for 10.37.248.232@o2ib1: 471 seconds&lt;br/&gt;
[ 473.739295] LNetError: 2014:0:(o2iblnd.c:929:kiblnd_create_conn()) Can&apos;t create QP: -12, send_wr: 16317, recv_wr: 128, send_sge: 2, recv_sge: 1&lt;/p&gt;

&lt;p&gt;I found I can lower the peer_credits to get around this but that is not the proper fix.&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;</description>
                <environment>Seen with newer Mellanox ConnectX-4 devices</environment>
        <key id="57232">LU-12901</key>
            <summary>Failing to create a properly sized IB queue pair</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="ssmirnov">Serguei Smirnov</assignee>
                                    <reporter username="simmonsja">James A Simmons</reporter>
                        <labels>
                    </labels>
                <created>Wed, 23 Oct 2019 18:15:32 +0000</created>
                <updated>Mon, 20 Dec 2021 21:03:35 +0000</updated>
                            <resolved>Sun, 13 Dec 2020 15:44:59 +0000</resolved>
                                    <version>Lustre 2.13.0</version>
                                    <fixVersion>Lustre 2.14.0</fixVersion>
                                        <due></due>
                            <votes>1</votes>
                                    <watches>11</watches>
                                                                            <comments>
                            <comment id="256983" author="ssmirnov" created="Wed, 23 Oct 2019 20:55:13 +0000"  >&lt;p&gt;Hi &lt;a href=&quot;https://jira.whamcloud.com/secure/ViewProfile.jspa?name=simmonsja&quot; class=&quot;user-hover&quot; rel=&quot;simmonsja&quot;&gt;simmonsja&lt;/a&gt;,&lt;/p&gt;

&lt;p&gt;A few questions:&lt;/p&gt;

&lt;p&gt;Do you see it happen with MLNX 5?&lt;/p&gt;

&lt;p&gt;Do you see the issue occur immediately at start-up?&lt;/p&gt;

&lt;p&gt;Do you see the issue occur on a server with any number of client connections?&lt;/p&gt;

&lt;p&gt;Thanks,&lt;/p&gt;

&lt;p&gt;Serguei.&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;</comment>
                            <comment id="257847" author="nathan.crawford@uci.edu" created="Wed, 6 Nov 2019 19:27:03 +0000"  >&lt;p&gt;Also seeing with Lustre 2.12.3 client on CentOS 7.6 (3.10.0-957.27.2.el7.x86_64). With both ConnectX-4 and -5 on client nodes.&lt;/p&gt;

&lt;p&gt;Server nodes are running Lustre 2.10.6 on CentOS 7.5 (3.10.0-862.14.4.el7.x86_64).&#160; Servers are also mixed Mellanox EDR and Intel/Qlogic QDR.&lt;/p&gt;

&lt;p&gt;All nodes are using the in-box RDMA drivers.&lt;/p&gt;

&lt;p&gt;As most client nodes are still on Intel QDR, we previously set the ib0 peer_credits to 62 across all nodes. Lowering peer_credits to 42 on the problematic clients allows mounting of file system.&#160;&lt;/p&gt;</comment>
                            <comment id="257848" author="simmonsja" created="Wed, 6 Nov 2019 19:28:27 +0000"  >&lt;p&gt;Sorry we are also having issues with our IB switch so currently I&apos;m not using our IB network.&lt;/p&gt;</comment>
                            <comment id="261700" author="knweiss" created="Thu, 23 Jan 2020 11:26:03 +0000"  >&lt;p&gt;I also still see this on a Lustre client v2_12_3-98-g6db0c4f082 (+&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-12637&quot; title=&quot;Support RHEL 8.1&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-12637&quot;&gt;&lt;del&gt;LU-12637&lt;/del&gt;&lt;/a&gt; patch) on CentOS 8.1 (4.18.0-147.3.1.el8_1.x86_64) with ConnectX-4 using the CentOS RDMA drivers (mlx5_core) . I can also confirm that peer_credits=42 (instead of 63) works.&lt;/p&gt;</comment>
                            <comment id="263072" author="mneff" created="Tue, 11 Feb 2020 14:28:16 +0000"  >&lt;p&gt;I also see this on a Lustre client with Centos7.7 and ConnectX6 using Mellanox OFED 4.7&lt;/p&gt;</comment>
                            <comment id="268812" author="aeonjeff" created="Tue, 28 Apr 2020 21:16:38 +0000"  >&lt;p&gt;I see this issue as well. CentOS 7.8, Lustre 2.13.0, MOFED 5.0-2.1.8, ConnectX6. Setting peer_credits to 128 fails as described. Lowering peer_credits to 48 results in functioning lnet.&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;</comment>
                            <comment id="273726" author="knweiss" created="Thu, 25 Jun 2020 12:44:35 +0000"  >&lt;p&gt;May I suggest to change the &quot;&lt;b&gt;Affects Version/s&lt;/b&gt;&quot; attribute of this bug from 2.13.0 to 2.12.x (including 2.12.5 which is a LTS release). See e.g. the comments here or the reports on lustre-discuss.&lt;/p&gt;</comment>
                            <comment id="285932" author="gerrit" created="Tue, 24 Nov 2020 20:18:22 +0000"  >&lt;p&gt;Serguei Smirnov (ssmirnov@whamcloud.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/40748&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/40748&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-12901&quot; title=&quot;Failing to create a properly sized IB queue pair&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-12901&quot;&gt;&lt;del&gt;LU-12901&lt;/del&gt;&lt;/a&gt; o2iblnd: retry qp creation with reduced queue depth&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: fe4fcd922196355b08981d9015f1635c88904fd3&lt;/p&gt;</comment>
                            <comment id="287420" author="gerrit" created="Sun, 13 Dec 2020 08:23:12 +0000"  >&lt;p&gt;Oleg Drokin (green@whamcloud.com) merged in patch &lt;a href=&quot;https://review.whamcloud.com/40748/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/40748/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-12901&quot; title=&quot;Failing to create a properly sized IB queue pair&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-12901&quot;&gt;&lt;del&gt;LU-12901&lt;/del&gt;&lt;/a&gt; o2iblnd: retry qp creation with reduced queue depth&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 8a3ef5713cc4aed1ac7bd3ce177895caa597cc4c&lt;/p&gt;</comment>
                            <comment id="287434" author="pjones" created="Sun, 13 Dec 2020 15:44:59 +0000"  >&lt;p&gt;Landed for 2.14&lt;/p&gt;</comment>
                            <comment id="321247" author="gerrit" created="Mon, 20 Dec 2021 21:03:35 +0000"  >&lt;p&gt;&quot;Etienne AUJAMES &amp;lt;eaujames@ddn.com&amp;gt;&quot; uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/45901&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/45901&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-12901&quot; title=&quot;Failing to create a properly sized IB queue pair&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-12901&quot;&gt;&lt;del&gt;LU-12901&lt;/del&gt;&lt;/a&gt; o2iblnd: retry qp creation with reduced queue depth&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: b2_12&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 9e0736f2306286f2f2c653c4e06c17d2201d1c0f&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="49201">LU-10213</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="32012">LU-7124</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i00oin:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>