<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:26:24 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-9461] lustre client mount fail after update IB driver and Lustre patch.</title>
                <link>https://jira.whamcloud.com/browse/LU-9461</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;The lustre client mount fail after update IB driver and Lustre client patch(&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9026&quot; title=&quot;Adapt to the removal of ib_get_dma_mr()&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9026&quot;&gt;&lt;del&gt;LU-9026&lt;/del&gt;&lt;/a&gt;).&lt;br/&gt;
Should I apply any other patch for new IB driver?&lt;/p&gt;

&lt;p&gt;mount fail error mesage&lt;br/&gt;
[ 5713.280039] LNet: Using FastReg for registration&lt;br/&gt;
[ 5713.370689] LNet: Added LNI 192.168.2.220@o2ib0 &lt;span class=&quot;error&quot;&gt;&amp;#91;8/256/0/180&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 5736.543149] LNetError: 0:0:(o2iblnd_cb.c:3436:kiblnd_qp_event()) 192.168.2.201@o2ib0: Async QP event type 3&lt;br/&gt;
[ 5743.539710] Lustre: 15524:0:(client.c:2111:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: &lt;span class=&quot;error&quot;&gt;&amp;#91;sent 1493897510/real 1493897510&amp;#93;&lt;/span&gt;  req@ffff881011ef0300 x1566465051328608/t0(0) o503-&amp;gt;MGC192.168.2.201@o2ib0@192.168.2.201@o2ib0:26/25 lens 272/8416 e 0 to 1 dl 1493897517 ref 2 fl Rpc:X/0/ffffffff rc 0/-1&lt;br/&gt;
[ 5743.539728] LustreError: 166-1: MGC192.168.2.201@o2ib0: Connection to MGS (at 192.168.2.201@o2ib0) was lost; in progress operations using this service will fail&lt;br/&gt;
[ 5743.539899] LustreError: 15c-8: MGC192.168.2.201@o2ib0: The configuration from log &apos;hpcfs-client&apos; failed (-5). This may be the result of communication errors between this node and the MGS, a bad configuration, or other errors. See the syslog for more information.&lt;br/&gt;
[ 5743.540282] Lustre: Unmounted hpcfs-client&lt;br/&gt;
[ 5743.544658] LustreError: 15524:0:(obd_mount.c:1449:lustre_fill_super()) Unable to mount  (-5)&lt;/p&gt;</description>
                <environment>CentOS7.3&lt;br/&gt;
Lustre 2.9.0 + cherry-picked as e4297ef38561f1e788ba73ca0c8078a09dc8c303&lt;br/&gt;
MLNX_OFED_LINUX-4.0-2.0.0.1-rhel7.3&lt;br/&gt;
IB: Mellanox ConnectX-4 adapter EDR</environment>
        <key id="45901">LU-9461</key>
            <summary>lustre client mount fail after update IB driver and Lustre patch.</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="3">Duplicate</resolution>
                                        <assignee username="ashehata">Amir Shehata</assignee>
                                    <reporter username="sebg-crd-pm">sebg-crd-pm</reporter>
                        <labels>
                            <label>llnl</label>
                    </labels>
                <created>Mon, 8 May 2017 01:46:45 +0000</created>
                <updated>Mon, 18 Sep 2017 21:29:56 +0000</updated>
                            <resolved>Wed, 30 Aug 2017 12:59:52 +0000</resolved>
                                                                        <due></due>
                            <votes>1</votes>
                                    <watches>8</watches>
                                                                            <comments>
                            <comment id="194784" author="pjones" created="Mon, 8 May 2017 04:19:54 +0000"  >&lt;p&gt;Hi there&lt;/p&gt;

&lt;p&gt;The &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9026&quot; title=&quot;Adapt to the removal of ib_get_dma_mr()&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9026&quot;&gt;&lt;del&gt;LU-9026&lt;/del&gt;&lt;/a&gt; patch will allow building with MOFED 4.0 but there are still known issues attempting to run.&lt;/p&gt;

&lt;p&gt;Doug&lt;/p&gt;

&lt;p&gt;Do you have any suggestions here&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="194999" author="doug" created="Tue, 9 May 2017 03:18:57 +0000"  >&lt;p&gt;The &quot;Async QP event type 3&quot; is a&#160;IB_EVENT_QP_ACCESS_ERR. &#160;This error will stop the connection from continuing and explains all the errors which follow in your logs.&lt;/p&gt;

&lt;p&gt;This error&#160;happens when a call to ib_create_qp() fails. &#160;It could fail if the version of the parameters being passed in is wrong (i.e. the size of the parameter structure is incorrect). &#160;This could be related to the other issue I am currently working on that involves MOFED 4. &#160;If MOFED 4 has changed the structure we use as a parameter to this call and we have not adapted to that change, we could see an error like this.&lt;/p&gt;

&lt;p&gt;Does this error happen on each mount attempt or was this a one off?&lt;/p&gt;</comment>
                            <comment id="195207" author="sebg-crd-pm" created="Wed, 10 May 2017 00:35:21 +0000"  >&lt;p&gt;This error happen on each mount attempt.(the test lustre filesystem servers OFED is 3.4)&lt;br/&gt;
And &quot;Async QP event type 3&quot; also happened when  lustre servers mount mgs/mds/....with OFED4. &lt;/p&gt;</comment>
                            <comment id="196080" author="doug" created="Tue, 16 May 2017 21:13:49 +0000"  >&lt;p&gt;I ran into this &quot;Async&quot; error at the same time as the issues I talk about in &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9500&quot; title=&quot;MOFED 4/mlx5: Aligning non-aligned page addresses trigger dump_cqe&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9500&quot;&gt;&lt;del&gt;LU-9500&lt;/del&gt;&lt;/a&gt;. &#160;They are related. &#160;When I have a fix for &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9500&quot; title=&quot;MOFED 4/mlx5: Aligning non-aligned page addresses trigger dump_cqe&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9500&quot;&gt;&lt;del&gt;LU-9500&lt;/del&gt;&lt;/a&gt;, this issue will be addressed as well.&lt;/p&gt;</comment>
                            <comment id="196122" author="lidongyang" created="Wed, 17 May 2017 06:21:01 +0000"  >&lt;p&gt;I think it&apos;s my fault.&lt;br/&gt;
 Could you try this patch on top of &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9472&quot; title=&quot;FastReg (MLX5) support breaks when map_on_demand &amp;gt; 0&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9472&quot;&gt;&lt;del&gt;LU-9472&lt;/del&gt;&lt;/a&gt; and &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9500&quot; title=&quot;MOFED 4/mlx5: Aligning non-aligned page addresses trigger dump_cqe&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9500&quot;&gt;&lt;del&gt;LU-9500&lt;/del&gt;&lt;/a&gt;?&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;diff --git a/lnet/klnds/o2iblnd/o2iblnd.c b/lnet/klnds/o2iblnd/o2iblnd.c
index 047fe3c..ba7829b 100644
--- a/lnet/klnds/o2iblnd/o2iblnd.c
+++ b/lnet/klnds/o2iblnd/o2iblnd.c
@@ -1900,8 +1900,6 @@ again:
                                        &lt;span class=&quot;code-keyword&quot;&gt;return&lt;/span&gt; n &amp;lt; 0 ? n : -EINVAL;
                                }
 
-                               mr-&amp;gt;iova = iov;
-
                                wr = &amp;amp;frd-&amp;gt;frd_fastreg_wr;
                                memset(wr, 0, sizeof(*wr));


&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="196248" author="doug" created="Wed, 17 May 2017 21:23:50 +0000"  >&lt;p&gt;I&apos;m curious as to why you would not want to set the mr-&amp;gt;iova value? &#160;Is this an unneeded step?&lt;/p&gt;</comment>
                            <comment id="196305" author="lidongyang" created="Thu, 18 May 2017 06:24:47 +0000"  >&lt;p&gt;Hi Doug,&#160;&lt;/p&gt;

&lt;p&gt;I believe the issue only applies to mlx5 cards using MOFED4.&lt;/p&gt;

&lt;p&gt;in MOFED4, mr-&amp;gt;iova is set by ib_map_mr_sg()-&amp;gt;ib_sg_to_pages()&lt;/p&gt;

&lt;p&gt;It doesn&apos;t make sense to reset mr-&amp;gt;iova after calling ib_map_mr_sg().&lt;/p&gt;

&lt;p&gt;That line of code was introduced to address an similar issue, see the comments on&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://review.whamcloud.com/#/c/19168/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/#/c/19168/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;I&apos;ve done some testing using MOFED4 + lustre-release with mlx4 cards forcing fast reg as well, so far I&apos;ve seen no problems.&lt;/p&gt;</comment>
                            <comment id="196314" author="sebg-crd-pm" created="Thu, 18 May 2017 09:04:01 +0000"  >&lt;p&gt;I can mount lustre ok (  2.9.57_69_g0bc1964 +  &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9472&quot; title=&quot;FastReg (MLX5) support breaks when map_on_demand &amp;gt; 0&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9472&quot;&gt;&lt;del&gt;LU-9472&lt;/del&gt;&lt;/a&gt; / &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9500&quot; title=&quot;MOFED 4/mlx5: Aligning non-aligned page addresses trigger dump_cqe&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9500&quot;&gt;&lt;del&gt;LU-9500&lt;/del&gt;&lt;/a&gt;  and this patch (-   mr-&amp;gt;iova = iov&lt;img class=&quot;emoticon&quot; src=&quot;https://jira.whamcloud.com/images/icons/emoticons/wink.png&quot; height=&quot;16&quot; width=&quot;16&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;  )&lt;/p&gt;

&lt;p&gt;Is it ok to apply  only &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9026&quot; title=&quot;Adapt to the removal of ib_get_dma_mr()&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9026&quot;&gt;&lt;del&gt;LU-9026&lt;/del&gt;&lt;/a&gt;  + &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9472&quot; title=&quot;FastReg (MLX5) support breaks when map_on_demand &amp;gt; 0&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9472&quot;&gt;&lt;del&gt;LU-9472&lt;/del&gt;&lt;/a&gt; / &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9500&quot; title=&quot;MOFED 4/mlx5: Aligning non-aligned page addresses trigger dump_cqe&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9500&quot;&gt;&lt;del&gt;LU-9500&lt;/del&gt;&lt;/a&gt;  and (-   mr-&amp;gt;iova = iov&lt;img class=&quot;emoticon&quot; src=&quot;https://jira.whamcloud.com/images/icons/emoticons/wink.png&quot; height=&quot;16&quot; width=&quot;16&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;  patchs to 2.9.0?&lt;br/&gt;
Sould I apply any other patchs to Lustre2.9.0 for mlx5 cards using MOFED4 ?   Thanks for your suggestion&lt;/p&gt;
</comment>
                            <comment id="196363" author="doug" created="Thu, 18 May 2017 16:21:57 +0000"  >&lt;p&gt;Li: Thank you for the information. &#160;I checked and you are right, ib_map_mr_sg() does set mr-&amp;gt;iova so that line is not needed (could cause a problem). &#160;I will update &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9500&quot; title=&quot;MOFED 4/mlx5: Aligning non-aligned page addresses trigger dump_cqe&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9500&quot;&gt;&lt;del&gt;LU-9500&lt;/del&gt;&lt;/a&gt;&#160;with a removal of that line.&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://jira.hpdd.intel.com/secure/ViewProfile.jspa?name=sebg-crd-pm&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;&#160;sebg-crd-pm&lt;/a&gt;: Correct, you only need --&lt;del&gt;&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9026&quot; title=&quot;Adapt to the removal of ib_get_dma_mr()&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9026&quot;&gt;&lt;del&gt;LU-9026&lt;/del&gt;&lt;/a&gt;&lt;/del&gt;-- + &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9500&quot; title=&quot;MOFED 4/mlx5: Aligning non-aligned page addresses trigger dump_cqe&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9500&quot;&gt;&lt;del&gt;LU-9500&lt;/del&gt;&lt;/a&gt; (with the removal of setting mr-&amp;gt;iova) + &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9472&quot; title=&quot;FastReg (MLX5) support breaks when map_on_demand &amp;gt; 0&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9472&quot;&gt;&lt;del&gt;LU-9472&lt;/del&gt;&lt;/a&gt;.&lt;/p&gt;</comment>
                            <comment id="206896" author="dinatale2" created="Tue, 29 Aug 2017 23:56:23 +0000"  >&lt;p&gt;We have just encountered this issue as well. Is it possible to have &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9026&quot; title=&quot;Adapt to the removal of ib_get_dma_mr()&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9026&quot;&gt;&lt;del&gt;LU-9026&lt;/del&gt;&lt;/a&gt;, &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9500&quot; title=&quot;MOFED 4/mlx5: Aligning non-aligned page addresses trigger dump_cqe&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9500&quot;&gt;&lt;del&gt;LU-9500&lt;/del&gt;&lt;/a&gt;, and &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9472&quot; title=&quot;FastReg (MLX5) support breaks when map_on_demand &amp;gt; 0&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9472&quot;&gt;&lt;del&gt;LU-9472&lt;/del&gt;&lt;/a&gt; backported to the lustre 2.5 and 2.8 branches?&lt;/p&gt;</comment>
                            <comment id="206923" author="pjones" created="Wed, 30 Aug 2017 12:59:36 +0000"  >&lt;p&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/ViewProfile.jspa?name=dinatale2&quot; class=&quot;user-hover&quot; rel=&quot;dinatale2&quot;&gt;dinatale2&lt;/a&gt; can you please open a new ticket to track this request?&lt;/p&gt;</comment>
                            <comment id="207865" author="sebg-crd-pm" created="Fri, 8 Sep 2017 11:12:06 +0000"  >&lt;p&gt;Hi, &lt;/p&gt;

&lt;p&gt;I got  create striped directory error  in  Lustre 2.10 with &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9500&quot; title=&quot;MOFED 4/mlx5: Aligning non-aligned page addresses trigger dump_cqe&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9500&quot;&gt;&lt;del&gt;LU-9500&lt;/del&gt;&lt;/a&gt; patch (&lt;a href=&quot;https://review.whamcloud.com/#/c/28237/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/#/c/28237/&lt;/a&gt;) for OFED4.0&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;root@hsm client&amp;#93;&lt;/span&gt;# lfs mkdir -c 2 dir1&lt;br/&gt;
error on LL_IOC_LMV_SETSTRIPE &apos;dir1&apos; (3): Input/output error&lt;br/&gt;
error: mkdir: create stripe dir &apos;dir1&apos; failed&lt;/p&gt;

&lt;p&gt;Should I apply any other patch for this issue?    Thanks&lt;/p&gt;</comment>
                            <comment id="207872" author="pjones" created="Fri, 8 Sep 2017 12:50:16 +0000"  >&lt;p&gt;This issue is being tracked under &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9958&quot; title=&quot;Create striped directory fail  in 2.10(with LU-9500 patch) &quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9958&quot;&gt;&lt;del&gt;LU-9958&lt;/del&gt;&lt;/a&gt;&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="46078">LU-9500</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzzc2f:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>