<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:30:48 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-9958] Create striped directory fail  in 2.10(with LU-9500 patch) </title>
                <link>https://jira.whamcloud.com/browse/LU-9958</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Hi,&lt;br/&gt;
I am trying to use the OFED 4.0 driver with Lustre 2.10 and the &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9500&quot; title=&quot;MOFED 4/mlx5: Aligning non-aligned page addresses trigger dump_cqe&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9500&quot;&gt;&lt;del&gt;LU-9500&lt;/del&gt;&lt;/a&gt; patch (&lt;a href=&quot;https://review.whamcloud.com/#/c/28237/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/#/c/28237/&lt;/a&gt;), but creating a striped directory fails with an error.&lt;br/&gt;
In &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9461&quot; title=&quot;lustre client mount fail after update IB driver and Lustre patch.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9461&quot;&gt;&lt;del&gt;LU-9461&lt;/del&gt;&lt;/a&gt;, I was told that Lustre 2.9 needs the &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9026&quot; title=&quot;Adapt to the removal of ib_get_dma_mr()&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9026&quot;&gt;&lt;del&gt;LU-9026&lt;/del&gt;&lt;/a&gt;/&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9472&quot; title=&quot;FastReg (MLX5) support breaks when map_on_demand &amp;gt; 0&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9472&quot;&gt;&lt;del&gt;LU-9472&lt;/del&gt;&lt;/a&gt;/&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9500&quot; title=&quot;MOFED 4/mlx5: Aligning non-aligned page addresses trigger dump_cqe&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9500&quot;&gt;&lt;del&gt;LU-9500&lt;/del&gt;&lt;/a&gt; patches applied.&lt;br/&gt;
So I tested OFED 4.0 with Lustre 2.10 plus the &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9500&quot; title=&quot;MOFED 4/mlx5: Aligning non-aligned page addresses trigger dump_cqe&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9500&quot;&gt;&lt;del&gt;LU-9500&lt;/del&gt;&lt;/a&gt; patch (&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9026&quot; title=&quot;Adapt to the removal of ib_get_dma_mr()&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9026&quot;&gt;&lt;del&gt;LU-9026&lt;/del&gt;&lt;/a&gt;/&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9472&quot; title=&quot;FastReg (MLX5) support breaks when map_on_demand &amp;gt; 0&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9472&quot;&gt;&lt;del&gt;LU-9472&lt;/del&gt;&lt;/a&gt; are in Lustre 2.10).&lt;br/&gt;
Should I apply any other patch for this issue? Thanks.&lt;/p&gt;

&lt;p&gt;// The two MDTs must be on different servers&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;root@hsm client&amp;#93;&lt;/span&gt;# lfs mkdir -c 2 dir1&lt;br/&gt;
error on LL_IOC_LMV_SETSTRIPE &apos;dir1&apos; (3): Input/output error&lt;br/&gt;
error: mkdir: create stripe dir &apos;dir1&apos; failed&lt;/p&gt;
</description>
                <environment>Lustre 2.10 (lustre-release-58fd06e) + &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9500&quot; title=&quot;MOFED 4/mlx5: Aligning non-aligned page addresses trigger dump_cqe&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9500&quot;&gt;&lt;strike&gt;LU-9500&lt;/strike&gt;&lt;/a&gt; patch  for OFED4.0&lt;br/&gt;
Mellanox IB EDR + MLNX_OFED_LINUX-4.0-2.0.0.1-rhel7.3-x86_64.tgz&lt;br/&gt;
Test with 2 MDS servers (1 MDT/Server)</environment>
        <key id="48229">LU-9958</key>
            <summary>Create striped directory fail  in 2.10(with LU-9500 patch) </summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="2" iconUrl="https://jira.whamcloud.com/images/icons/priorities/critical.svg">Critical</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="5">Cannot Reproduce</resolution>
                                        <assignee username="laisiyao">Lai Siyao</assignee>
                                    <reporter username="sebg-crd-pm">sebg-crd-pm</reporter>
                        <labels>
                    </labels>
                <created>Fri, 8 Sep 2017 11:37:59 +0000</created>
                <updated>Sat, 14 Oct 2017 00:20:10 +0000</updated>
                            <resolved>Thu, 5 Oct 2017 05:34:00 +0000</resolved>
                                                                        <due></due>
                            <votes>0</votes>
                                    <watches>5</watches>
                                                                            <comments>
                            <comment id="207926" author="bhoagland" created="Fri, 8 Sep 2017 17:21:03 +0000"  >&lt;p&gt;Hello,&lt;/p&gt;

&lt;p&gt;Please attach the entire log for us to review.&lt;/p&gt;

&lt;p&gt;Thanks,&lt;/p&gt;

&lt;p&gt;Brad&lt;/p&gt;</comment>
                            <comment id="208021" author="sebg-crd-pm" created="Mon, 11 Sep 2017 08:25:52 +0000"  >&lt;p&gt;FYI&lt;/p&gt;

&lt;p&gt;lfs mkdir -c 2 /mnt/client/dir3&lt;br/&gt;
error on LL_IOC_LMV_SETSTRIPE &apos;dir3&apos; (3): Input/output error&lt;br/&gt;
error: mkdir: create stripe dir &apos;dir3&apos; failed&lt;/p&gt;

&lt;p&gt;See the attached files: &lt;span class=&quot;nobr&quot;&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/attachment/28256/28256_mdt1.log&quot; title=&quot;mdt1.log attached to LU-9958&quot;&gt;mdt1.log&lt;sup&gt;&lt;img class=&quot;rendericon&quot; src=&quot;https://jira.whamcloud.com/images/icons/link_attachment_7.gif&quot; height=&quot;7&quot; width=&quot;7&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/sup&gt;&lt;/a&gt;&lt;/span&gt; &lt;span class=&quot;nobr&quot;&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/attachment/28257/28257_mdt0.log&quot; title=&quot;mdt0.log attached to LU-9958&quot;&gt;mdt0.log&lt;sup&gt;&lt;img class=&quot;rendericon&quot; src=&quot;https://jira.whamcloud.com/images/icons/link_attachment_7.gif&quot; height=&quot;7&quot; width=&quot;7&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/sup&gt;&lt;/a&gt;&lt;/span&gt; &lt;span class=&quot;nobr&quot;&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/attachment/28258/28258_client.log&quot; title=&quot;client.log attached to LU-9958&quot;&gt;client.log&lt;sup&gt;&lt;img class=&quot;rendericon&quot; src=&quot;https://jira.whamcloud.com/images/icons/link_attachment_7.gif&quot; height=&quot;7&quot; width=&quot;7&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/sup&gt;&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
</comment>
                            <comment id="208192" author="sebg-crd-pm" created="Wed, 13 Sep 2017 02:53:59 +0000"  >&lt;p&gt;Hi Brad,&lt;/p&gt;

&lt;p&gt;Do you have any update after reviewing logs?&lt;/p&gt;

&lt;p&gt;Thanks!&lt;/p&gt;</comment>
                            <comment id="208382" author="pjones" created="Thu, 14 Sep 2017 17:30:40 +0000"  >&lt;p&gt;Lai&lt;/p&gt;

&lt;p&gt;Can you please advise on this one?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="208482" author="laisiyao" created="Fri, 15 Sep 2017 11:18:01 +0000"  >&lt;p&gt;in mdt1.log:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;00010000:00000001:6.0:1505117481.059504:0:5182:0:(ldlm_lib.c:3268:target_bulk_io()) Process leaving (rc=18446744073709551506 : -110 : ffffffffffffff92)
00000020:00000001:6.0:1505117481.059508:0:5182:0:(out_handler.c:982:out_handle()) Process leaving via out_free (rc=18446744073709547410 : -4206 : 0xffffffffffffef92)
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;which caused the following on mdt0:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;00000004:00000001:9.0:1505117529.647439:0:3485:0:(osp_trans.c:1204:osp_send_update_req()) Process leaving (rc=18446744073709551506 : -110 : ffffffffffffff92)
...
00000020:00000001:1.0:1505117529.647589:0:3442:0:(update_trans.c:1091:top_trans_stop()) Process leaving (rc=18446744073709551611 : -5 : fffffffffffffffb)
...
00000004:00000001:1.0:1505117529.647779:0:3442:0:(mdt_reint.c:526:mdt_create()) Process leaving via put_child (rc=18446744073709551611 : -5 : 0xfffffffffffffffb)
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;It looks like network I/O on mdt1 timed out. Could you verify that the network on mdt1 is working correctly?&lt;/p&gt;</comment>
                            <comment id="208723" author="sebg-crd-pm" created="Tue, 19 Sep 2017 07:00:32 +0000"  >&lt;p&gt;I tested this bug again on 2.10.1-RC1 (with no additional patches).&lt;/p&gt;

&lt;p&gt;It looks like network I/O on mdt1 timed out. Could you verify that the network on mdt1 is working correctly?&lt;br/&gt;
=&amp;gt; All OSP states look normal:&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;mdt1 server&amp;#93;&lt;/span&gt;&lt;br/&gt;
./osp/jlustre-MDT0000-osp-MDT0001/state:current_state: FULL&lt;br/&gt;
./osp/jlustre-MDT0000-osp-MDT0001/import:    state: FULL&lt;br/&gt;
./osp/jlustre-OST0000-osc-MDT0001/state:current_state: FULL&lt;br/&gt;
./osp/jlustre-OST0000-osc-MDT0001/import:    state: FULL&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;mdt0 server&amp;#93;&lt;/span&gt;&lt;br/&gt;
./osp/jlustre-MDT0001-osp-MDT0000/state:current_state: FULL&lt;br/&gt;
./osp/jlustre-MDT0001-osp-MDT0000/import:    state: FULL&lt;br/&gt;
./osp/jlustre-OST0000-osc-MDT0000/state:current_state: FULL&lt;br/&gt;
./osp/jlustre-OST0000-osc-MDT0000/import:    state: FULL&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;/var/log/messages on the mdt0 server&amp;#93;&lt;/span&gt;&lt;br/&gt;
Sep 19 02:50:14 ossb2 kernel: LNetError: 21764:0:(o2iblnd.c:1940:kiblnd_fmr_pool_map()) Failed to map mr 10/11 elements&lt;br/&gt;
Sep 19 02:50:14 ossb2 kernel: LNetError: 21764:0:(o2iblnd_cb.c:560:kiblnd_fmr_map_tx()) Can&apos;t map 41033 pages: -22&lt;br/&gt;
Sep 19 02:50:14 ossb2 kernel: LNetError: 21764:0:(o2iblnd_cb.c:1554:kiblnd_send()) Can&apos;t setup GET sink for 172.20.110.209@o2ib: -22&lt;br/&gt;
Sep 19 02:50:14 ossb2 kernel: LustreError: 21764:0:(events.c:449:server_bulk_callback()) event type 5, status -5, desc ffff88086ea2e400&lt;br/&gt;
Sep 19 02:51:54 ossb2 kernel: LustreError: 21764:0:(ldlm_lib.c:3237:target_bulk_io()) @@@ timeout on bulk WRITE after 100+0s  req@ffff880457208c50 x1578948605516272/t0(0) o1000-&amp;gt;jlustre-MDT0001-mdtlov_UUID@172.20.110.209@o2ib:210/0 lens 376/0 e 4 to 0 dl 1505803920 ref 1 fl Interpret:/0/ffffffff rc 0/-1&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;/var/log/messages on the mdt1 server&amp;#93;&lt;/span&gt;&lt;br/&gt;
Sep 19 14:51:22 ossb1 kernel: LustreError: 11-0: jlustre-MDT0000-osp-MDT0001: operation out_update to node 172.20.110.210@o2ib failed: rc = -110&lt;br/&gt;
Sep 19 14:51:22 ossb1 kernel: LustreError: 31069:0:(layout.c:2085:__req_capsule_get()) @@@ Wrong buffer for field `object_update_reply&apos; (1 of 1) in format `OUT_UPDATE&apos;: 0 vs. 4096 (server)#012  req@ffff8807d3aa7800 x1578948605516272/t0(0) o1000-&amp;gt;jlustre-MDT0000-osp-MDT0001@172.20.110.210@o2ib:24/4 lens 376/192 e 4 to 0 dl 1505803889 ref 2 fl Interpret:ReM/0/0 rc -110/-110&lt;br/&gt;
Sep 19 14:51:24 ossb1 kernel: LustreError: 30780:0:(llog_cat.c:773:llog_cat_cancel_records()) jlustre-MDT0000-osp-MDT0001: fail to cancel 1 of 1 llog&lt;/p&gt;
</comment>
                            <comment id="208727" author="sebg-crd-pm" created="Tue, 19 Sep 2017 08:38:10 +0000"  >&lt;p&gt;It looks like network I/O on mdt1 timed out. Could you verify that the network on mdt1 is working correctly?&lt;br/&gt;
=&amp;gt; How can I verify that the network on mdt1 is working correctly? Could you give me any suggestions? Thanks.&lt;/p&gt;</comment>
                            <comment id="209796" author="sebg-crd-pm" created="Thu, 28 Sep 2017 01:10:58 +0000"  >&lt;p&gt;Hi Lai,&lt;/p&gt;

&lt;p&gt;Do you need more detailed logs, or have you already reproduced it at your site? Thanks.&lt;/p&gt;</comment>
                            <comment id="209801" author="laisiyao" created="Thu, 28 Sep 2017 02:15:01 +0000"  >&lt;p&gt;Can you run &apos;lfs mkdir -i 1 dir1&apos; to test creating a remote directory?&lt;/p&gt;</comment>
                            <comment id="210048" author="sebg-crd-pm" created="Mon, 2 Oct 2017 01:44:25 +0000"  >&lt;p&gt;Creating a remote directory =&amp;gt; fails&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;root@robin client&amp;#93;&lt;/span&gt;# lfs mkdir -i 0 dir0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;root@robin client&amp;#93;&lt;/span&gt;# lfs mkdir -i 1 dir1&lt;br/&gt;
error on LL_IOC_LMV_SETSTRIPE &apos;dir1&apos; (3): Input/output error&lt;br/&gt;
error: mkdir: create stripe dir &apos;dir1&apos; failed&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;root@robin client&amp;#93;&lt;/span&gt;# lfs mkdir -c 2 dir2&lt;br/&gt;
error on LL_IOC_LMV_SETSTRIPE &apos;dir2&apos; (3): Input/output error&lt;br/&gt;
error: mkdir: create stripe dir &apos;dir2&apos; failed&lt;/p&gt;</comment>
                            <comment id="210154" author="sebg-crd-pm" created="Tue, 3 Oct 2017 00:59:58 +0000"  >&lt;p&gt;I have also tested that creating a striped directory succeeds when the two MDTs are on the same server (messages are transferred via the loopback device).&lt;br/&gt;
So I guess something goes wrong when the MDTs transfer messages over IB.&lt;/p&gt;

&lt;p&gt;Any update? Thanks.&lt;/p&gt;</comment>
                            <comment id="210350" author="sebg-crd-pm" created="Thu, 5 Oct 2017 03:08:25 +0000"  >&lt;p&gt;This bug cannot be reproduced in release 2.10.1.&lt;/p&gt;</comment>
                            <comment id="210355" author="pjones" created="Thu, 5 Oct 2017 05:34:00 +0000"  >&lt;p&gt;Good news - thanks&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                                                <inwardlinks description="is duplicated by">
                                        <issuelink>
            <issuekey id="48390">LU-10010</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                            <attachment id="28258" name="client.log" size="3887298" author="sebg-crd-pm" created="Mon, 11 Sep 2017 08:24:53 +0000"/>
                            <attachment id="28257" name="mdt0.log" size="4710279" author="sebg-crd-pm" created="Mon, 11 Sep 2017 08:24:54 +0000"/>
                            <attachment id="28256" name="mdt1.log" size="4428732" author="sebg-crd-pm" created="Mon, 11 Sep 2017 08:24:54 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzzjtb:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>