<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:02:41 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-6723] Setting map_on_demand for o2iblnd driver prevents lustre bring up.</title>
                <link>https://jira.whamcloud.com/browse/LU-6723</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;While testing map_on_demand settings with the patch from &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-3322&quot; title=&quot;ko2iblnd support for different map_on_demand and peer_credits between systems&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-3322&quot;&gt;&lt;del&gt;LU-3322&lt;/del&gt;&lt;/a&gt;, I discovered that setting map_on_demand to any value on our Cray routers prevented Lustre from functioning. This looks like a bug in the o2iblnd driver that only shows up on our Cray nodes.&lt;/p&gt;</description>
                <environment>Cray routers running SLES11 SP3. This issue exists for all Lustre versions.</environment>
        <key id="30667">LU-6723</key>
            <summary>Setting map_on_demand for o2iblnd driver prevents lustre bring up.</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="2">Won&apos;t Fix</resolution>
                                        <assignee username="ashehata">Amir Shehata</assignee>
                                    <reporter username="simmonsja">James A Simmons</reporter>
                        <labels>
                            <label>lnet</label>
                    </labels>
                <created>Mon, 15 Jun 2015 18:59:29 +0000</created>
                <updated>Wed, 16 Dec 2015 20:45:17 +0000</updated>
                            <resolved>Wed, 16 Dec 2015 20:45:17 +0000</resolved>
                                    <version>Lustre 2.7.0</version>
                    <version>Lustre 2.8.0</version>
                    <version>Lustre 2.5.4</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>8</watches>
                                                                            <comments>
                            <comment id="118575" author="simmonsja" created="Mon, 15 Jun 2015 19:00:10 +0000"  >&lt;p&gt;As a note, this also happens when the patch from &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-3322&quot; title=&quot;ko2iblnd support for different map_on_demand and peer_credits between systems&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-3322&quot;&gt;&lt;del&gt;LU-3322&lt;/del&gt;&lt;/a&gt; is not applied.&lt;/p&gt;</comment>
                            <comment id="118701" author="adilger" created="Tue, 16 Jun 2015 17:12:40 +0000"  >&lt;p&gt;James, could you please provide a bit more information about what you mean by &quot;Cray routers prevented Lustre from functioning&quot;?  Any errors in the logs?  Does &quot;lctl ping&quot; work? Does the IB-level network testing still work?&lt;/p&gt;

&lt;p&gt;Is Cray running a customized OFED?  It may be that this isn&apos;t a Lustre/LNet problem at all.&lt;/p&gt;</comment>
                            <comment id="118867" author="simmonsja" created="Wed, 17 Jun 2015 19:07:44 +0000"  >&lt;p&gt;These are the errors that appear on the OSS nodes when I enable map_on_demand on the Cray routers.&lt;/p&gt;

&lt;p&gt;00000020:02000400:10.0:1433178825.928974:0:28309:0:(tgt_handler.c:1834:tgt_brw_read()) sultan-OST0034: Bulk IO read error with b9cf5051-0ff9-6cf9-cd67-9364a2516176 (at 30@gni1), client will retry: rc -110&lt;br/&gt;
00000020:00000001:10.0:1433178825.947041:0:28309:0:(tgt_handler.c:1851:tgt_brw_read()) Process leaving (rc=18446744073709551506 : -110 : ffffffffffffff92)&lt;br/&gt;
00010000:00000080:10.0:1433178825.947043:0:28309:0:(ldlm_lib.c:2427:target_committed_to_req()) @@@ not sending l&lt;/p&gt;

&lt;p&gt;The Cray routers are using the mlx5 driver from the OFED 3.12 stack. To pin down what the problem is, I need to collect logs from the routers so we know what is really going on. The OSS bulk timeouts are a symptom of the real problem.&lt;/p&gt;</comment>
                            <comment id="119270" author="simmonsja" created="Mon, 22 Jun 2015 20:56:23 +0000"  >&lt;p&gt;As a small note, the OSS that also had problems when map_on_demand was enabled was running RHEL6.5 with the default distro InfiniBand stack. So it is not an InfiniBand issue.&lt;/p&gt;</comment>
                            <comment id="129800" author="yujian" created="Thu, 8 Oct 2015 08:09:04 +0000"  >&lt;p&gt;Hi James,&lt;br/&gt;
Does the issue in this ticket still exist?&lt;/p&gt;</comment>
                            <comment id="129821" author="simmonsja" created="Thu, 8 Oct 2015 13:19:18 +0000"  >&lt;p&gt;I haven&apos;t tried in a while. Will do.&lt;/p&gt;</comment>
                            <comment id="136565" author="simmonsja" created="Wed, 16 Dec 2015 16:47:12 +0000"  >&lt;p&gt;Just tried it. Now that the OFED stack has been updated to a newer 3.12, the mlx5 driver no longer supports FMR, so this issue has gone away. I will be trying the &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-5783&quot; title=&quot;o2iblnd: investigate new memory registration mechanisms&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-5783&quot;&gt;&lt;del&gt;LU-5783&lt;/del&gt;&lt;/a&gt; work very soon on our Cray routers to see if I hit memory issues. If so, the bugs can be reported under that ticket. You can close this ticket.&lt;/p&gt;</comment>
                            <comment id="136616" author="yujian" created="Wed, 16 Dec 2015 20:45:17 +0000"  >&lt;p&gt;Thank you James.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                                                <inwardlinks description="is duplicated by">
                                        <issuelink>
            <issuekey id="30752">LU-6748</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="30752">LU-6748</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzxfsf:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>