<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:33:28 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-17201] LNetError in o2iblnd.c with qib HCA under EL9.2</title>
                <link>https://jira.whamcloud.com/browse/LU-17201</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;LNet loads the tcp interface fine, but o2ib fails with these kernel messages:&lt;br/&gt;
LNetError: 701:0:(o2iblnd.c:2647:kiblnd_hdev_get_attr()) Invalid mr size: 0xffffffffffffffff&lt;br/&gt;
LNetError: 701:0:(o2iblnd.c:2880:kiblnd_dev_failover()) Can&apos;t get device attributes: -22&lt;br/&gt;
LNetError: 701:0:(o2iblnd.c:3354:kiblnd_startup()) ko2iblnd: Can&apos;t initialize device: rc = -22&lt;br/&gt;
LNetError: 105-4: Error -100 starting up LNI o2ib&lt;/p&gt;

&lt;p&gt;We are trying (perhaps over-hopefully) to get the Lustre client to work in EL9 on old QLogic/Intel True Scale InfiniBand hardware. Red Hat removed the qib module back in EL8, although it remains in the mainline kernels from kernel.org. The ELRepo repository maintains a few of these RH-deprecated kernel modules compiled against the RHEL kernel. As of kmod-ib_qib-1.11-6.el9_2.elrepo, this module actually works.&lt;/p&gt;

&lt;p&gt;The closest bug report I could find is &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10549&quot; title=&quot;Cannot start lnet with MOFED 3.4&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10549&quot;&gt;LU-10549&lt;/a&gt;, which suggests a mismatch between the actual and expected data fields reported by the module. I suspect no one has actually tried the EL9 kernel ib_qib with Lustre, considering it only started working last week.&lt;/p&gt;

&lt;p&gt;In the meantime, I&apos;ll try to swap out the EL9.2 kernel + kmod with the ELRepo-maintained kernel-lt, which includes the standard kernel.org qib module.&lt;/p&gt;

</description>
                <environment>Alma Linux 9.2&lt;br/&gt;
Kernel 5.14.0-284.30.1.el9_2.x86_64&lt;br/&gt;
kmod-ib_qib-1.11-6.el9_2.elrepo.x86_64</environment>
        <key id="78433">LU-17201</key>
            <summary>LNetError in o2iblnd.c with qib HCA under EL9.2</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="1" iconUrl="https://jira.whamcloud.com/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="default"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="nathan.crawford@uci.edu">Nathan Crawford</reporter>
                        <labels>
                    </labels>
                <created>Mon, 16 Oct 2023 21:51:18 +0000</created>
                <updated>Sat, 21 Oct 2023 03:33:45 +0000</updated>
                                            <version>Lustre 2.15.3</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                            <comments>
                            <comment id="389747" author="nscfreny" created="Wed, 18 Oct 2023 09:44:48 +0000"  >&lt;p&gt;Quick question, did you also install infinipath-psm (provides /etc/udev/rules.d/60-ipath.rules)?&lt;/p&gt;

&lt;p&gt;We did some tests late last year with Rocky 9 + ib_qib with the patch that is now in ELRepo. We had to install infinipath-psm from CentOS 7 (we failed to rebuild it for EL9). We did not try Lustre o2ib.&lt;/p&gt;</comment>
                            <comment id="390138" author="nathan.crawford@uci.edu" created="Sat, 21 Oct 2023 03:33:45 +0000"  >&lt;p&gt;We haven&apos;t tried to install the psm libs, but they weren&apos;t needed for the Lustre client on Rocky 8.&lt;/p&gt;

&lt;p&gt;The proposed workaround isn&apos;t going to work as the ELRepo kernel-lt for el9 is already too new (6.1.58). May need to dig a bit more.&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i03ylz:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>