<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:31:20 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-16949] LNet: deadlock on o2ib NI going down under Centos 7.9</title>
                <link>https://jira.whamcloud.com/browse/LU-16949</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;The issue can be reproduced by adding an o2ib NI and then interrupting the corresponding link by pulling the cable or shutting down the switch connection or the whole switch.&#160;&lt;/p&gt;

&lt;p&gt;Alternatively, one can add the o2ib NI when the corresponding link is already down (cable pulled) to the same effect.&lt;/p&gt;

&lt;p&gt;Using &quot;ifdown&quot; to bring the whole interface down doesn&apos;t reproduce the problem.&#160;&lt;/p&gt;

&lt;p&gt;I could reproduce this on a Centos 7.9 VM, but not on a Centos 8.2 system.&lt;/p&gt;

&lt;p&gt;The issue got introduced by&#160;&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;
commit da230373bd14306cb97fb48748ebce205f09d468
Author: Serguei Smirnov &amp;lt;ssmirnov@whamcloud.com&amp;gt;
Date: &#160; Thu Feb 16 10:34:03 2023 -0800
LU-16563 lnet: use discovered ni status to set initial health &lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;It then got masked by another issue causing failure when trying to add an o2ib NI starting from&#160;&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;
commit cc5594df3e70d1924f34ccdf4c3178654d277ad0
Author: Shaun Tancheff &amp;lt;shaun.tancheff@hpe.com&amp;gt;
Date: &#160; Sun Apr 23 07:19:11 2023 -0500
LU-16759 o2ib: MOFED 5.5+ ib_dma_virt_map_sg&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;until some later commit which I didn&apos;t determine re-enabled adding o2iblnd NI. The latest master is behaving on 7.9 Centos as described.&lt;/p&gt;</description>
                <environment>centos 7.9 VM 3.10.0-1160.25.1.el7_lustre.x86_64 kernel&lt;br/&gt;
could not reproduce on centos 8.2</environment>
        <key id="76911">LU-16949</key>
            <summary>LNet: deadlock on o2ib NI going down under Centos 7.9</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="ssmirnov">Serguei Smirnov</assignee>
                                    <reporter username="ssmirnov">Serguei Smirnov</reporter>
                        <labels>
                            <label>lnet</label>
                            <label>o2iblnd</label>
                    </labels>
                <created>Fri, 7 Jul 2023 22:39:34 +0000</created>
                <updated>Mon, 22 Jan 2024 17:20:19 +0000</updated>
                            <resolved>Fri, 8 Sep 2023 22:35:18 +0000</resolved>
                                                    <fixVersion>Lustre 2.16.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>4</watches>
                                                                            <comments>
                            <comment id="378351" author="gerrit" created="Tue, 11 Jul 2023 22:55:05 +0000"  >&lt;p&gt;&quot;Serguei Smirnov &amp;lt;ssmirnov@whamcloud.com&amp;gt;&quot; uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/c/fs/lustre-release/+/51635&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/c/fs/lustre-release/+/51635&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-16949&quot; title=&quot;LNet: deadlock on o2ib NI going down under Centos 7.9&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-16949&quot;&gt;&lt;del&gt;LU-16949&lt;/del&gt;&lt;/a&gt; lnet: get monitor thread to update ping buffer&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 4522d02f3962130bab89a29c1fd8c393ba412faf&lt;/p&gt;</comment>
                            <comment id="381535" author="gerrit" created="Mon, 7 Aug 2023 03:50:13 +0000"  >&lt;p&gt;&quot;Oleg Drokin &amp;lt;green@whamcloud.com&amp;gt;&quot; merged in patch &lt;a href=&quot;https://review.whamcloud.com/c/fs/lustre-release/+/51635/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/c/fs/lustre-release/+/51635/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-16949&quot; title=&quot;LNet: deadlock on o2ib NI going down under Centos 7.9&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-16949&quot;&gt;&lt;del&gt;LU-16949&lt;/del&gt;&lt;/a&gt; lnet: get monitor thread to update ping buffer&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 7ac399c5aec01186ad4c9a7153aea400777c897f&lt;/p&gt;</comment>
                            <comment id="385365" author="pjones" created="Fri, 8 Sep 2023 22:35:18 +0000"  >&lt;p&gt;landed for 2.16&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                                                <inwardlinks description="is duplicated by">
                                                        </inwardlinks>
                                    </issuelinktype>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                                                <inwardlinks description="is related to">
                                                        </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i03pzr:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>