<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:10:17 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-14499] o2iblnd: LU-13368 changes cause shutdown procedure to not complete</title>
                <link>https://jira.whamcloud.com/browse/LU-14499</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Changes applied by the patches from &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-13368&quot; title=&quot;lnet may be trying to use deleted routes leading to errors kiblnd_rejected(): 10.0.10.212@o2ib7 rejected: consumer defined fatal error&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-13368&quot;&gt;&lt;del&gt;LU-13368&lt;/del&gt;&lt;/a&gt; appear to be causing the o2iblnd shutdown procedure to not complete properly sometimes on lustre_rmmod:&lt;/p&gt;

&lt;p&gt;In that case, messages similar to the following keep showing up in the log:&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;
[51025.354675] LNet: 9402:0:(o2iblnd.c:3107:kiblnd_shutdown()) 10.1.11.124@o2ib10: waiting &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; 3 peers to disconnect
[51029.354481] LNet: 9402:0:(o2iblnd.c:3107:kiblnd_shutdown()) 10.1.11.124@o2ib10: waiting &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; 3 peers to disconnect
[51037.353971] LNet: 9402:0:(o2iblnd.c:3107:kiblnd_shutdown()) 10.1.11.124@o2ib10: waiting &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; 3 peers to disconnect
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;&#160;&lt;/p&gt;</description>
                <environment></environment>
        <key id="63237">LU-14499</key>
            <summary>o2iblnd: LU-13368 changes cause shutdown procedure to not complete</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="1" iconUrl="https://jira.whamcloud.com/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="default"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="ssmirnov">Serguei Smirnov</assignee>
                                    <reporter username="ssmirnov">Serguei Smirnov</reporter>
                        <labels>
                    </labels>
                <created>Mon, 8 Mar 2021 17:37:11 +0000</created>
                <updated>Mon, 30 Jan 2023 17:25:46 +0000</updated>
                                                                                <due></due>
                            <votes>0</votes>
                                    <watches>8</watches>
                                                                            <comments>
                            <comment id="294257" author="gerrit" created="Mon, 8 Mar 2021 17:53:39 +0000"  >&lt;p&gt;Serguei Smirnov (ssmirnov@whamcloud.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/41937&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/41937&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-14499&quot; title=&quot;o2iblnd: LU-13368 changes cause shutdown procedure to not complete&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-14499&quot;&gt;LU-14499&lt;/a&gt; lnet: Revert &quot;&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-13368&quot; title=&quot;lnet may be trying to use deleted routes leading to errors kiblnd_rejected(): 10.0.10.212@o2ib7 rejected: consumer defined fatal error&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-13368&quot;&gt;&lt;del&gt;LU-13368&lt;/del&gt;&lt;/a&gt; lnet: discard the callback&quot;&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: eda619ef352141de76b3bc2fe97d56ed68c7c9d9&lt;/p&gt;</comment>
                            <comment id="326262" author="hornc" created="Mon, 14 Feb 2022 19:10:07 +0000"  >&lt;p&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/ViewProfile.jspa?name=ssmirnov&quot; class=&quot;user-hover&quot; rel=&quot;ssmirnov&quot;&gt;ssmirnov&lt;/a&gt; could this issue impact ksocklnd as well?&lt;/p&gt;</comment>
                            <comment id="326302" author="ssmirnov" created="Mon, 14 Feb 2022 22:08:04 +0000"  >&lt;p&gt;Chris,&lt;/p&gt;

&lt;p&gt;Despite concluding that &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-13368&quot; title=&quot;lnet may be trying to use deleted routes leading to errors kiblnd_rejected(): 10.0.10.212@o2ib7 rejected: consumer defined fatal error&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-13368&quot;&gt;&lt;del&gt;LU-13368&lt;/del&gt;&lt;/a&gt; patch is causing the issue, I never had complete understanding of what was going wrong exactly. It could be specific to o2iblnd only. I don&apos;t think I recall socklnd getting stuck in the same manner.&#160;&lt;/p&gt;</comment>
                            <comment id="344303" author="hornc" created="Mon, 22 Aug 2022 21:21:12 +0000"  >&lt;p&gt;We traced a memory leak back to the &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-13368&quot; title=&quot;lnet may be trying to use deleted routes leading to errors kiblnd_rejected(): 10.0.10.212@o2ib7 rejected: consumer defined fatal error&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-13368&quot;&gt;&lt;del&gt;LU-13368&lt;/del&gt;&lt;/a&gt; change. Given Alexey&apos;s prior misgivings about the patch, and its known bugginess, I think we should revert it.&lt;/p&gt;</comment>
                            <comment id="358847" author="ofaaland" created="Thu, 12 Jan 2023 19:22:19 +0000"  >&lt;p&gt;Hi Serguei,&lt;br/&gt;
Is this stuck because you need more information?&lt;br/&gt;
thanks,&lt;/p&gt;</comment>
                            <comment id="358858" author="ssmirnov" created="Thu, 12 Jan 2023 20:37:55 +0000"  >&lt;p&gt;Hi Olaf,&lt;/p&gt;

&lt;p&gt;From comments in &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-13368&quot; title=&quot;lnet may be trying to use deleted routes leading to errors kiblnd_rejected(): 10.0.10.212@o2ib7 rejected: consumer defined fatal error&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-13368&quot;&gt;&lt;del&gt;LU-13368&lt;/del&gt;&lt;/a&gt; and this ticket, it looks like &quot;lnet: discard the callback&quot; change should be reverted. On the other hand, there were potential fixes supplied by Yang Sheng which didn&apos;t get tested. If I remember correctly, this got stuck pending the test results, which would help decide whether to revert the change, or keep it and add the fixes.&#160;&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/ViewProfile.jspa?name=ys&quot; class=&quot;user-hover&quot; rel=&quot;ys&quot;&gt;ys&lt;/a&gt;, &lt;a href=&quot;https://jira.whamcloud.com/secure/ViewProfile.jspa?name=sihara&quot; class=&quot;user-hover&quot; rel=&quot;sihara&quot;&gt;sihara&lt;/a&gt;: is my understanding correct?&#160;&lt;/p&gt;

&lt;p&gt;Thanks,&lt;/p&gt;

&lt;p&gt;Serguei.&#160;&lt;/p&gt;</comment>
                            <comment id="359003" author="ys" created="Fri, 13 Jan 2023 16:49:48 +0000"  >&lt;p&gt;Hi, Serguei, Yes, you are right. &lt;/p&gt;</comment>
                            <comment id="359008" author="ofaaland" created="Fri, 13 Jan 2023 17:08:26 +0000"  >&lt;p&gt;What are the gerrit URLs for those changes?  Thanks.&lt;/p&gt;</comment>
                            <comment id="359225" author="ssmirnov" created="Mon, 16 Jan 2023 22:57:24 +0000"  >&lt;p&gt;Hi Olaf,&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/ViewProfile.jspa?name=ys&quot; class=&quot;user-hover&quot; rel=&quot;ys&quot;&gt;ys&lt;/a&gt; will correct me if I&apos;m wrong, but I believe these are the two changes which are supposed to be fixing the original &quot;discard the callback&quot;:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://review.whamcloud.com/#/c/fs/lustre-release/+/40937/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/#/c/fs/lustre-release/+/40937/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://review.whamcloud.com/#/c/fs/lustre-release/+/41970/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/#/c/fs/lustre-release/+/41970/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Thanks,&lt;/p&gt;

&lt;p&gt;Serguei.&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;</comment>
                            <comment id="359244" author="ys" created="Tue, 17 Jan 2023 03:19:45 +0000"  >&lt;p&gt;Sorry for the delay. Yes, Serguei is right. &lt;br/&gt;
The &lt;a href=&quot;https://review.whamcloud.com/#/c/fs/lustre-release/+/38845/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/#/c/fs/lustre-release/+/38845/&lt;/a&gt; is original patch. &lt;br/&gt;
The &lt;a href=&quot;https://review.whamcloud.com/#/c/fs/lustre-release/+/40937/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/#/c/fs/lustre-release/+/40937/&lt;/a&gt; is a patch to work with 38845 to provide full function.&lt;br/&gt;
The &lt;a href=&quot;https://review.whamcloud.com/#/c/fs/lustre-release/+/41970/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/#/c/fs/lustre-release/+/41970/&lt;/a&gt; is a bug fixing patch for this ticket. Since i think it should be tested first, So i mark it as a &apos;test patch&apos;. &lt;/p&gt;
</comment>
                            <comment id="360100" author="ofaaland" created="Mon, 23 Jan 2023 20:09:15 +0000"  >&lt;p&gt;Hi Serguei and Yang Sheng,&lt;/p&gt;

&lt;p&gt;Thanks for clarifying.  It looks like changes 40937 and 41970 aren&apos;t progressing.  Are you waiting on something?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;</comment>
                            <comment id="360872" author="ssmirnov" created="Mon, 30 Jan 2023 17:25:46 +0000"  >&lt;p&gt;Hi Olaf, the problem here appears to be that even though the patches are code-complete and Maloo-tested, we&apos;re not able to verify Yang Sheng&apos;s fixes in a proper IB environment as Shuichi doesn&apos;t have the available resources. Would you be able to give these patches a try on your system?&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="59459">LU-13638</issuekey>
        </issuelink>
                            </outwardlinks>
                                                                <inwardlinks description="is related to">
                                        <issuelink>
            <issuekey id="58410">LU-13368</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i01otj:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>