<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:55:54 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-12816] LBUG: (niobuf.c:350:ptlrpc_register_bulk()) ASSERTION( !(desc-&gt;bd_registered &amp;&amp; req-&gt;rq_send_state != LUSTRE_IMP_REPLAY) || mbits != desc-&gt;bd_last_mbits )</title>
                <link>https://jira.whamcloud.com/browse/LU-12816</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;c0-0c0s14n3 LustreError: 7380:0:(niobuf.c:350:ptlrpc_register_bulk()) LBUG
c0-0c0s14n3 Pid: 7380, comm: ptlrpcd_01_49
c0-0c0s14n3 Call Trace:
c0-0c0s14n3 [&amp;lt;ffffffff81008efc&amp;gt;] try_stack_unwind+0x17c/0x190
c0-0c0s14n3 [&amp;lt;ffffffff81007e84&amp;gt;] dump_trace+0x64/0x380
c0-0c0s14n3 [&amp;lt;ffffffffa025476e&amp;gt;] libcfs_call_trace+0x4e/0x60 [libcfs]
c0-0c0s14n3 [&amp;lt;ffffffffa0254e75&amp;gt;] lbug_with_loc+0x45/0xb0 [libcfs]
c0-0c0s14n3 [&amp;lt;ffffffffa0a0ed32&amp;gt;] ptlrpc_register_bulk+0x822/0x950 [ptlrpc]
c0-0c0s14n3 [&amp;lt;ffffffffa0a0f765&amp;gt;] ptl_send_rpc+0x215/0xd40 [ptlrpc]
c0-0c0s14n3 [&amp;lt;ffffffffa0a0561d&amp;gt;] ptlrpc_send_new_req+0x42d/0x9d0 [ptlrpc]
c0-0c0s14n3 [&amp;lt;ffffffffa0a077b8&amp;gt;] ptlrpc_check_set+0x8a8/0x2c70 [ptlrpc]
c0-0c0s14n3 [&amp;lt;ffffffffa0a33f2a&amp;gt;] ptlrpcd_check+0x3aa/0x5b0 [ptlrpc]
c0-0c0s14n3 [&amp;lt;ffffffffa0a342fc&amp;gt;] ptlrpcd+0x1cc/0x4c0 [ptlrpc]
c0-0c0s14n3 [&amp;lt;ffffffff810775b6&amp;gt;] kthread+0xd6/0xf0
c0-0c0s14n3 [&amp;lt;ffffffff8152690f&amp;gt;] ret_from_fork+0x3f/0x70
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;This is the same fundamental problem as &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10643&quot; title=&quot;ASSERTION( !(desc-&amp;gt;bd_registered &amp;amp;&amp;amp; req-&amp;gt;rq_send_state != LUSTRE_IMP_REPLAY) || mbits != desc-&amp;gt;bd_last_mbits )&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10643&quot;&gt;&lt;del&gt;LU-10643&lt;/del&gt;&lt;/a&gt;. If LNetMEAttach fails with an ENOMEM error, ptl_send_rpc() fails mid-processing and must cleanup the work it has done before the client tries to send the rpc again. The ptl_send_rpc path makes two calls to LNetMEAttach in the case of bulk reads and writes. &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10643&quot; title=&quot;ASSERTION( !(desc-&amp;gt;bd_registered &amp;amp;&amp;amp; req-&amp;gt;rq_send_state != LUSTRE_IMP_REPLAY) || mbits != desc-&amp;gt;bd_last_mbits )&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10643&quot;&gt;&lt;del&gt;LU-10643&lt;/del&gt;&lt;/a&gt; addresses an ENOMEM after the first call. This bug is the result of an ENOMEM after the second call.&lt;/p&gt;

&lt;p&gt;The assertion fails because desc-&amp;gt;bd_registered is true.&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;LustreError: 7380:0:(niobuf.c:350:ptlrpc_register_bulk()) ASSERTION
( !(desc-&amp;gt;bd_registered &amp;amp;&amp;amp; req-&amp;gt;rq_send_state != LUSTRE_IMP_REPLAY) || mbits != desc-&amp;gt;bd_last_mbits ) failed: 
registered: 1 rq_mbits: 1636629211272768 bd_last_mbits: 1636629211272768

crash_x86_64&amp;gt; ptlrpc_request ffff88298086dc40 | grep send_state
      cr_send_state = LUSTRE_IMP_FULL,
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;Error scenario: An attempt is made to send a bulk rpc under low memory conditions. ptl_send_rpc() successfully calls ptlrpc_register_bulk(), which attaches the request buffer and sets bd_registered. ptl_send_rpc() then tries to attach the reply buffer. But this fails with an ENOMEM error. The cleanup path does not reset bd_registered, so when the next attempt is made to send the rpc the assert is triggered in ptlrpc_register_bulk().&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;ptl_send_rpc:
....
        ptlrpc_register_bulk:
               sets bd_registered
               LNetMEAttach(request buffer)   &amp;lt;--- CAST-16472 fixes ENOMEM error  handling

        if reply expected:
               LNetMEAttach(reply buffer)
               if ENOMEM
                      got cleanup_bulk
....
cleanup_bulk:
         ptlrpc_unregister_bulk()      &amp;lt;--- doesn&apos;t reset bd_registered
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</description>
                <environment></environment>
        <key id="57021">LU-12816</key>
            <summary>LBUG: (niobuf.c:350:ptlrpc_register_bulk()) ASSERTION( !(desc-&gt;bd_registered &amp;&amp; req-&gt;rq_send_state != LUSTRE_IMP_REPLAY) || mbits != desc-&gt;bd_last_mbits )</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="amk">Ann Koehler</assignee>
                                    <reporter username="amk">Ann Koehler</reporter>
                        <labels>
                    </labels>
                <created>Fri, 27 Sep 2019 15:26:41 +0000</created>
                <updated>Sat, 2 May 2020 15:32:12 +0000</updated>
                            <resolved>Fri, 17 Apr 2020 17:52:31 +0000</resolved>
                                    <version>Lustre 2.12.1</version>
                                    <fixVersion>Lustre 2.14.0</fixVersion>
                    <fixVersion>Lustre 2.12.5</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                            <comments>
                            <comment id="255494" author="gerrit" created="Fri, 27 Sep 2019 16:04:50 +0000"  >&lt;p&gt;Ann Koehler (amk@cray.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/36309&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/36309&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-12816&quot; title=&quot;LBUG: (niobuf.c:350:ptlrpc_register_bulk()) ASSERTION( !(desc-&amp;gt;bd_registered &amp;amp;&amp;amp; req-&amp;gt;rq_send_state != LUSTRE_IMP_REPLAY) || mbits != desc-&amp;gt;bd_last_mbits )&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-12816&quot;&gt;&lt;del&gt;LU-12816&lt;/del&gt;&lt;/a&gt; ptlrpc: ptlrpc_register_bulk LBUG on ENOMEM&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: e75c8ae1f2ba20d96ff2b36b0c4c4e451feab8ea&lt;/p&gt;</comment>
                            <comment id="259297" author="gerrit" created="Fri, 6 Dec 2019 01:09:21 +0000"  >&lt;p&gt;Oleg Drokin (green@whamcloud.com) merged in patch &lt;a href=&quot;https://review.whamcloud.com/36309/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/36309/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-12816&quot; title=&quot;LBUG: (niobuf.c:350:ptlrpc_register_bulk()) ASSERTION( !(desc-&amp;gt;bd_registered &amp;amp;&amp;amp; req-&amp;gt;rq_send_state != LUSTRE_IMP_REPLAY) || mbits != desc-&amp;gt;bd_last_mbits )&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-12816&quot;&gt;&lt;del&gt;LU-12816&lt;/del&gt;&lt;/a&gt; ptlrpc: ptlrpc_register_bulk LBUG on ENOMEM&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: e6225c07ce4c0037a127a41b2bc539364dfd1f4d&lt;/p&gt;</comment>
                            <comment id="259474" author="spitzcor" created="Mon, 9 Dec 2019 17:06:58 +0000"  >&lt;p&gt;Thanks for landing this fix.  Sadly, Ann has retired and she won&apos;t be able to close this bug herself.  Best wishes, Ann!&lt;/p&gt;</comment>
                            <comment id="267911" author="gerrit" created="Fri, 17 Apr 2020 17:59:24 +0000"  >&lt;p&gt;Minh Diep (mdiep@whamcloud.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/38266&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/38266&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-12816&quot; title=&quot;LBUG: (niobuf.c:350:ptlrpc_register_bulk()) ASSERTION( !(desc-&amp;gt;bd_registered &amp;amp;&amp;amp; req-&amp;gt;rq_send_state != LUSTRE_IMP_REPLAY) || mbits != desc-&amp;gt;bd_last_mbits )&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-12816&quot;&gt;&lt;del&gt;LU-12816&lt;/del&gt;&lt;/a&gt; ptlrpc: ptlrpc_register_bulk LBUG on ENOMEM&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: b2_12&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 691c8c42a4f9c0b0fb283e4095cd880f4cc4ecd6&lt;/p&gt;</comment>
                            <comment id="269074" author="gerrit" created="Fri, 1 May 2020 04:33:22 +0000"  >&lt;p&gt;Oleg Drokin (green@whamcloud.com) merged in patch &lt;a href=&quot;https://review.whamcloud.com/38266/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/38266/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-12816&quot; title=&quot;LBUG: (niobuf.c:350:ptlrpc_register_bulk()) ASSERTION( !(desc-&amp;gt;bd_registered &amp;amp;&amp;amp; req-&amp;gt;rq_send_state != LUSTRE_IMP_REPLAY) || mbits != desc-&amp;gt;bd_last_mbits )&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-12816&quot;&gt;&lt;del&gt;LU-12816&lt;/del&gt;&lt;/a&gt; ptlrpc: ptlrpc_register_bulk LBUG on ENOMEM&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: b2_12&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 1679e79cf103dabe25ffa88af720842011fbc628&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                                        </outwardlinks>
                                                                <inwardlinks description="is related to">
                                        <issuelink>
            <issuekey id="59023">LU-13509</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i00nfz:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>