<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:33:31 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-17207] LNet: race between monitor thread stopping and discovery thread PUSH may cause a crash</title>
                <link>https://jira.whamcloud.com/browse/LU-17207</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;This crash showed up in Janitor testing for &lt;a href=&quot;https://review.whamcloud.com/#/c/fs/lustre-release/+/52522/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/#/c/fs/lustre-release/+/52522/&lt;/a&gt; (&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-17103&quot; title=&quot;sanity-lnet test_207: timed out&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-17103&quot;&gt;&lt;del&gt;LU-17103&lt;/del&gt;&lt;/a&gt;):&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;
Crash with latest lustre function lnet_attach_rsp_tracker in backtrace called here:
BUG: unable to handle kernel NULL pointer dereference at           (&lt;span class=&quot;code-keyword&quot;&gt;null&lt;/span&gt;)
IP: [&amp;lt;ffffffffa01e3acc&amp;gt;] lnet_attach_rsp_tracker.isra.29+0xcc/0x1a0 [lnet]
PGD 0
Oops: 0000 [#1] SMP DEBUG_PAGEALLOC
Modules linked in: ksocklnd(OE) lnet(OE) libcfs(OE) veth crc32_generic crc_t10dif crct10dif_generic crct10dif_common rpcsec_gss_krb5 squashfs pcspkr i2c_piix4 i2c_core binfmt_misc ip_tables ext4 mbcache jbd2 ata_generic pata_acpi ata_piix serio_raw libata [last unloaded: libcfs]
CPU: 3 PID: 8711 Comm: lnet_discovery Kdump: loaded Tainted: G           OE  ------------   3.10.0-7.9-debug #1
Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.16.2-1.fc38 04/01/2014
task: ffff8800aa96d550 ti: ffff8800b4348000 task.ti: ffff8800b4348000
RIP: 0010:[&amp;lt;ffffffffa01e3acc&amp;gt;]  [&amp;lt;ffffffffa01e3acc&amp;gt;] lnet_attach_rsp_tracker.isra.29+0xcc/0x1a0 [lnet]
RSP: 0018:ffff8800b434bcb8  EFLAGS: 00010282
RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000017
RDX: 0000000000000001 RSI: 0000000000000000 RDI: ffff8800b639d380
RBP: ffff8800b434bce8 R08: 00000000d5555b28 R09: 632e65766f6d2d62
R10: 0000000000000180 R11: ffff8800b434bb86 R12: ffff8800b639d380
R13: ffff8800b639d380 R14: 0000000000000000 R15: 0000007175e68a80
FS:  0000000000000000(0000) GS:ffff88013e380000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 00000000b6238000 CR4: 00000000000006e0
Call Trace:
 [&amp;lt;ffffffffa01eaf8a&amp;gt;] LNetPut+0x29a/0x9c0 [lnet]
 [&amp;lt;ffffffffa01fd7bc&amp;gt;] lnet_peer_send_push+0x2ec/0x440 [lnet]
 [&amp;lt;ffffffffa0206760&amp;gt;] ? lnet_discovery_event_reply+0xc70/0xc70 [lnet]
 [&amp;lt;ffffffffa0208488&amp;gt;] lnet_peer_discovery+0x4a8/0x1710 [lnet]
 [&amp;lt;ffffffff817e8dce&amp;gt;] ? _raw_spin_unlock_irq+0xe/0x30
 [&amp;lt;ffffffff817e60fa&amp;gt;] ? __schedule+0x32a/0x7d0
 [&amp;lt;ffffffff810bb2a0&amp;gt;] ? wake_up_atomic_t+0x30/0x30
 [&amp;lt;ffffffffa0207fe0&amp;gt;] ? lnet_peer_merge_data+0x1230/0x1230 [lnet]
 [&amp;lt;ffffffff810ba114&amp;gt;] kthread+0xe4/0xf0
 [&amp;lt;ffffffff810ba030&amp;gt;] ? kthread_create_on_node+0x140/0x140
 [&amp;lt;ffffffff817f3e5d&amp;gt;] ret_from_fork_nospec_begin+0x7/0x21
 [&amp;lt;ffffffff810ba030&amp;gt;] ? kthread_create_on_node+0x140/0x140&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;As pointed out by Chris Horn, this may be the result of the discovery thread issuing a push while the monitor thread is stopping:&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;
&lt;span class=&quot;code-quote&quot;&gt;&quot;Monitor thread is stopping and the ln_mt_resendqs are freed. Discovery then wakes and tries to issue push which attempts to dereference ln_mt_resendqs&quot;&lt;/span&gt;&lt;/pre&gt;
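&lt;p&gt;A minimal toy model of the race described in the quote, written in Python rather than kernel C. Every name in it (ToyLNet, monitor_stop, push_unguarded, push_guarded) is hypothetical and only mirrors the roles named above. It is not LNet code and does not represent the landed patch. It assumes a single lock shared by both paths, which is one possible way to close such a window.&lt;/p&gt;

```python
import threading


class ToyLNet:
    """Toy stand-in for the LNet state named in the report (hypothetical model)."""

    def __init__(self, ncpts=4):
        self.lock = threading.Lock()
        # Plays the role of ln_mt_resendqs: one resend queue per CPT.
        self.mt_resendqs = [[] for _ in range(ncpts)]

    def monitor_stop(self):
        # Monitor thread shutdown: the queues are freed.
        with self.lock:
            self.mt_resendqs = None

    def push_unguarded(self, cpt, msg):
        # Discovery thread PUSH with no check: fails once the queues are gone,
        # the Python analogue of the NULL pointer dereference in the trace.
        self.mt_resendqs[cpt].append(msg)

    def push_guarded(self, cpt, msg):
        # Check under the same lock that monitor_stop takes, and back off
        # instead of touching freed state.
        with self.lock:
            if self.mt_resendqs is None:
                return False
            self.mt_resendqs[cpt].append(msg)
            return True


def demo():
    net = ToyLNet()
    started = net.push_guarded(0, "push-1")  # monitor still running
    net.monitor_stop()                       # monitor thread stops first
    try:
        net.push_unguarded(0, "push-2")      # discovery wakes up afterwards
        crashed = False
    except TypeError:
        crashed = True
    refused = not net.push_guarded(0, "push-3")
    return started, crashed, refused
```

&lt;p&gt;Calling demo() drives the three cases in order: a push while the monitor is running, an unguarded push after the queues are freed (the TypeError stands in for the NULL dereference), and a guarded push that is refused instead of crashing.&lt;/p&gt;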
&lt;/div&gt;&lt;/div&gt;</description>
                <environment></environment>
        <key id="78447">LU-17207</key>
            <summary>LNet: race between monitor thread stopping and discovery thread PUSH may cause a crash</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="ssmirnov">Serguei Smirnov</assignee>
                                    <reporter username="ssmirnov">Serguei Smirnov</reporter>
                        <labels>
                    </labels>
                <created>Tue, 17 Oct 2023 18:25:45 +0000</created>
                <updated>Thu, 9 Nov 2023 08:39:25 +0000</updated>
                            <resolved>Thu, 9 Nov 2023 00:45:16 +0000</resolved>
                                                    <fixVersion>Lustre 2.16.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>5</watches>
                                                                            <comments>
                            <comment id="389655" author="gerrit" created="Tue, 17 Oct 2023 19:23:54 +0000"  >&lt;p&gt;&quot;Serguei Smirnov &amp;lt;ssmirnov@whamcloud.com&amp;gt;&quot; uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/c/fs/lustre-release/+/52734&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/c/fs/lustre-release/+/52734&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-17207&quot; title=&quot;LNet: race between monitor thread stopping and discovery thread PUSH may cause a crash&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-17207&quot;&gt;&lt;del&gt;LU-17207&lt;/del&gt;&lt;/a&gt; lnet: race b/w monitor thr stop and discovery push&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 58f00213c811a05d1a96b44ef17b608c3fb883f6&lt;/p&gt;</comment>
                            <comment id="392310" author="gerrit" created="Wed, 8 Nov 2023 22:03:57 +0000"  >&lt;p&gt;&quot;Oleg Drokin &amp;lt;green@whamcloud.com&amp;gt;&quot; merged in patch &lt;a href=&quot;https://review.whamcloud.com/c/fs/lustre-release/+/52734/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/c/fs/lustre-release/+/52734/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-17207&quot; title=&quot;LNet: race between monitor thread stopping and discovery thread PUSH may cause a crash&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-17207&quot;&gt;&lt;del&gt;LU-17207&lt;/del&gt;&lt;/a&gt; lnet: race b/w monitor thr stop and discovery push&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 36b14a23a6e8240045074b097adfe01cb529d4a3&lt;/p&gt;</comment>
                            <comment id="392356" author="pjones" created="Thu, 9 Nov 2023 00:45:16 +0000"  >&lt;p&gt;Landed for 2.16&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="77823">LU-17103</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i03yp3:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>