<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:14:16 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-14964] recovery-small: GPF in llog_exist after tests finished</title>
                <link>https://jira.whamcloud.com/browse/LU-14964</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;there&apos;s a relatively new crash being observed in maloo testing on rhel8 testing for past several days in cleanup of recovery-small in review-dne-part-5&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[  753.814277] Lustre: DEBUG MARKER: == recovery-small test complete, duration 6075 sec ======= 10:32:41 (1629887561)
[  783.518588] general protection fault: 0000 [#1] SMP PTI
[  783.519513] CPU: 0 PID: 3045 Comm: mdt_rdpg00_000 Kdump: loaded Tainted: G           OE    --------- -  - 4.18.0-240.22.1.el8_lustre.x86_64 #1
[  783.521414] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
[  783.522494] RIP: 0010:llog_exist+0xd9/0x180 [obdclass]
[  783.523265] Code: c7 05 7f 0f 0c 00 01 00 00 00 e8 a2 53 ee ff 5b c3 48 85 ff 0f 84 aa 00 00 00 48 8b 87 08 01 00 00 48 85 c0 0f 84 9a 00 00 00 &amp;lt;48&amp;gt; 8b 40 50 48 85 c0 74 53 48 89 df e8 b6 b3 77 fb f6 05 2b f6 f0
[  783.526007] RSP: 0018:ffffae2500ea3ae8 EFLAGS: 00010206
[  783.526776] RAX: 5a5a5a5a5a5a5a5a RBX: ffff9f95b14bd000 RCX: 0000000000000000
[  783.527839] RDX: 0000000000000ba5 RSI: 0000000000000000 RDI: ffff9f959e040900
[  783.528890] RBP: ffff9f959091f0d0 R08: 000000d823bc83f1 R09: 0000000000000bc0
[  783.530782] R10: ffffae2500ea3ae8 R11: ffff9f95a7d08b6c R12: ffff9f95b04c2080
[  783.532083] R13: ffff9f95b10d7ec0 R14: ffff9f959091f0d0 R15: ffff9f959132c000
[  783.533175] FS:  0000000000000000(0000) GS:ffff9f95bfc00000(0000) knlGS:0000000000000000
[  783.534423] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  783.535314] CR2: 00007fd080ad6000 CR3: 000000008960a005 CR4: 00000000003606f0
[  783.536412] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  783.537505] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  783.538587] Call Trace:
[  783.539040]  llog_cat_prep_log+0x4f/0x3c0 [obdclass]
[  783.539833]  llog_cat_declare_add_rec+0x56/0x220 [obdclass]
[  783.540700]  llog_declare_add+0x187/0x1d0 [obdclass]
[  783.541925]  top_trans_start+0x212/0x940 [ptlrpc]
[  783.542820]  mdd_attr_set+0x657/0xfe0 [mdd]
[  783.543538]  ? panic_notifier+0x20/0x20 [libcfs]
[  783.544400]  mdt_mfd_close+0x56c/0x8c0 [mdt]
[  783.545087]  mdt_close_internal+0xc4/0x240 [mdt]
[  783.545820]  mdt_close+0x47d/0x8b0 [mdt]
[  783.546470]  tgt_request_handle+0xc90/0x1940 [ptlrpc]
[  783.547300]  ptlrpc_server_handle_request+0x323/0xbc0 [ptlrpc]
[  783.548246]  ptlrpc_main+0xba2/0x1490 [ptlrpc]
[  783.548964]  ? __schedule+0x2cc/0x700
[  783.549562]  ? ptlrpc_wait_event+0x500/0x500 [ptlrpc]
[  783.550380]  kthread+0x112/0x130
[  783.550894]  ? kthread_flush_work_fn+0x10/0x10
[  783.551577]  ret_from_fork+0x35/0x40
[  783.552136] Modules linked in: osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) ldiskfs(OE) lquota(OE) lustre(OE) lmv(OE) mdc(OE) lov(OE) osc(OE) fid(OE) fld(OE) dm_flakey ptlrpc_gss(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) libcfs(OE) rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache rpcrdma ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp ib_ipoib rdma_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm sunrpc ib_core intel_rapl_msr intel_rapl_common crct10dif_pclmul crc32_pclmul dm_mod ghash_clmulni_intel pcspkr joydev virtio_balloon i2c_piix4 ip_tables ext4 mbcache jbd2 ata_generic ata_piix 8139too libata 8139cp crc32c_intel serio_raw virtio_blk mii &lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;The 3 observed failures are:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://testing.whamcloud.com/test_sessions/d7bfd1a1-7ebf-40df-b80b-7e104a524f67&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sessions/d7bfd1a1-7ebf-40df-b80b-7e104a524f67&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://testing.whamcloud.com/test_sets/f93ef5ed-4963-4851-ac10-3e0477e9543c&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/f93ef5ed-4963-4851-ac10-3e0477e9543c&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;and&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://testing.whamcloud.com/test_sets/1976e1d2-dc91-45b3-b29f-691f921ddeff&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/1976e1d2-dc91-45b3-b29f-691f921ddeff&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;the RAX is suspicious value so probably accessing some free memory?&lt;/p&gt;</description>
                <environment></environment>
        <key id="65799">LU-14964</key>
            <summary>recovery-small: GPF in llog_exist after tests finished</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="3">Duplicate</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="green">Oleg Drokin</reporter>
                        <labels>
                    </labels>
                <created>Wed, 25 Aug 2021 19:44:57 +0000</created>
                <updated>Thu, 5 May 2022 14:26:23 +0000</updated>
                            <resolved>Mon, 1 Nov 2021 20:54:32 +0000</resolved>
                                                                        <due></due>
                            <votes>0</votes>
                                    <watches>4</watches>
                                                                            <comments>
                            <comment id="311196" author="adilger" created="Wed, 25 Aug 2021 20:03:29 +0000"  >&lt;p&gt;This might also relate to &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-14932&quot; title=&quot;runtests: test_1 llog_cat_cleanup()) ASSERTION( index ) on MDS&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-14932&quot;&gt;&lt;del&gt;LU-14932&lt;/del&gt;&lt;/a&gt;.&lt;/p&gt;</comment>
                            <comment id="311709" author="hornc" created="Tue, 31 Aug 2021 15:39:53 +0000"  >&lt;p&gt;+1 on master - &lt;a href=&quot;https://testing.whamcloud.com/test_sets/4824cef4-96bd-4fb8-9cbe-1e068d07016d&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/4824cef4-96bd-4fb8-9cbe-1e068d07016d&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="312514" author="adilger" created="Sat, 11 Sep 2021 01:29:31 +0000"  >&lt;p&gt;Seems very likely this is the same problem as &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-14474&quot; title=&quot;Oops in llog_cat_prep_log() in sanity-quota / recovery-small&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-14474&quot;&gt;&lt;del&gt;LU-14474&lt;/del&gt;&lt;/a&gt;.&lt;/p&gt;</comment>
                            <comment id="312515" author="adilger" created="Sat, 11 Sep 2021 01:44:17 +0000"  >&lt;p&gt;Closing this as a duplicate of &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-14474&quot; title=&quot;Oops in llog_cat_prep_log() in sanity-quota / recovery-small&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-14474&quot;&gt;&lt;del&gt;LU-14474&lt;/del&gt;&lt;/a&gt;, which predates the opening of this issue.&lt;/p&gt;</comment>
                            <comment id="312516" author="adilger" created="Sat, 11 Sep 2021 02:08:26 +0000"  >&lt;p&gt;Reopen this.  While &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-14474&quot; title=&quot;Oops in llog_cat_prep_log() in sanity-quota / recovery-small&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-14474&quot;&gt;&lt;del&gt;LU-14474&lt;/del&gt;&lt;/a&gt; has the same stack, it wasn&apos;t seen since the original report in 2021-02-26.  That was seen on patch &lt;a href=&quot;https://review.whamcloud.com/40274&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/40274&lt;/a&gt; &quot;&lt;tt&gt;&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-13073&quot; title=&quot;Multiple MDS deadlocks (in lod_qos_prep_create) after OSS crash&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-13073&quot;&gt;&lt;del&gt;LU-13073&lt;/del&gt;&lt;/a&gt; osp: don&apos;t block waiting for new objects&lt;/tt&gt;&quot; before it landed on 2021-03-10, but this problem only started happening again on 2021-08-13, so it may be the same symptom but a different cause.&lt;/p&gt;</comment>
                            <comment id="312517" author="adilger" created="Sat, 11 Sep 2021 02:23:35 +0000"  >&lt;p&gt;The first recent crash like this was on 2021-08-13, on a patch with parent v2_14_53-23-g29eabeb34c during cleanup of files after the end of the test.  The failing patch &lt;a href=&quot;https://review.whamcloud.com/44541&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/44541&lt;/a&gt; has not yet landed, so cannot be the source of the problem.  This is similar to &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-14995&quot; title=&quot;recovery-small: FAIL: remove sub-test dirs failed: d110h d110i d110j Directory not empty&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-14995&quot;&gt;&lt;del&gt;LU-14995&lt;/del&gt;&lt;/a&gt;, which fails recovery-small during file removal but does not crash.  Patches landed on 2021-08-10 are:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;29eabeb34c LU-14798 lustre: Support RDMA only pages
a7a889f77c LU-14798 lnet: add LNet GPU Direct Support
644cb83921 LU-14893 lctl: check user for changelog_deregister
bbd9646f91 LU-14881 libcfs: Complete testing for tcp_sock_set_*
4e1f9c4bd1 LU-14413 test: test for overstriping for sanity 27M
d6a3e06cb0 LU-14740 quota: reject invalid project id on server side
6b31918565 LU-8066 obdclass: move lu_ref to debugfs
d77e95cc6d LU-14790 lnet: Reflect ni_fatal in NI status
0b94a058fe LU-14694 mdt: do not remove orphans at umount
0a6beb2a50 LU-9859 libcfs: discard cfs_cap_t, use kernel_cap_t
ba1fa08a0f LU-10973 lnet: LUTF Python infra
a55b6dafea LU-10973 lnet: LUTF infrastructure updates
8c166f6bf4 LU-6142 lustre: use list_first_entry() in lustre subdirectory.
163870abfb LU-14382 mdt: implement fallocate in MDC/MDT
dfeb63f2ee LU-14844 tests: make sure mgc_requeue_timeout_min exist.
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="312518" author="adilger" created="Sat, 11 Sep 2021 02:30:26 +0000"  >&lt;p&gt;Test crashed 24/970 = 1/40 in review-dne&lt;span class=&quot;error&quot;&gt;&amp;#91;-zfs&amp;#93;&lt;/span&gt;-part-5 sessions, all of them on master/master-next.&lt;/p&gt;</comment>
                            <comment id="314458" author="adilger" created="Thu, 30 Sep 2021 23:31:16 +0000"  >&lt;p&gt;May be fixed by patch: &lt;a href=&quot;https://review.whamcloud.com/44998&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/44998&lt;/a&gt; &quot;&lt;tt&gt;&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-14474&quot; title=&quot;Oops in llog_cat_prep_log() in sanity-quota / recovery-small&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-14474&quot;&gt;&lt;del&gt;LU-14474&lt;/del&gt;&lt;/a&gt; llog: reset pointer to the next llog&lt;/tt&gt;&quot;.&lt;/p&gt;</comment>
                            <comment id="317156" author="adilger" created="Mon, 1 Nov 2021 20:54:32 +0000"  >&lt;p&gt;All recent failures are due to patches with an old parent that does not contain the &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-14474&quot; title=&quot;Oops in llog_cat_prep_log() in sanity-quota / recovery-small&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-14474&quot;&gt;&lt;del&gt;LU-14474&lt;/del&gt;&lt;/a&gt; fix.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="63090">LU-14474</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="65610">LU-14932</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="65972">LU-14995</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i022pb:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>