<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:34:41 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-17344] ldlm_resource_get() ASSERTION(name-&gt;name[0] != 0) failed</title>
                <link>https://jira.whamcloud.com/browse/LU-17344</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;A system running LFSCK was crashing in a loop, apparently trying to destroy a bad object FID:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt; LustreError: 16300:0:(ldlm_resource.c:1488:ldlm_resource_get()) ASSERTION(name-&amp;gt;name[0] != 0) failed:
 kernel:Kernel panic - not syncing: LBUG 
 Call Trace:
 libcfs_call_trace+0x90/0xf0 [libcfs]
 lbug_with_loc+0x4c/0xa0 [libcfs]
 ldlm_resource_get+0x7e9/0x950 [ptlrpc]
 ldlm_lock_create+0x55/0xa60 [ptlrpc]
 ldlm_cli_enqueue_local+0xcc/0x850 [ptlrpc]
 lfsck_layout_slave_conditional_destroy [lfsck]
 lfsck_layout_slave_in_notify+0xa19/0xed0 [lfsck]
 lfsck_in_notify+0x23c/0x320 [lfsck]
 tgt_handle_lfsck_notify+0x5c/0x140 [ptlrpc]
 tgt_request_handle+0x8bf/0x18c0 [ptlrpc]
 ptlrpc_server_handle_request+0x253/0xc40 [ptlrpc]
 ptlrpc_main+0xc4a/0x1cb0 [ptlrpc]
 kthread+0xd1/0xe0
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;It probably makes sense to have &lt;tt&gt;lfsck_layout_slave_conditional_destroy()&lt;/tt&gt; or a higher level check that the FID is valid before calling all the way down to &lt;tt&gt;ldlm_cli_enqueue_local()&lt;/tt&gt;.&lt;/p&gt;</description>
                <environment></environment>
        <key id="79400">LU-17344</key>
            <summary>ldlm_resource_get() ASSERTION(name-&gt;name[0] != 0) failed</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="1" iconUrl="https://jira.whamcloud.com/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="default"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="hongchao.zhang">Hongchao Zhang</assignee>
                                    <reporter username="adilger">Andreas Dilger</reporter>
                        <labels>
                    </labels>
                <created>Thu, 7 Dec 2023 19:06:10 +0000</created>
                <updated>Fri, 29 Dec 2023 08:54:20 +0000</updated>
                                            <version>Lustre 2.14.0</version>
                    <version>Lustre 2.16.0</version>
                    <version>Lustre 2.15.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>5</watches>
                                                                            <comments>
                            <comment id="395904" author="adilger" created="Thu, 7 Dec 2023 19:29:48 +0000"  >&lt;p&gt;Hongchao, could you please take a look into this.   It looks like a relatively simple patch  in &lt;tt&gt;lfsck_layout_slave_conditional_destroy()&lt;/tt&gt; to check that the FID is valid before calling down the stack to &quot;destroy&quot; this bad object.&lt;/p&gt;

&lt;p&gt;It looks like this crash was on the OST side (&lt;tt&gt;lfsck_layout_slave_in_notify()&lt;/tt&gt;) so it makes sense to handle this case if there is bad data sent from the MDS, but it would likely &lt;b&gt;also&lt;/b&gt; make sense to add a check on the MDS side of LFSCK so that it doesn&apos;t even &lt;b&gt;send&lt;/b&gt; the request to destroy an object that doesn&apos;t exist (e.g. FID &lt;tt&gt;0x0:0x0:0x0&lt;/tt&gt; or similar) that is coming from a bad file layout on disk.&lt;/p&gt;

&lt;p&gt;Probably also changing &lt;tt&gt;ldlm_resource_get()&lt;/tt&gt; to print and return an error instead of &lt;tt&gt;LASSERT()&lt;/tt&gt; would make the code more robust, though harder to debug in the future.&lt;/p&gt;</comment>
                            <comment id="396179" author="hongchao.zhang" created="Mon, 11 Dec 2023 02:00:35 +0000"  >&lt;p&gt;Hi,&lt;br/&gt;
The FID had seemed to parsed correctly and the corresponding &lt;b&gt;dt_object&lt;/b&gt; had been returned successfully and&lt;br/&gt;
the existance of the &lt;b&gt;dt_object&lt;/b&gt; already been verified, then is it possible that the issue is related to the conversion&lt;br/&gt;
between &lt;b&gt;ost_id&lt;/b&gt; and &lt;b&gt;lu_fid&lt;/b&gt; for some special FID? how about adding some corresponding check here?&lt;br/&gt;
Thanks&lt;/p&gt;</comment>
                            <comment id="398200" author="gerrit" created="Fri, 29 Dec 2023 08:54:20 +0000"  >&lt;p&gt;&quot;Hongchao Zhang &amp;lt;hongchao@whamcloud.com&amp;gt;&quot; uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/c/fs/lustre-release/+/53565&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/c/fs/lustre-release/+/53565&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-17344&quot; title=&quot;ldlm_resource_get() ASSERTION(name-&amp;gt;name[0] != 0) failed&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-17344&quot;&gt;LU-17344&lt;/a&gt; lfsck: check the validity of the res_id&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 23b263a28ff0b60f9f0674959fc9cb439cb84e71&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                                                <inwardlinks description="is related to">
                                                        </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i0449j:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>