<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:55:25 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-5892] lfsck_needs_scan_dir() LBUG</title>
                <link>https://jira.whamcloud.com/browse/LU-5892</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;I&#8217;ve run into a problem while executing the LFSCK Phase 3 test plan at &lt;a href=&quot;https://jira.hpdd.intel.com/browse/LU-4836&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jira.hpdd.intel.com/browse/LU-4836&lt;/a&gt; &lt;br/&gt;
Test 3.3.2 calls for creating a number of subdirectories and a variety of objects in each one, including local and remote subdirectories (local and remote with respect to the MDS). The test plan calls for setting fail_loc to 1502 so that all objects created will have no linkEA. Creating files and local subdirectories works with this failure injected, but creating remote directories does not. In the following, &#8220;rdir-1&#8221; is the remote directory:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Create local subdirectories
Create remote subdirectories
error on LL_IOC_LMV_SETSTRIPE &apos;/lustre/scratch/test_dir/sdir-0/rdir-1&apos; (3): No data available
error: mkdir: create stripe dir &apos;/lustre/scratch/test_dir/sdir-0/rdir-1&apos; failed
status        script            Total(sec) E(xcluded) S(low) 
------------------------------------------------------------------------------------

touch: missing file operand
Try `touch --help&apos; for more information.
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;I cannot remove the remote directory:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;# rm -rf /lustre/scratch/test_dir/sdir-0/rdir-1 
[root@c13 tests]# ls /lustre/scratch/test_dir/sdir-0/rdir-1 
ls: cannot access /lustre/scratch/test_dir/sdir-0/rdir-1: No such file or directory
[root@c13 tests]# ls /lustre/scratch/test_dir
sdir-0
[root@c13 tests]# ls /lustre/scratch/test_dir/sdir-0/
ls: cannot access /lustre/scratch/test_dir/sdir-0/rdir-1: No such file or directory
rdir-1
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;So, I figured I&#8217;d run LFSCK since it is supposed to correct these errors, but LFSCK crashes the node. On MDS1, I reset the fail_loc to zero and ran &#8216;lctl lfsck_start&#8217;:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;# lctl set_param fail_loc=0

# lctl lfsck_start -A -M scratch-MDT0000 -c -C --reset --type namespace
Started LFSCK on the device scratch-MDT0000: scrub namespace
[root@mds01 ~]# 
Message from syslogd@mds01-ib at Nov 10 08:11:20 ...
 kernel:LustreError: 17451:0:(lfsck_engine.c:232:lfsck_needs_scan_dir()) ASSERTION( depth &amp;gt; 0 ) failed: 

Message from syslogd@mds01-ib at Nov 10 08:11:20 ...
 kernel:LustreError: 17451:0:(lfsck_engine.c:232:lfsck_needs_scan_dir()) LBUG
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;From the crash dmesg on MDS1:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;&amp;lt;0&amp;gt;LustreError: 17451:0:(lfsck_engine.c:232:lfsck_needs_scan_dir()) ASSERTION( depth &amp;gt; 0 ) failed: 
&amp;lt;0&amp;gt;LustreError: 17451:0:(lfsck_engine.c:232:lfsck_needs_scan_dir()) LBUG
&amp;lt;4&amp;gt;Pid: 17451, comm: lfsck
&amp;lt;4&amp;gt;
&amp;lt;4&amp;gt;Call Trace:
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa06f2895&amp;gt;] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa06f2e97&amp;gt;] lbug_with_loc+0x47/0xb0 [libcfs]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa19106f7&amp;gt;] lfsck_exec_oit+0x7f7/0xb80 [lfsck]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa078904a&amp;gt;] ? fld_cache_lookup+0x3a/0x1e0 [fld]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa1911d35&amp;gt;] lfsck_master_oit_engine+0x12b5/0x1f30 [lfsck]
&amp;lt;4&amp;gt; [&amp;lt;ffffffff810546b9&amp;gt;] ? __wake_up_common+0x59/0x90
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa191346e&amp;gt;] lfsck_master_engine+0xabe/0x1390 [lfsck]
&amp;lt;4&amp;gt; [&amp;lt;ffffffff81061d00&amp;gt;] ? default_wake_function+0x0/0x20
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa19129b0&amp;gt;] ? lfsck_master_engine+0x0/0x1390 [lfsck]
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8109abf6&amp;gt;] kthread+0x96/0xa0
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8100c20a&amp;gt;] child_rip+0xa/0x20
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8109ab60&amp;gt;] ? kthread+0x0/0xa0
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8100c200&amp;gt;] ? child_rip+0x0/0x20
&amp;lt;4&amp;gt;
&amp;lt;0&amp;gt;Kernel panic - not syncing: LBUG
&amp;lt;4&amp;gt;Pid: 17451, comm: lfsck Not tainted 2.6.32-431.29.2.el6_lustre.gd99708b.x86_64 #1
&amp;lt;4&amp;gt;Call Trace:
&amp;lt;4&amp;gt; [&amp;lt;ffffffff81528fdc&amp;gt;] ? panic+0xa7/0x16f
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa06f2eeb&amp;gt;] ? lbug_with_loc+0x9b/0xb0 [libcfs]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa19106f7&amp;gt;] ? lfsck_exec_oit+0x7f7/0xb80 [lfsck]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa078904a&amp;gt;] ? fld_cache_lookup+0x3a/0x1e0 [fld]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa1911d35&amp;gt;] ? lfsck_master_oit_engine+0x12b5/0x1f30 [lfsck]
&amp;lt;4&amp;gt; [&amp;lt;ffffffff810546b9&amp;gt;] ? __wake_up_common+0x59/0x90
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa191346e&amp;gt;] ? lfsck_master_engine+0xabe/0x1390 [lfsck]
&amp;lt;4&amp;gt; [&amp;lt;ffffffff81061d00&amp;gt;] ? default_wake_function+0x0/0x20
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa19129b0&amp;gt;] ? lfsck_master_engine+0x0/0x1390 [lfsck]
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8109abf6&amp;gt;] ? kthread+0x96/0xa0
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8100c20a&amp;gt;] ? child_rip+0xa/0x20
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8109ab60&amp;gt;] ? kthread+0x0/0xa0
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8100c200&amp;gt;] ? child_rip+0x0/0x20
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;On the second MDS, dmesg contains:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Lustre: *** cfs_fail_loc=1502, val=0***
LustreError: 1426:0:(osd_handler.c:4546:osd_index_ea_insert()) scratch-MDT0001-osd: add [0x3c0000400:0xea61:0x0] error: rc = -61
LustreError: 1426:0:(osd_handler.c:2495:osd_object_destroy()) scratch-MDT0001-osd: delete inode [0x3c0000400:0xea61:0x0]: rc = -61
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</description>
                <environment>OpenSFS cluster with two MDSs with one MDT each, three OSSs with two OSTs each and three clients. Lustre master tag 2.6.90.</environment>
        <key id="27521">LU-5892</key>
            <summary>lfsck_needs_scan_dir() LBUG</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="2" iconUrl="https://jira.whamcloud.com/images/icons/priorities/critical.svg">Critical</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="yong.fan">nasf</assignee>
                                    <reporter username="jamesanunez">James Nunez</reporter>
                        <labels>
                            <label>lfsck</label>
                    </labels>
                <created>Mon, 10 Nov 2014 16:51:51 +0000</created>
                <updated>Sun, 23 Nov 2014 09:27:14 +0000</updated>
                            <resolved>Sun, 23 Nov 2014 09:27:14 +0000</resolved>
                                    <version>Lustre 2.7.0</version>
                                    <fixVersion>Lustre 2.7.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>4</watches>
                                                                            <comments>
                            <comment id="98785" author="jlevi" created="Mon, 10 Nov 2014 18:26:19 +0000"  >&lt;p&gt;Fan Yong,&lt;br/&gt;
Can you look into this one?&lt;br/&gt;
Thank you!&lt;/p&gt;</comment>
                            <comment id="98808" author="jamesanunez" created="Mon, 10 Nov 2014 20:45:06 +0000"  >&lt;p&gt;I&apos;ve uploaded the vmcore from the MDS that crashed at uploads/&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-5892&quot; title=&quot;lfsck_needs_scan_dir() LBUG&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-5892&quot;&gt;&lt;del&gt;LU-5892&lt;/del&gt;&lt;/a&gt; on the ftp site.&lt;/p&gt;</comment>
                            <comment id="98880" author="yong.fan" created="Tue, 11 Nov 2014 14:34:16 +0000"  >&lt;p&gt;James,&lt;br/&gt;
To generate an MDT-object without linkEA, you need to inject the failure stub &quot;#define OBD_FAIL_LFSCK_LINKEA_CRASH     0x1603&quot;. The failure stub 0x1502 instead causes the MDT-object to have no LMV EA. That is why you saw the message &quot;osd_index_ea_insert()) scratch-MDT0001-osd: add &lt;span class=&quot;error&quot;&gt;&amp;#91;0x3c0000400:0xea61:0x0&amp;#93;&lt;/span&gt; error: rc = -61&quot;.&lt;/p&gt;

&lt;p&gt;As for the LBUG(), I will make a patch to fix it.&lt;/p&gt;</comment>
                            <comment id="98883" author="yong.fan" created="Tue, 11 Nov 2014 14:44:45 +0000"  >&lt;p&gt;Here is the patch:&lt;br/&gt;
&lt;a href=&quot;http://review.whamcloud.com/12670&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/12670&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="99885" author="gerrit" created="Sun, 23 Nov 2014 07:13:37 +0000"  >&lt;p&gt;Oleg Drokin (oleg.drokin@intel.com) merged in patch &lt;a href=&quot;http://review.whamcloud.com/12670/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/12670/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-5892&quot; title=&quot;lfsck_needs_scan_dir() LBUG&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-5892&quot;&gt;&lt;del&gt;LU-5892&lt;/del&gt;&lt;/a&gt; lfsck: remove improper LASSERT in lfsck_needs_scan_dir&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 4db8abadae3f8a393fe0d25e07575305ae3876da&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzx0k7:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>16466</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>