<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:19:51 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-8706] e2fsck -fDy running forever</title>
                <link>https://jira.whamcloud.com/browse/LU-8706</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;We had several OSSes crash yesterday that appear to be related to quotas. It was exposed when I ran e2fsck against one of the OSTs and on a subsequent run after it was repaired, it started complaining about the htree and has been spewing &quot;Unattached inode &amp;lt;num&amp;gt; connect to lost+found (similar to what was reported in &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-3542&quot; title=&quot;deleted/unused inodes not actually cleared by e2fsck&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-3542&quot;&gt;&lt;del&gt;LU-3542&lt;/del&gt;&lt;/a&gt;) for the past couple of hours. All the other OSTs checked fine when I just ran e2fsck without the -D option.&lt;/p&gt;

&lt;p&gt;Do I just let this run to completion or do I have alternatives? &lt;/p&gt;</description>
                <environment>toss-2.4-2</environment>
        <key id="40586">LU-8706</key>
            <summary>e2fsck -fDy running forever</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="adilger">Andreas Dilger</assignee>
                                    <reporter username="jamervi">Joe Mervini</reporter>
                        <labels>
                    </labels>
                <created>Thu, 13 Oct 2016 18:08:59 +0000</created>
                <updated>Fri, 14 Oct 2016 17:22:58 +0000</updated>
                            <resolved>Fri, 14 Oct 2016 17:22:58 +0000</resolved>
                                    <version>Lustre 2.5.3</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                            <comments>
                            <comment id="169512" author="adilger" created="Thu, 13 Oct 2016 18:27:01 +0000"  >&lt;p&gt;What version of e2fsck are you running?  There was a bug in e2fsck (&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-7368&quot; title=&quot;e2fsck unsafe to interrupt with quota enabled&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-7368&quot;&gt;&lt;del&gt;LU-7368&lt;/del&gt;&lt;/a&gt;) that if you interrupted it when quotas was enabled that it could corrupt the filesystem.  There was a second bug in the &lt;tt&gt;e2fsck -fD&lt;/tt&gt; option (&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-7381&quot; title=&quot;&amp;quot;e2fsck -fD&amp;quot; on directory may cause extent tree corruption&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-7381&quot;&gt;&lt;del&gt;LU-7381&lt;/del&gt;&lt;/a&gt;) that was also fixed in e2fsprogs-1.42.13.wc5, which is the latest release.&lt;/p&gt;

&lt;p&gt;I suspect that there isn&apos;t anything to be done at this stage, and you need to let the e2fsck run to completion.  If you are getting entries moved into lost+found, then you will need to run &lt;tt&gt;ll_recover_lost_found_objs&lt;/tt&gt; on the OSTs to move the OST objects from &lt;tt&gt;lost+found&lt;/tt&gt; back into the &lt;tt&gt;O/0/d&amp;#42;&lt;/tt&gt; directories.&lt;/p&gt;</comment>
                            <comment id="169514" author="jamervi" created="Thu, 13 Oct 2016 18:43:26 +0000"  >&lt;p&gt;Andres - On this particular stack we&apos;re at 1.42.9-wc1-7 so I&apos;ll let it run out. Thank God for ll_recover_lost_found_objs! It&apos;s definitely saved me in the past...&lt;/p&gt;</comment>
                            <comment id="169548" author="jamervi" created="Thu, 13 Oct 2016 22:05:10 +0000"  >&lt;p&gt;The fsck has been running now for 6.5 hours and the inode count that has been moved to lost+found is ~2.4M. I&apos;m concerned about that directory space. Should I be?&lt;/p&gt;</comment>
                            <comment id="169559" author="jamervi" created="Thu, 13 Oct 2016 22:36:39 +0000"  >&lt;p&gt;Another point - this file system is relatively new (just over a year old and I believe that it was formatted against 2.5.3). So I&apos;m hoping that &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-7381&quot; title=&quot;&amp;quot;e2fsck -fD&amp;quot; on directory may cause extent tree corruption&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-7381&quot;&gt;&lt;del&gt;LU-7381&lt;/del&gt;&lt;/a&gt; is not relevant.&lt;/p&gt;</comment>
                            <comment id="169614" author="adilger" created="Fri, 14 Oct 2016 07:46:24 +0000"  >&lt;p&gt;Depending on the amount of corruption, you may have a substantial fraction of all OST objects moved into lost+found.  The directory size itself is unlikely to be a problem, since it is grown as more entries are added.  However, I believe the insertion is a linear search for each new object, so this has the potential to take a long time.  How many objects are on your other OSTs?&lt;/p&gt;</comment>
                            <comment id="169676" author="jamervi" created="Fri, 14 Oct 2016 16:27:46 +0000"  >&lt;p&gt;The fsck completed sometime over and ll_recover_lost_found_objs was successful and our file system is back online. I was surprised that the object recovery took only seconds to perform, especially since the fsck to SO long to connect what appeared to be every object to lost+found. I anticipated it would have taken hours. &lt;/p&gt;

&lt;p&gt;In any event, this ticket can be closed. Thanks for your support.&lt;/p&gt;</comment>
                            <comment id="169683" author="adilger" created="Fri, 14 Oct 2016 17:22:58 +0000"  >&lt;p&gt;Glad to hear that this problem was resolved.&lt;/p&gt;

&lt;p&gt;The ll_recover_lost_found_objs code runs much faster because it is using the hashed directory code in the kernel to insert objects into the &lt;tt&gt;O/0/d*&lt;/tt&gt; directories, and since there are 32 of them they are 1/32 as large.  The e2fsck code is doing a linear search and insertion for each entry in O(n^2) time, and this is a single directory that is 32x as large.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="32950">LU-7368</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="32992">LU-7381</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzyrrr:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>