<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:20:36 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-8794] update_log_dir consuming 1.1TB on MDT0000</title>
                <link>https://jira.whamcloud.com/browse/LU-8794</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;On zinc, a DNE filesystem with 16 MDTs, the pool containing MDT0000 (zinc1) ran out of space.  Upon inspection, we find that 1.1 TB is occupied by files contained in updat_log_dir.  The rest of the MDT occupies about 300MB, which is about the same as the space used by each of the other 15 MDTs.&lt;/p&gt;</description>
                <environment>Lustre: Build Version: 2.8.0_5.chaos</environment>
        <key id="41264">LU-8794</key>
            <summary>update_log_dir consuming 1.1TB on MDT0000</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="2" iconUrl="https://jira.whamcloud.com/images/icons/priorities/critical.svg">Critical</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="5">Cannot Reproduce</resolution>
                                        <assignee username="laisiyao">Lai Siyao</assignee>
                                    <reporter username="ofaaland">Olaf Faaland</reporter>
                        <labels>
                            <label>llnl</label>
                    </labels>
                <created>Wed, 2 Nov 2016 21:19:38 +0000</created>
                <updated>Wed, 29 Nov 2017 19:00:14 +0000</updated>
                            <resolved>Wed, 29 Nov 2017 19:00:14 +0000</resolved>
                                                                        <due></due>
                            <votes>0</votes>
                                    <watches>4</watches>
                                                                            <comments>
                            <comment id="172085" author="ofaaland" created="Wed, 2 Nov 2016 21:20:12 +0000"  >&lt;p&gt;There are 158 files in update_log_dir.&lt;br/&gt;
68 size&amp;gt;10GB&lt;br/&gt;
29 10GB &amp;gt; size &amp;gt;= 1GB&lt;br/&gt;
7 1GB &amp;gt; size &amp;gt;= 1M&lt;br/&gt;
44 size &amp;lt; 1M&lt;/p&gt;</comment>
                            <comment id="172210" author="jgmitter" created="Thu, 3 Nov 2016 17:26:41 +0000"  >&lt;p&gt;Hi Lai,&lt;/p&gt;

&lt;p&gt;Can you please take a look at this issue?&lt;/p&gt;

&lt;p&gt;Thanks.&lt;br/&gt;
Joe&lt;/p&gt;</comment>
                            <comment id="172249" author="ofaaland" created="Thu, 3 Nov 2016 22:46:49 +0000"  >&lt;p&gt;Unfortunately I cannot be certain of the filesystem activity that caused this.  We were not monitoring the space usage in the pool (although we are now).&lt;/p&gt;

&lt;p&gt;I also cannot provide debug logs from the MDTs, as we discovered the problem after a reboot of the servers.&lt;/p&gt;

&lt;p&gt;The only information available is syslog output for the servers and the contents of the MDT itself.&lt;/p&gt;

&lt;p&gt;Di Wang suggested I can delete the contents of update_log_dir.  Let me know if you need any information about its contents before I do that.&lt;/p&gt;</comment>
                            <comment id="172256" author="ofaaland" created="Fri, 4 Nov 2016 01:45:11 +0000"  >&lt;p&gt;Note that this ticket is purely for trying to figure out why the update logs are occupying so much space.  There is a separate ticket, &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-8787&quot; title=&quot;zpool containing MDT0000 out of space&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-8787&quot;&gt;&lt;del&gt;LU-8787&lt;/del&gt;&lt;/a&gt;, for how to recover.&lt;/p&gt;

&lt;p&gt;If the contents of the MDT won&apos;t help us learn what happened, we can just close the ticket until it happens again and we can get better information.&lt;br/&gt;
We have started monitoring space used in the pool containing the MDT, and will be more likely to notice if the volume of update logs increases.&lt;/p&gt;</comment>
                            <comment id="172274" author="di.wang" created="Fri, 4 Nov 2016 04:49:41 +0000"  >&lt;p&gt;&lt;a href=&quot;http://review.whamcloud.com/18028&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/18028&lt;/a&gt; (&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-6838&quot; title=&quot;update llog become too big before it is destroyed&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-6838&quot;&gt;&lt;del&gt;LU-6838&lt;/del&gt;&lt;/a&gt;) might help here, but as it explained there, the plain log limit size is around 800M, probably can not explain why the update log file reach to 1T.  something is strange here. anyway I think the suggestion on &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-8714&quot; title=&quot;too many update logs during soak-test.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-8714&quot;&gt;LU-8714&lt;/a&gt; is the way to go.&lt;/p&gt;</comment>
                            <comment id="214957" author="ofaaland" created="Wed, 29 Nov 2017 19:00:06 +0000"  >&lt;p&gt;I was unable to reproduce the problem after it was initially encountered, and we have not seen it since on test or production systems since then, perhaps because we have not been testing DNE2 and use very few remote directories.  Closing.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="40741">LU-8714</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="31056">LU-6838</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzyu9j:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>