<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:15:02 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-8145] start dump trace thread once CERROR</title>
                <link>https://jira.whamcloud.com/browse/LU-8145</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;As we can not monitoring the debug trace all the time, especially when&lt;br/&gt;
the error will happen. So if the dump trace can start once CERROR, that would be very helpful, and we can collect as more debug info as possible.&lt;/p&gt;</description>
                <environment></environment>
        <key id="36939">LU-8145</key>
            <summary>start dump trace thread once CERROR</summary>
                <type id="4" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11310&amp;avatarType=issuetype">Improvement</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="2">Won&apos;t Fix</resolution>
                                        <assignee username="jgmitter">Joseph Gmitter</assignee>
                                    <reporter username="cengku9660">Gu Zheng</reporter>
                        <labels>
                            <label>patch</label>
                    </labels>
                <created>Mon, 16 May 2016 08:06:28 +0000</created>
                <updated>Thu, 7 Jun 2018 14:01:49 +0000</updated>
                            <resolved>Thu, 7 Jun 2018 14:01:49 +0000</resolved>
                                                                        <due></due>
                            <votes>0</votes>
                                    <watches>5</watches>
                                                                            <comments>
                            <comment id="152382" author="gerrit" created="Mon, 16 May 2016 08:08:16 +0000"  >&lt;p&gt;Gu Zheng (gzheng@ddn.com) uploaded a new patch: &lt;a href=&quot;http://review.whamcloud.com/20218&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/20218&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-8145&quot; title=&quot;start dump trace thread once CERROR&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-8145&quot;&gt;&lt;del&gt;LU-8145&lt;/del&gt;&lt;/a&gt; libcfs: add dump debug trace on error support&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 831bd3adc5294a5d6a91bc46440062f9499f9bf6&lt;/p&gt;</comment>
                            <comment id="152442" author="adilger" created="Mon, 16 May 2016 17:17:03 +0000"  >&lt;p&gt;Have you been using this patch in production anywhere?  It seems to me that virtually every time that Lustre is started, especially after a recovery, it will print a CERROR() to the console and start debug daemon.  In that case, it will always be running on all of the nodes.&lt;/p&gt;</comment>
                            <comment id="152504" author="cengku9660" created="Tue, 17 May 2016 01:16:42 +0000"  >&lt;p&gt;Hi Andreas,&lt;br/&gt;
Thanks for your comments.&lt;br/&gt;
Just simple tests, it works well, but not used on production yet.&lt;br/&gt;
&#8220;It seems to me that virtually every time that Lustre is started&#8221;&lt;br/&gt;
Is there CERROR message in the dumped log?&lt;/p&gt;</comment>
                            <comment id="152505" author="adilger" created="Tue, 17 May 2016 01:30:14 +0000"  >&lt;p&gt;I thought that messages like:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Lustre: myth-OST0000: Will be in recovery for at least 5:00, or until 2 clients reconnect
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;were printed with LustreError, but they are not.  Maybe it will be OK.  Best to check some of your customer systems for LustreError in the console logs starting at boot time to see if having debug_daemon enabled after CERROR() would be useful for debugging or just dumping the debug log once.  I suspect that in most cases the valuable debugging information will have happened &lt;em&gt;before&lt;/em&gt; the CERROR() and not after, so dumping the current logs would be enough.&lt;/p&gt;

&lt;p&gt;Note also that there are already module parameters like &lt;tt&gt;dump_on_eviction&lt;/tt&gt;, &lt;tt&gt;dump_on_timeout&lt;/tt&gt;, and &lt;tt&gt;dump_on_peer_timeout&lt;/tt&gt; that will dump the logs once without having to enable debug_daemon to collect a large amount of debug information.&lt;/p&gt;</comment>
                            <comment id="152583" author="green" created="Tue, 17 May 2016 17:18:37 +0000"  >&lt;p&gt;Note that Fujitsu has a similar thing where all lustre messages are dumped into a debug-deamon-like buffer not to clog dmesg.&lt;/p&gt;

&lt;p&gt;But anyway the biggest problem here is such that we run with very lean debug mask by default, so there&apos;s hardly anything you get from it outside of CERROR/CWARN that is already logged in syslog/dmesg.&lt;/p&gt;

&lt;p&gt;Now you could increase the debug level, but if you do it by default, this suddenly makes your FS slower and nobody likes that. If you do it on first CERROR, it&apos;s kind of too late too.&lt;/p&gt;</comment>
                            <comment id="152956" author="cengku9660" created="Fri, 20 May 2016 05:41:48 +0000"  >&lt;p&gt;Yeah, it maybe not very helpful to production environment, but, IMO, it is useful to debug, especially when we hit error on production but hard to reproduce. &lt;/p&gt;</comment>
                            <comment id="215149" author="simmonsja" created="Fri, 1 Dec 2017 18:37:32 +0000"  >&lt;p&gt;Can we close this ticket.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                                                <inwardlinks description="is related to">
                                        <issuelink>
            <issuekey id="42674">LU-8980</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10490" key="com.atlassian.jira.plugin.system.customfieldtypes:datepicker">
                        <customfieldname>End date</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>Fri, 20 May 2016 08:06:28 +0000</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                            <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzybp3:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                <customfield id="customfield_10493" key="com.atlassian.jira.plugin.system.customfieldtypes:datepicker">
                        <customfieldname>Start date</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>Mon, 16 May 2016 08:06:28 +0000</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                    </customfields>
    </item>
</channel>
</rss>