<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:25:02 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-2418] Add Way to Detect Dropped Packets on Production Systems</title>
                <link>https://jira.whamcloud.com/browse/LU-2418</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Today, it is very difficult to confirm whether timeouts in Lustre are due to dropped packets in LNet.  This is due to two reasons:&lt;/p&gt;

&lt;p&gt;1- neterrors are off by default so logging does not show dropped packets.&lt;br/&gt;
2- the errors counter is never incremented (see &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-2223&quot; title=&quot;LNet counter &amp;quot;errors&amp;quot; never gets incremented&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-2223&quot;&gt;LU-2223&lt;/a&gt;).&lt;/p&gt;

&lt;p&gt;My understanding is that neterrors are off by default because there is too much &quot;noise&quot; when they are on.  That begs the question: how can logs which are issued that frequently be considered errors?&lt;/p&gt;

&lt;p&gt;I think this issue can be address in one of two ways:&lt;/p&gt;

&lt;p&gt;1- Clean up the neterror logs so they are not noisy and then leave neterrors on by default.&lt;br/&gt;
2- Add a set of new counters to LNet to count the reasons for dropped packets.&lt;/p&gt;</description>
                <environment></environment>
        <key id="16833">LU-2418</key>
            <summary>Add Way to Detect Dropped Packets on Production Systems</summary>
                <type id="4" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11310&amp;avatarType=issuetype">Improvement</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="1" iconUrl="https://jira.whamcloud.com/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="default"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="doug">Doug Oucharek</reporter>
                        <labels>
                    </labels>
                <created>Fri, 30 Nov 2012 19:57:15 +0000</created>
                <updated>Mon, 13 Jun 2016 23:40:37 +0000</updated>
                                            <version>Lustre 2.4.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>2</watches>
                                                                            <comments>
                            <comment id="48763" author="isaac" created="Tue, 4 Dec 2012 17:42:09 +0000"  >&lt;p&gt;I remembered that they were too noisy in sites where there&apos;s always 10s of nodes down for maintenance e.g., but there&apos;s repeated attempts to communicate with them e.g. router pinger or upper layers. So if a same error happened with 50 nodes, there&apos;d be 50 such error messages, instead of one that says this error happened with these 50 nodes.&lt;/p&gt;

&lt;p&gt;It became even worse at sites where console outputs of servers were gathered into one place.&lt;/p&gt;</comment>
                            <comment id="155572" author="doug" created="Mon, 13 Jun 2016 23:40:37 +0000"  >&lt;p&gt;&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-8223&quot; title=&quot;De-Noise LNet neterr logs so they can be ON by default&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-8223&quot;&gt;LU-8223&lt;/a&gt; implements one part of this solution: get neterrors on by default.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                            <outwardlinks description="duplicates">
                                        <issuelink>
            <issuekey id="37297">LU-8223</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzvd5r:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>5735</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>