<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:02:51 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-11] Lustre 2.x functionality regression: Missing aggregate MDT stats</title>
                <link>https://jira.whamcloud.com/browse/LU-11</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;LLNL has pointed out that LMT performance at scale (e.g. 20k clients) will greatly suffer if LMT has to read the per-client-export stats in order to recreate the missing aggregate MDT stats.  By extension, any other monitoring tools depending on aggregate MDT stats will also be affected by this regression.&lt;/p&gt;

&lt;p&gt;----- Forwarded message from Brian Behlendorf &amp;lt;behlendorf1@llnl.gov&amp;gt; -----&lt;/p&gt;

&lt;p&gt;Date: Tue, 2 Nov 2010 14:43:49 -0700&lt;br/&gt;
From: Brian Behlendorf &amp;lt;behlendorf1@llnl.gov&amp;gt;&lt;br/&gt;
To: Jim Garlick &amp;lt;garlick@llnl.gov&amp;gt;&lt;br/&gt;
Subject: Re: &lt;span class=&quot;error&quot;&gt;&amp;#91;mjmac@whamcloud.com: Re: LMT work&amp;#93;&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;Check out bug 21420 comment #40, specifically commit 9eb3d1db in HEAD.&lt;br/&gt;
This is where they moved the stats from being global to being&lt;br/&gt;
per-export, they appear to think they were useless.&lt;/p&gt;

&lt;p&gt;commit 9eb3d1db42d2937daef25950f6527ccb46221f8e&lt;br/&gt;
Author: LiuYing &amp;lt;emoly.liu@sun.com&amp;gt;&lt;br/&gt;
Date:   Fri Oct 8 10:48:14 2010 +0800&lt;/p&gt;

&lt;p&gt;   b=21420 Add mds/mgs stats to HEAD&lt;/p&gt;

&lt;p&gt;   1)remove useless counter from mds and move some definitions &lt;br/&gt;
     from mds to mdt;&lt;br/&gt;
   2)move LPROCFS_MD_OP_INITs from lprocfs_alloc_md_stats() to&lt;br/&gt;
     lprocfs_init_mps_stats(), which is needed by this stats;&lt;br/&gt;
   3)increase mdt counter for each type operation&lt;/p&gt;

&lt;p&gt;   i=andreas&lt;br/&gt;
   i=wangdi&lt;/p&gt;</description>
                <environment></environment>
        <key id="10100">LU-11</key>
            <summary>Lustre 2.x functionality regression: Missing aggregate MDT stats</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="bobijam">Zhenyu Xu</assignee>
                                    <reporter username="mjmac">Michael MacDonald</reporter>
                        <labels>
                    </labels>
                <created>Wed, 3 Nov 2010 07:52:23 +0000</created>
                <updated>Fri, 12 Nov 2010 17:58:39 +0000</updated>
                            <resolved>Fri, 12 Nov 2010 17:58:39 +0000</resolved>
                                    <version>Lustre 2.0.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>2</watches>
                                                                            <comments>
                            <comment id="10144" author="rread" created="Wed, 3 Nov 2010 18:22:11 +0000"  >&lt;p&gt;I reopened &lt;a href=&quot;https://bugzilla.lustre.org/show_bug.cgi?id=21420&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;21420&lt;/a&gt; and requested the functionality be restored however it looks like the MDS aggregate stats were removed in 2008 in commit 69a3513021212ed1eb8823a50f80853e22e607b3. This patch only removed the &lt;em&gt;unused&lt;/em&gt; initialization code.&lt;/p&gt;</comment>
                            <comment id="10150" author="rread" created="Fri, 5 Nov 2010 15:54:05 +0000"  >&lt;p&gt;I had this chat with Andreas earlier today:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[11:00] adilger: looking at the patch, it _does_ appear that there should be MDT global stats - see mdt_lproc.c::mdt_procfs_init() hunk, and that mdt_counter_incr() is incrementing the obd_stats counter in addition to the per-export counter
[11:00] rread: true, but i couldn&apos;t find the stats when i tested this
[11:01] rread: mdt_stats_counter_init is only called for the nid_stats
[11:02] rread: don&apos;t we also need to call this with obd_stats somewhere?
[11:04] adilger: the stats init for the obd devices is done as part of the lprocfs_alloc_md_stats() code
[11:05] adilger: I wonder if the stats are being collected, but the MDT obd device itself is not being hooked into lprocfs?
[11:06] rread: the stats file was there, just no stats
[11:58] adilger: sorry, was on another concall...  I suspect this is a bug in the MDT device setup due to the half-finished MDS-&amp;gt;MDT code reorg
[11:59] adilger: i.e. something foolish like the &quot;old&quot; MDT has an OBD device, and the &quot;new&quot; CMD MDT has a separate MDT device
[12:00] adilger: err, a separate OBD device
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="10151" author="rread" created="Fri, 5 Nov 2010 15:56:45 +0000"  >&lt;p&gt;Bobi Jam, please review the comments here and for some context,  the most recent ones on &lt;a href=&quot;https://bugzilla.lustre.org/show_bug.cgi?id=21420&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;21420&lt;/a&gt;. It appears there is just an initialization problem here. &lt;/p&gt;</comment>
                            <comment id="10153" author="bobijam" created="Sat, 6 Nov 2010 00:36:15 +0000"  >&lt;p&gt;found the root cause, mdt_counter_incr() should act upon obd-&amp;gt;md_stats instead upon obd-&amp;gt;obd_stats, the former is for recording md ops, while the later one for obd ops (such as connect, disconnect)&lt;/p&gt;</comment>
                            <comment id="10154" author="bobijam" created="Sat, 6 Nov 2010 00:45:03 +0000"  >&lt;p&gt;I&apos;ve tried my patch&lt;br/&gt;
================ w/o patch ==================================================&lt;/p&gt;
&lt;ol&gt;
	&lt;li&gt;cat /proc/fs/lustre/mdt/lustre-MDT0000/md_stats&lt;br/&gt;
snapshot_time             1289058069.650218 secs.usecs&lt;/li&gt;
&lt;/ol&gt;


&lt;ol&gt;
	&lt;li&gt;cat /proc/fs/lustre/mdt/lustre-MDT0000/exports/0&amp;#64;lo/stats&lt;br/&gt;
snapshot_time             1289058074.119639 secs.usecs&lt;br/&gt;
open                      1 samples &lt;span class=&quot;error&quot;&gt;&amp;#91;reqs&amp;#93;&lt;/span&gt;&lt;br/&gt;
close                     1 samples &lt;span class=&quot;error&quot;&gt;&amp;#91;reqs&amp;#93;&lt;/span&gt;&lt;br/&gt;
mkdir                     1 samples &lt;span class=&quot;error&quot;&gt;&amp;#91;reqs&amp;#93;&lt;/span&gt;&lt;/li&gt;
&lt;/ol&gt;


&lt;p&gt;================ with patch ==================================================&lt;/p&gt;
&lt;ol&gt;
	&lt;li&gt;cat /proc/fs/lustre/mdt/lustre-MDT0000/md_stats&lt;br/&gt;
snapshot_time             1289057387.663119 secs.usecs&lt;br/&gt;
open                      1 samples &lt;span class=&quot;error&quot;&gt;&amp;#91;reqs&amp;#93;&lt;/span&gt;&lt;br/&gt;
close                     1 samples &lt;span class=&quot;error&quot;&gt;&amp;#91;reqs&amp;#93;&lt;/span&gt;&lt;br/&gt;
mkdir                     1 samples &lt;span class=&quot;error&quot;&gt;&amp;#91;reqs&amp;#93;&lt;/span&gt;&lt;/li&gt;
&lt;/ol&gt;


&lt;ol&gt;
	&lt;li&gt;cat /proc/fs/lustre/mdt/lustre-MDT0000/exports/0&amp;#64;lo/stats&lt;br/&gt;
snapshot_time             1289057384.833959 secs.usecs&lt;br/&gt;
open                      1 samples &lt;span class=&quot;error&quot;&gt;&amp;#91;reqs&amp;#93;&lt;/span&gt;&lt;br/&gt;
close                     1 samples &lt;span class=&quot;error&quot;&gt;&amp;#91;reqs&amp;#93;&lt;/span&gt;&lt;br/&gt;
mkdir                     1 samples &lt;span class=&quot;error&quot;&gt;&amp;#91;reqs&amp;#93;&lt;/span&gt;&lt;/li&gt;
&lt;/ol&gt;
</comment>
                            <comment id="10157" author="bobijam" created="Sat, 6 Nov 2010 09:11:58 +0000"  >&lt;p&gt;posted patch for review at &lt;a href=&quot;http://review.whamcloud.com/#change,124&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#change,124&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="10160" author="bobijam" created="Mon, 8 Nov 2010 20:19:28 +0000"  >&lt;p&gt;posted patch in bz 21420.&lt;/p&gt;</comment>
                            <comment id="10179" author="bobijam" created="Fri, 12 Nov 2010 17:58:39 +0000"  >&lt;p&gt;patch (&lt;a href=&quot;https://bugzilla.lustre.org/attachment.cgi?id=32148&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://bugzilla.lustre.org/attachment.cgi?id=32148&lt;/a&gt;) got landed.&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                    <customfield id="customfield_10020" key="com.atlassian.jira.plugin.system.customfieldtypes:float">
                        <customfieldname>Bugzilla ID</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>21420.0</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzw0if:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>10163</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>