<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:28:41 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-16632] sanity test_56xh: &apos;lfs migrate -W&apos; too slow</title>
                <link>https://jira.whamcloud.com/browse/LU-16632</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;This issue was created by maloo for S Buisson &amp;lt;sbuisson@ddn.com&amp;gt;&lt;/p&gt;

&lt;p&gt;This issue relates to the following test suite run: &lt;a href=&quot;https://testing.whamcloud.com/test_sets/b7260618-4120-4ad7-ad92-b6aae2bb0332&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/b7260618-4120-4ad7-ad92-b6aae2bb0332&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;test_56xh failed with the following error:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;&apos;lfs migrate -W&apos; too slow in VM (80 &amp;gt; 2 * 25 2)
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Test session details:&lt;br/&gt;
clients: &lt;a href=&quot;https://build.whamcloud.com/job/lustre-reviews/92966&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://build.whamcloud.com/job/lustre-reviews/92966&lt;/a&gt; - 4.18.0-348.7.1.el8_5.x86_64&lt;br/&gt;
servers: &lt;a href=&quot;https://build.whamcloud.com/job/lustre-reviews/92966&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://build.whamcloud.com/job/lustre-reviews/92966&lt;/a&gt; - 4.18.0-348.23.1.el8_lustre.x86_64&lt;/p&gt;

&lt;p&gt;VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV&lt;br/&gt;
sanity test_56xh - &apos;lfs migrate -W&apos; too slow in VM (80 &amp;gt; 2 * 25 2)&lt;/p&gt;</description>
                <environment></environment>
        <key id="75013">LU-16632</key>
            <summary>sanity test_56xh: &apos;lfs migrate -W&apos; too slow</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="yujian">Jian Yu</assignee>
                                    <reporter username="maloo">Maloo</reporter>
                        <labels>
                    </labels>
                <created>Fri, 10 Mar 2023 19:14:40 +0000</created>
                <updated>Tue, 14 Nov 2023 22:37:19 +0000</updated>
                            <resolved>Wed, 29 Mar 2023 03:30:22 +0000</resolved>
                                                    <fixVersion>Lustre 2.16.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>4</watches>
                                                                            <comments>
                            <comment id="365560" author="adilger" created="Fri, 10 Mar 2023 20:43:18 +0000"  >&lt;p&gt;The &quot;lfs migrate -W&quot; option and corresponding test were added in &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-13482&quot; title=&quot;add verbose stats to lfs_migrate and &amp;quot;lfs migrate&amp;quot;&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-13482&quot;&gt;&lt;del&gt;LU-13482&lt;/del&gt;&lt;/a&gt;. &lt;/p&gt;

&lt;p&gt;We can&apos;t reliably measure performance inside the VMs, so we likely need to increase the margin of error allowed for this test.  It currently checks:&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;
       (( elapsed &amp;lt;= size_mb * 150 / 100 )) ||
               error &lt;span class=&quot;code-quote&quot;&gt;&quot;&lt;span class=&quot;code-quote&quot;&gt;&apos;lfs migrate -W&apos;&lt;/span&gt; too slow in VM ($elapsed &amp;gt; 2 * $size_mb 2)&quot;&lt;/span&gt;
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;The error message claims the limit is 2x the time, but the check only allows 1.5x. Both the check and the message should be changed to use &quot;&lt;tt&gt;&amp;#42; 350 / 100&lt;/tt&gt;&quot;, and the stray &quot;2&quot; at the end of the error should be removed (the test took 80s vs 25s without throttle, which is 3.2x longer).&lt;/p&gt;
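
&lt;p&gt;A minimal sketch of the relaxed check described above, written as a standalone shell function. The function name &lt;tt&gt;check_migrate_elapsed&lt;/tt&gt; and the &lt;tt&gt;margin_pct&lt;/tt&gt; parameter are hypothetical and are not part of sanity.sh; &lt;tt&gt;elapsed&lt;/tt&gt; and &lt;tt&gt;size_mb&lt;/tt&gt; mirror the variables in the quoted check:&lt;/p&gt;

```shell
# Hypothetical helper (an assumption for illustration, not code from sanity.sh).
# Succeeds when the elapsed time is within margin_pct percent of the baseline
# that the quoted check derives from size_mb.
check_migrate_elapsed() {
	local elapsed=$1            # measured seconds for the throttled migrate
	local size_mb=$2            # baseline value used by the quoted check
	local margin_pct=${3:-350}  # allowed percentage; 350 is the suggested margin
	local limit=$(( size_mb * margin_pct / 100 ))

	if [ "$elapsed" -gt "$limit" ]; then
		# Report the computed limit so the message matches the check
		echo "'lfs migrate -W' too slow in VM ($elapsed > $limit)"
		return 1
	fi
	return 0
}
```

&lt;p&gt;With the values from the failed run, &lt;tt&gt;check_migrate_elapsed 80 25&lt;/tt&gt; succeeds because the limit is 25 * 350 / 100 = 87 seconds (integer division), while passing a margin of 150 gives a limit of 37 seconds and fails.&lt;/p&gt;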

&lt;p&gt;I wonder if this is exacerbated by the addition of many sleeps, which give the VM more chances to reschedule this thread under contention, or possibly by inaccuracies in the nanosecond clock that hurt the rate calculations.&lt;/p&gt;

&lt;p&gt;Should the algorithm be designed to &quot;catch up&quot; in those cases (possibly exceeding the bandwidth cap if it was previously starved, or at least not going to sleep), or is the intent that it never exceed the bandwidth limit even if previous intervals were slower than necessary (notwithstanding the fact that it is very bursty with its writes)?&lt;/p&gt;</comment>
                            <comment id="365616" author="JIRAUSER18433" created="Sat, 11 Mar 2023 23:08:39 +0000"  >&lt;p&gt;Rescheduling seems more likely to me than clock inaccuracies. A handful of past runs show that the test very reliably hits 25s for the migrate job, with little variability.&lt;/p&gt;

&lt;p&gt;I think it&apos;s preferable that the bandwidth cap is respected as much as possible, even if the migrate falls behind a bit; that makes more sense from a user perspective. I agree that the algorithm could be made less bursty in its writes.&lt;/p&gt;</comment>
                            <comment id="365617" author="gerrit" created="Sat, 11 Mar 2023 23:18:08 +0000"  >&lt;p&gt;&quot;Timothy Day &amp;lt;timday@amazon.com&amp;gt;&quot; uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/c/fs/lustre-release/+/50262&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/c/fs/lustre-release/+/50262&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-16632&quot; title=&quot;sanity test_56xh: &amp;#39;lfs migrate -W&amp;#39; too slow&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-16632&quot;&gt;&lt;del&gt;LU-16632&lt;/del&gt;&lt;/a&gt; tests: more margin of error for 56xh&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 2227fdcd8901262f09ccb669aed8fb3b4addc1a5&lt;/p&gt;</comment>
                            <comment id="367644" author="gerrit" created="Tue, 28 Mar 2023 22:18:44 +0000"  >&lt;p&gt;&quot;Oleg Drokin &amp;lt;green@whamcloud.com&amp;gt;&quot; merged in patch &lt;a href=&quot;https://review.whamcloud.com/c/fs/lustre-release/+/50262/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/c/fs/lustre-release/+/50262/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-16632&quot; title=&quot;sanity test_56xh: &amp;#39;lfs migrate -W&amp;#39; too slow&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-16632&quot;&gt;&lt;del&gt;LU-16632&lt;/del&gt;&lt;/a&gt; tests: more margin of error for sanity/56xh&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 36cbba150bce9e2890c8b462ec2ce4af2d6353a5&lt;/p&gt;</comment>
                            <comment id="367675" author="pjones" created="Wed, 29 Mar 2023 03:30:22 +0000"  >&lt;p&gt;Landed for 2.16&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="58891">LU-13482</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i03g33:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>