<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:56:19 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-12865] sanity test 160f fails with &#8216;mds1: User cl6 not registered&#8217;</title>
                <link>https://jira.whamcloud.com/browse/LU-12865</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;sanity test_160f fails with &#8216;mds1: User cl6 not registered&#8217;. So far this year, there have been 52 sanity test 160f failures with this error; 36 of those failures are for ARM clients.&lt;/p&gt;

&lt;p&gt;Looking at the suite_log for a recent failure, &lt;a href=&quot;https://testing.whamcloud.com/test_sets/8bedad40-ebd5-11e9-b62b-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/8bedad40-ebd5-11e9-b62b-52540065bddc&lt;/a&gt;, we see that user cl6 is registered and that we are able to manipulate the changelog register prior to the error&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;== sanity test 160f: changelog garbage collect (timestamped users) =================================== 20:27:47 (1570566467)
CMD: trevis-49vm2 /usr/sbin/lctl get_param mdd.lustre-MDT0000.changelog_mask -n
CMD: trevis-49vm2 /usr/sbin/lctl set_param mdd.lustre-MDT0000.changelog_mask=+hsm
mdd.lustre-MDT0000.changelog_mask=+hsm
CMD: trevis-49vm2 /usr/sbin/lctl --device lustre-MDT0000 changelog_register -n
CMD: trevis-49vm3 /usr/sbin/lctl get_param mdd.lustre-MDT0001.changelog_mask -n
CMD: trevis-49vm3 /usr/sbin/lctl set_param mdd.lustre-MDT0001.changelog_mask=+hsm
mdd.lustre-MDT0001.changelog_mask=+hsm
CMD: trevis-49vm3 /usr/sbin/lctl --device lustre-MDT0001 changelog_register -n
CMD: trevis-49vm2 /usr/sbin/lctl get_param mdd.lustre-MDT0002.changelog_mask -n
CMD: trevis-49vm2 /usr/sbin/lctl set_param mdd.lustre-MDT0002.changelog_mask=+hsm
mdd.lustre-MDT0002.changelog_mask=+hsm
CMD: trevis-49vm2 /usr/sbin/lctl --device lustre-MDT0002 changelog_register -n
CMD: trevis-49vm3 /usr/sbin/lctl get_param mdd.lustre-MDT0003.changelog_mask -n
CMD: trevis-49vm3 /usr/sbin/lctl set_param mdd.lustre-MDT0003.changelog_mask=+hsm
mdd.lustre-MDT0003.changelog_mask=+hsm
CMD: trevis-49vm3 /usr/sbin/lctl --device lustre-MDT0003 changelog_register -n
Registered 4 changelog users: &apos;cl6 cl6 cl6 cl6&apos;
&#8230;
mds1: verifying user cl6 clear:  19 + 2 == 21
CMD: trevis-49vm2 /usr/sbin/lctl get_param -n mdd.lustre-MDT0000.changelog_users
CMD: trevis-49vm3 /usr/sbin/lctl get_param -n mdd.lustre-MDT0001.changelog_users
CMD: trevis-49vm3 /usr/sbin/lctl get_param -n mdd.lustre-MDT0001.changelog_users
lustre-MDT0001: clear the changelog for cl6 to record #10
CMD: trevis-49vm3 /usr/sbin/lctl get_param -n mdd.lustre-MDT0001.changelog_users
mds2: verifying user cl6 clear:  8 + 2 == 10
CMD: trevis-49vm3 /usr/sbin/lctl get_param -n mdd.lustre-MDT0001.changelog_users
CMD: trevis-49vm2 /usr/sbin/lctl get_param -n mdd.lustre-MDT0002.changelog_users
CMD: trevis-49vm2 /usr/sbin/lctl get_param -n mdd.lustre-MDT0002.changelog_users
lustre-MDT0002: clear the changelog for cl6 to record #2
CMD: trevis-49vm2 /usr/sbin/lctl get_param -n mdd.lustre-MDT0002.changelog_users
mds3: verifying user cl6 clear:  0 + 2 == 2
CMD: trevis-49vm2 /usr/sbin/lctl get_param -n mdd.lustre-MDT0002.changelog_users
CMD: trevis-49vm3 /usr/sbin/lctl get_param -n mdd.lustre-MDT0003.changelog_users
CMD: trevis-49vm3 /usr/sbin/lctl get_param -n mdd.lustre-MDT0003.changelog_users
lustre-MDT0003: clear the changelog for cl6 to record #2
CMD: trevis-49vm3 /usr/sbin/lctl get_param -n mdd.lustre-MDT0003.changelog_users
mds4: verifying user cl6 clear:  0 + 2 == 2
CMD: trevis-49vm3 /usr/sbin/lctl get_param -n mdd.lustre-MDT0003.changelog_users
total: 8 create in 0.02 seconds: 453.39 ops/second
CMD: trevis-49vm2 ps -e -o comm= | grep chlg_gc_thread
pdsh@trevis-79vm17: trevis-49vm2: ssh exited with exit code 1
CMD: trevis-49vm2 ps -e -o comm= | grep chlg_gc_thread
pdsh@trevis-79vm17: trevis-49vm2: ssh exited with exit code 1
CMD: trevis-49vm3 ps -e -o comm= | grep chlg_gc_thread
pdsh@trevis-79vm17: trevis-49vm3: ssh exited with exit code 1
CMD: trevis-49vm3 ps -e -o comm= | grep chlg_gc_thread
pdsh@trevis-79vm17: trevis-49vm3: ssh exited with exit code 1
CMD: trevis-49vm2 /usr/sbin/lctl get_param -n mdd.lustre-MDT0000.changelog_users
 sanity test_160f: @@@@@@ FAIL: mds1: User cl6 not registered 
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;There is no indication of a problem in any of the console logs.&lt;/p&gt;

&lt;p&gt;Logs for other failures are at&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/ecc72250-ccfd-11e9-a25b-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/ecc72250-ccfd-11e9-a25b-52540065bddc&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/8da05bea-cff3-11e9-9fc9-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/8da05bea-cff3-11e9-9fc9-52540065bddc&lt;/a&gt;&lt;/p&gt;</description>
                <environment></environment>
        <key id="57162">LU-12865</key>
            <summary>sanity test 160f fails with &#8216;mds1: User cl6 not registered&#8217;</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="adilger">Andreas Dilger</assignee>
                                    <reporter username="jamesanunez">James Nunez</reporter>
                        <labels>
                    </labels>
                <created>Tue, 15 Oct 2019 19:07:55 +0000</created>
                <updated>Mon, 15 Jan 2024 23:29:01 +0000</updated>
                            <resolved>Sat, 18 Jan 2020 15:01:18 +0000</resolved>
                                    <version>Lustre 2.13.0</version>
                    <version>Lustre 2.12.3</version>
                    <version>Lustre 2.12.4</version>
                    <version>Lustre 2.12.5</version>
                                    <fixVersion>Lustre 2.14.0</fixVersion>
                    <fixVersion>Lustre 2.12.6</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>4</watches>
                                                                            <comments>
                            <comment id="256543" author="adilger" created="Thu, 17 Oct 2019 07:45:01 +0000"  >&lt;p&gt;It looks like this test failure might be a problem in the test script itself?  Looking at the MDT0000 debug log shows that the &lt;tt&gt;cl6&lt;/tt&gt; user is deregistered by the GC thread because it is idle for too long:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;1570566291.903442:0:27506:0:(mdd_trans.c:192:mdd_chlg_garbage_collect()) lustre-MDD0000: Force deregister of ChangeLog user cl7 idle since more than 35s
1570566291.903509:0:27506:0:(mdd_trans.c:192:mdd_chlg_garbage_collect()) lustre-MDD0000: Force deregister of ChangeLog user cl6 idle since more than 11s
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;This is because the test sets &lt;tt&gt;changelog_max_idle_time=10&lt;/tt&gt;, but sleeps for 9s and then does a number of operations that could be slow.  In rare cases the test ran too long and the MDS evicted the &quot;good&quot; along with the bad one.&lt;/p&gt;

&lt;p&gt;Patch to fix the test forthcoming.&lt;/p&gt;</comment>
                            <comment id="256544" author="gerrit" created="Thu, 17 Oct 2019 07:45:21 +0000"  >&lt;p&gt;Andreas Dilger (adilger@whamcloud.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/36468&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/36468&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-12865&quot; title=&quot;sanity test 160f fails with &#8216;mds1: User cl6 not registered&#8217;&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-12865&quot;&gt;&lt;del&gt;LU-12865&lt;/del&gt;&lt;/a&gt; tests: fix sanity 160f to be more robust&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 7e2a15299f1180d8b7026f46886c85dbf4dacd7b&lt;/p&gt;</comment>
                            <comment id="261475" author="gerrit" created="Sat, 18 Jan 2020 04:04:26 +0000"  >&lt;p&gt;Oleg Drokin (green@whamcloud.com) merged in patch &lt;a href=&quot;https://review.whamcloud.com/36468/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/36468/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-12865&quot; title=&quot;sanity test 160f fails with &#8216;mds1: User cl6 not registered&#8217;&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-12865&quot;&gt;&lt;del&gt;LU-12865&lt;/del&gt;&lt;/a&gt; tests: fix sanity 160f to be more robust&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 4b0f0164c6ed761897409186376e9edc989323c9&lt;/p&gt;</comment>
                            <comment id="272009" author="gerrit" created="Thu, 4 Jun 2020 18:47:57 +0000"  >&lt;p&gt;James Nunez (jnunez@whamcloud.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/38833&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/38833&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-12865&quot; title=&quot;sanity test 160f fails with &#8216;mds1: User cl6 not registered&#8217;&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-12865&quot;&gt;&lt;del&gt;LU-12865&lt;/del&gt;&lt;/a&gt; tests: fix sanity 160f to be more robust&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: b2_12&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 011b8e3eac372032dd8f09576239e7a05d496251&lt;/p&gt;</comment>
                            <comment id="275108" author="gerrit" created="Sat, 11 Jul 2020 07:28:25 +0000"  >&lt;p&gt;Oleg Drokin (green@whamcloud.com) merged in patch &lt;a href=&quot;https://review.whamcloud.com/38833/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/38833/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-12865&quot; title=&quot;sanity test 160f fails with &#8216;mds1: User cl6 not registered&#8217;&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-12865&quot;&gt;&lt;del&gt;LU-12865&lt;/del&gt;&lt;/a&gt; tests: fix sanity 160f to be more robust&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: b2_12&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: fdc1320ea1a817182291ec67394a903b2e0a4911&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="50857">LU-10680</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="57176">LU-12871</issuekey>
        </issuelink>
                            </outwardlinks>
                                                                <inwardlinks description="is related to">
                                                        </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i00o33:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>