<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:34:32 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-17327] Write conf-santity test case for online OST and MDT addition</title>
                <link>https://jira.whamcloud.com/browse/LU-17327</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;We need an automated test to exercise adding OSTs and MDTs online to a live filesystem that is under load.&lt;/p&gt;

&lt;p&gt;Andreas provided this guidance:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Add a conf-sanity test to format MDTs and OSTs, mount the first half of them, start a workload (eg. &quot;&lt;tt&gt;rsync -a /etc /usr/lib $DIR/$tdir&lt;/tt&gt;&quot;), and then mount the second half of MDTs and OSTs).&lt;/p&gt;

&lt;p&gt;You can likely copy test_46a to test_46b and add in the &lt;tt&gt;rsync&lt;/tt&gt; workload instead of waiting nicely for the second set of OSTs to be added.&lt;/p&gt;&lt;/blockquote&gt;</description>
                <environment></environment>
        <key id="79266">LU-17327</key>
            <summary>Write conf-santity test case for online OST and MDT addition</summary>
                <type id="4" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11310&amp;avatarType=issuetype">Improvement</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="1" iconUrl="https://jira.whamcloud.com/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="default"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="yujian">Jian Yu</assignee>
                                    <reporter username="yujian">Jian Yu</reporter>
                        <labels>
                    </labels>
                <created>Thu, 30 Nov 2023 21:37:38 +0000</created>
                <updated>Sat, 3 Feb 2024 00:41:44 +0000</updated>
                                            <version>Lustre 2.16.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                            <comments>
                            <comment id="395040" author="gerrit" created="Thu, 30 Nov 2023 21:38:25 +0000"  >&lt;p&gt;&quot;Jian Yu &amp;lt;yujian@whamcloud.com&amp;gt;&quot; uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/c/fs/lustre-release/+/53300&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/c/fs/lustre-release/+/53300&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-17327&quot; title=&quot;Write conf-santity test case for online OST and MDT addition&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-17327&quot;&gt;LU-17327&lt;/a&gt; tests: add test case for online MDT and OST addition&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 61e7844a131c94ddb58f367ef134e5961c3b143e&lt;/p&gt;</comment>
                            <comment id="395387" author="adilger" created="Mon, 4 Dec 2023 19:16:35 +0000"  >&lt;p&gt;It looks like the test case has exposed an expected failure case where the MDS created a file with an object on the newly-added OST but the client wasn&apos;t aware of the new OST yet:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[  183.117948] Lustre: DEBUG MARKER: == conf-sanity test 46b: online OST and MDT addition ===== 16:48:36 (1701380916)
[  206.830361] Lustre: Mounted lustre-client
[  214.067224] LustreError: 14230:0:(lov_ea.c:279:lsme_unpack()) lustre-clilov_UUID: OST index 1 more than OST count 1
[  214.070240] Lustre: 14230:0:(lov_pack.c:57:lov_dump_lmm_common()) objid 0x2ab:1025, magic 0x0bd10bd0, pattern 0x1
[  214.072822] Lustre: 14230:0:(lov_pack.c:61:lov_dump_lmm_common()) stripe_size 4194304, stripe_count 1, layout_gen 0
[  214.075459] Lustre: 14230:0:(lov_pack.c:81:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x2c0000401:2
[  214.078104] LustreError: 14230:0:(lcommon_cl.c:196:cl_file_inode_init()) lustre: failed to initialize cl_object [0x200000401:0x2ab:0x0]: rc = -22
[  214.081653] LustreError: 14230:0:(llite_lib.c:3613:ll_prep_inode()) new_inode -fatal: rc -22
[  460.900709] Lustre: DEBUG MARKER: conf-sanity test_46b: @@@@@@ FAIL: rsync failed
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Issue &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-17334&quot; title=&quot;Client should handle dir/file/object created on newly added MDT/OST&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-17334&quot;&gt;LU-17334&lt;/a&gt; is tracking the fix for client gracefully handling of this case, while &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-17300&quot; title=&quot;Avoid creating new dir/file/object on newly added MDT/OST&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-17300&quot;&gt;LU-17300&lt;/a&gt; is tracking the fix to avoid creating such files in the first place.  Both fixes are useful to implement for interop and reliability reasons.&lt;/p&gt;</comment>
                            <comment id="395388" author="adilger" created="Mon, 4 Dec 2023 19:27:04 +0000"  >&lt;p&gt;I looked through the test results on Gerrit Janitor and 100% of the test runs for the new test_46b failed, but 40/44 test runs &lt;b&gt;only&lt;/b&gt; failed because they ran out of space while copying the source trees into the test filesystem:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Started lustre-OST0001
waiting for rsync to finish
rsync: mkstemp &quot;/mnt/lustre/d46b.conf-sanity/lib/kbd/keymaps/xkb/.hr-alternatequotes.map.gz.QHFVy3&quot; failed: No space left on device (28)
rsync: mkstemp &quot;/mnt/lustre/d46b.conf-sanity/lib/kbd/keymaps/xkb/.hr-unicode.map.gz.lliNpP&quot; failed: No space left on device (28)
rsync: mkstemp &quot;/mnt/lustre/d46b.conf-sanity/lib/kbd/keymaps/xkb/.hr-unicodeus.map.gz.AAhphB&quot; failed: No space left on device (28)
rsync: mkstemp &quot;/mnt/lustre/d46b.conf-sanity/lib/kbd/keymaps/xkb/.hr-us.map.gz.ID9K9m&quot; failed: No space left on device (28)
:
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;There were 4 cases that failed due to the MDS creating a file on a new OST that the client didn&apos;t know existed yet (with errors on the client console as in the previous comment):&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;waiting for rsync to finish
rsync: mkstemp &quot;/mnt/lustre/d46b.conf-sanity/etc/.cron.deny.PDs0S0&quot; failed: Invalid argument (22)
rsync: mkstemp &quot;/mnt/lustre/d46b.conf-sanity/etc/.crypttab.FFAEg4&quot; failed: Invalid argument (22)
rsync: mkstemp &quot;/mnt/lustre/d46b.conf-sanity/etc/.csh.login.f7BhT7&quot; failed: Invalid argument (22)
rsync: write failed on &quot;/mnt/lustre/d46b.conf-sanity/lib/locale/locale-archive&quot;: No space left on device (28)
rsync error: error in file IO (code 11) at receiver.c(393) [receiver=3.1.2]
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="395390" author="adilger" created="Mon, 4 Dec 2023 19:33:04 +0000"  >&lt;p&gt;The test runs on Autotest showed much more chance of hitting the object-on-new-OST creation race, with a long list of files being created on new OSTs:&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/be67a6fa-8bc9-4538-be28-dcf2688028a3&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/be67a6fa-8bc9-4538-be28-dcf2688028a3&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/036c867c-6d9c-4c1c-9403-63b24195a873&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/036c867c-6d9c-4c1c-9403-63b24195a873&lt;/a&gt;&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;rsync: mkstemp &quot;/mnt/lustre/d46b.conf-sanity/lib/dracut/modules.d/99shutdown/.shutdown.sh.iiLyky&quot; failed: Invalid argument (22)
rsync: mkstemp &quot;/mnt/lustre/d46b.conf-sanity/lib/dracut/modules.d/99squash/.shchkdir.V60fMT&quot; failed: Invalid argument (22)
rsync: mkstemp &quot;/mnt/lustre/d46b.conf-sanity/lib/dracut/modules.d/99squash/.module-setup.sh.TDJhBN&quot; failed: Invalid argument (22)
rsync: mkstemp &quot;/mnt/lustre/d46b.conf-sanity/lib/firewalld/helpers/.RAS.xml.zYZx1S&quot; failed: Invalid argument (22)
rsync: mkstemp &quot;/mnt/lustre/d46b.conf-sanity/lib/firewalld/helpers/.amanda.xml.okGbNj&quot; failed: Invalid argument (22)
rsync: mkstemp &quot;/mnt/lustre/d46b.conf-sanity/lib/firewalld/helpers/.ftp.xml.Rsg5G5&quot; failed: Invalid argument (22)
:
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;and the client console logs showing this error was hit for each new OST addition.&lt;/p&gt;

&lt;p&gt;This is likely because there are more Autotest OSTs to be added (6) instead of Janitor (only 1):&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;:
[  605.944711] LustreError: 41994:0:(lcommon_cl.c:196:cl_file_inode_init()) lustre: failed to initialize cl_object [0x200000401:0x5bc:0x0]: rc = -22
[  605.946775] LustreError: 41994:0:(lcommon_cl.c:196:cl_file_inode_init()) Skipped 8 previous similar messages
[  605.948389] LustreError: 41994:0:(llite_lib.c:3613:ll_prep_inode()) new_inode -fatal: rc -22
[  605.949762] LustreError: 41994:0:(llite_lib.c:3613:ll_prep_inode()) Skipped 8 previous similar messages
[  606.954945] LustreError: 41994:0:(lov_ea.c:279:lsme_unpack()) lustre-clilov_UUID: OST index 2 more than OST count 2
[  606.956738] LustreError: 41994:0:(lov_ea.c:279:lsme_unpack()) Skipped 25 previous similar messages
[  606.958203] Lustre: 41994:0:(lov_pack.c:57:lov_dump_lmm_common()) objid 0x663:1025, magic 0x0bd10bd0, pattern 0x1
[  606.959944] Lustre: 41994:0:(lov_pack.c:57:lov_dump_lmm_common()) Skipped 25 previous similar messages
[  606.961495] Lustre: 41994:0:(lov_pack.c:61:lov_dump_lmm_common()) stripe_size 4194304, stripe_count 1, layout_gen 0
[  606.963239] Lustre: 41994:0:(lov_pack.c:61:lov_dump_lmm_common()) Skipped 25 previous similar messages
[  606.964782] Lustre: 41994:0:(lov_pack.c:81:lov_dump_lmm_objects()) stripe 0 idx 2 subobj 0x2c0000400:37
[  606.966319] Lustre: 41994:0:(lov_pack.c:81:lov_dump_lmm_objects()) Skipped 25 previous similar messages
[  606.967869] LustreError: 41994:0:(lcommon_cl.c:196:cl_file_inode_init()) lustre: failed to initialize cl_object [0x200000401:0x663:0x0]: rc = -22
[  606.969972] LustreError: 41994:0:(lcommon_cl.c:196:cl_file_inode_init()) Skipped 25 previous similar messages
[  606.971572] LustreError: 41994:0:(llite_lib.c:3613:ll_prep_inode()) new_inode -fatal: rc -22
[  606.972935] LustreError: 41994:0:(llite_lib.c:3613:ll_prep_inode()) Skipped 25 previous similar messages
[  626.075632] LustreError: 41994:0:(lov_ea.c:279:lsme_unpack()) lustre-clilov_UUID: OST index 5 more than OST count 5
[  626.082235] LustreError: 41994:0:(lov_ea.c:279:lsme_unpack()) Skipped 5 previous similar messages
[  626.083709] Lustre: 41994:0:(lov_pack.c:57:lov_dump_lmm_common()) objid 0x147a:1025, magic 0x0bd10bd0, pattern 0x1
[  626.085377] Lustre: 41994:0:(lov_pack.c:57:lov_dump_lmm_common()) Skipped 5 previous similar messages
[  626.086871] Lustre: 41994:0:(lov_pack.c:61:lov_dump_lmm_common()) stripe_size 4194304, stripe_count 1, layout_gen 0
[  626.088539] Lustre: 41994:0:(lov_pack.c:61:lov_dump_lmm_common()) Skipped 5 previous similar messages
[  626.090037] Lustre: 41994:0:(lov_pack.c:81:lov_dump_lmm_objects()) stripe 0 idx 5 subobj 0x380000400:2
[  626.091542] Lustre: 41994:0:(lov_pack.c:81:lov_dump_lmm_objects()) Skipped 5 previous similar messages
[  626.093115] LustreError: 41994:0:(lcommon_cl.c:196:cl_file_inode_init()) lustre: failed to initialize cl_object [0x200000401:0x147a:0x0]: rc = -22
[  626.095205] LustreError: 41994:0:(lcommon_cl.c:196:cl_file_inode_init()) Skipped 5 previous similar messages
[  626.096842] LustreError: 41994:0:(llite_lib.c:3613:ll_prep_inode()) new_inode -fatal: rc -22
[  626.098230] LustreError: 41994:0:(llite_lib.c:3613:ll_prep_inode()) Skipped 5 previous similar messages
[  632.765740] LustreError: 41994:0:(lov_ea.c:279:lsme_unpack()) lustre-clilov_UUID: OST index 6 more than OST count 6
[  632.767467] LustreError: 41994:0:(lov_ea.c:279:lsme_unpack()) Skipped 36 previous similar messages
[  632.768944] Lustre: 41994:0:(lov_pack.c:57:lov_dump_lmm_common()) objid 0x16a9:1025, magic 0x0bd10bd0, pattern 0x1
[  632.770626] Lustre: 41994:0:(lov_pack.c:57:lov_dump_lmm_common()) Skipped 36 previous similar messages
[  632.772137] Lustre: 41994:0:(lov_pack.c:61:lov_dump_lmm_common()) stripe_size 4194304, stripe_count 1, layout_gen 0
[  632.773800] Lustre: 41994:0:(lov_pack.c:61:lov_dump_lmm_common()) Skipped 36 previous similar messages
[  632.775283] Lustre: 41994:0:(lov_pack.c:81:lov_dump_lmm_objects()) stripe 0 idx 6 subobj 0x3c0000400:2
[  632.776766] Lustre: 41994:0:(lov_pack.c:81:lov_dump_lmm_objects()) Skipped 36 previous similar messages
[  632.778285] LustreError: 41994:0:(lcommon_cl.c:196:cl_file_inode_init()) lustre: failed to initialize cl_object [0x200000401:0x16a9:0x0]: rc = -22
[  632.780422] LustreError: 41994:0:(lcommon_cl.c:196:cl_file_inode_init()) Skipped 36 previous similar messages
:
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="395648" author="gerrit" created="Wed, 6 Dec 2023 09:52:25 +0000"  >&lt;p&gt;&lt;del&gt;&quot;Andreas Dilger &amp;lt;adilger@whamcloud.com&amp;gt;&quot; uploaded a new patch:&lt;/del&gt; &lt;a href=&quot;https://review.whamcloud.com/c/fs/lustre-release/+/53343&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/c/fs/lustre-release/+/53343&lt;/a&gt;&lt;br/&gt;
&lt;del&gt;Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-17327&quot; title=&quot;Write conf-santity test case for online OST and MDT addition&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-17327&quot;&gt;LU-17327&lt;/a&gt; tests: conf-sanity/46b to avoid QOS&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Project: fs/lustre-release&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Branch: master&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Current Patch Set: 1&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Commit: 30534d63510a1cefe07f7a79934b6f1428d6aa18&lt;/del&gt;&lt;/p&gt;</comment>
                            <comment id="396101" author="gerrit" created="Fri, 8 Dec 2023 21:28:36 +0000"  >&lt;p&gt;&lt;del&gt;&quot;Andreas Dilger &amp;lt;adilger@whamcloud.com&amp;gt;&quot; uploaded a new patch:&lt;/del&gt; &lt;a href=&quot;https://review.whamcloud.com/c/fs/lustre-release/+/53390&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/c/fs/lustre-release/+/53390&lt;/a&gt;&lt;br/&gt;
&lt;del&gt;Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-17327&quot; title=&quot;Write conf-santity test case for online OST and MDT addition&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-17327&quot;&gt;LU-17327&lt;/a&gt; tests: conf-sanity/46c to avoid MDT balance&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Project: fs/lustre-release&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Branch: master&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Current Patch Set: 1&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Commit: 1bfdb01a73c30f5ac5e32e1a10015b1f907019b3&lt;/del&gt;&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="79003">LU-17300</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="79318">LU-17334</issuekey>
        </issuelink>
                            </outwardlinks>
                                                                <inwardlinks description="is related to">
                                                        </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i043ef:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>