<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:05:13 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-7009] Testing DNE on ZFS</title>
                <link>https://jira.whamcloud.com/browse/LU-7009</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;I think the best approach here is to run the DNE-on-ZFS testing a few times to see which test scripts are passing regularly, and then move the passing scripts (or passing with minimal additions to ALWAYS_EXCEPT) to review-zfs-part-1 running with 4 MDTs and leave the failing tests (if not too many) in review-zfs-part-2 running with 1 MDT until we can fix the remaining test issues.  As tests pass they can be moved into review-zfs-part-1 until it gets too large, and then just add the remaining few tests to ALWAYS_EXCEPT and set review-zfs-part-2 to also run with 4 MDTs and rebalance the tests to run with approximately the same time.&lt;/p&gt;

&lt;p&gt;That way we can start basic DNE-on-ZFS testing ASAP and migrate the remaining ZFS tests to DNE without having to skip many of the tests.&lt;/p&gt;

&lt;p&gt;Of course, if DNE-on-ZFS is mostly or completely passing today then this incremental migration to DNE testing can be skipped.&lt;/p&gt;</description>
                <environment></environment>
        <key id="31493">LU-7009</key>
            <summary>Testing DNE on ZFS</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="10000">Done</resolution>
                                        <assignee username="jamesanunez">James Nunez</assignee>
                                    <reporter username="di.wang">Di Wang</reporter>
                        <labels>
                    </labels>
                <created>Fri, 14 Aug 2015 21:23:26 +0000</created>
                <updated>Fri, 15 Nov 2019 15:14:14 +0000</updated>
                            <resolved>Fri, 15 Nov 2019 15:14:14 +0000</resolved>
                                    <version>Lustre 2.8.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>11</watches>
                                                                            <comments>
                            <comment id="125484" author="gerrit" created="Fri, 28 Aug 2015 03:28:35 +0000"  >&lt;p&gt;Lai Siyao (lai.siyao@intel.com) uploaded a new patch: &lt;a href=&quot;http://review.whamcloud.com/16122&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/16122&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-7009&quot; title=&quot;Testing DNE on ZFS&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-7009&quot;&gt;&lt;del&gt;LU-7009&lt;/del&gt;&lt;/a&gt; test: test DNE on ZFS&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: fe93134b19331e9300f339c4584390b37fced368&lt;/p&gt;</comment>
                            <comment id="125957" author="adilger" created="Tue, 1 Sep 2015 22:09:50 +0000"  >&lt;p&gt;See also &lt;a href=&quot;https://testing.hpdd.intel.com/test_sessions/7f7566d8-d281-11e4-a20f-5254006e85c2&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.hpdd.intel.com/test_sessions/7f7566d8-d281-11e4-a20f-5254006e85c2&lt;/a&gt; which has results from &lt;a href=&quot;http://review.whamcloud.com/14148&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/14148&lt;/a&gt; but back in March.  Not sure if those test results are all still available, but at least it provides one more data point for this testing.&lt;/p&gt;</comment>
                            <comment id="125962" author="di.wang" created="Tue, 1 Sep 2015 22:33:56 +0000"  >&lt;p&gt;Met this issue when testing on shadow&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;19:31:31:Loading modules from /usr/lib64/lustre
19:31:31:detected 2 online CPUs by sysfs
19:31:31:Force libcfs to create 2 CPU partitions
19:31:31:debug=vfstrace rpctrace dlmtrace neterror ha config 		      ioctl super lfsck
19:31:31:subsystem_debug=all -lnet -lnd -pinger
19:31:33:Formatting mgs, mds, osts
19:31:33:/usr/lib64/lustre/tests/test-framework.sh: line 3157: [: too many arguments
19:31:35:/usr/lib64/lustre/tests/test-framework.sh: line 3157: [: too many arguments
19:31:35:Format mgs: lustre-mgs/mgs
19:31:35:/usr/lib64/lustre/tests/test-framework.sh: line 3157: [: too many arguments
19:31:36:/usr/lib64/lustre/tests/test-framework.sh: line 3157: [: too many arguments
19:31:36:/usr/lib64/lustre/tests/test-framework.sh: line 3157: [: too many arguments
19:31:37:CMD: shadow-50vm3 grep -c /mnt/mgs&apos; &apos; /proc/mounts
19:31:37:CMD: shadow-50vm3 lsmod | grep lnet &amp;gt; /dev/null &amp;amp;&amp;amp; lctl dl | grep &apos; ST &apos;
19:31:37:/usr/lib64/lustre/tests/test-framework.sh: line 3157: [: too many arguments
19:31:38:CMD: shadow-50vm3 ! zpool list -H lustre-mgs &amp;gt;/dev/null 2&amp;gt;&amp;amp;1 ||
19:31:38:			grep -q ^lustre-mgs/ /proc/mounts ||
19:31:38:			zpool export  lustre-mgs
19:31:39:CMD: shadow-50vm3 mkfs.lustre --mgs --param=sys.timeout=20 --backfstype=zfs --device-size=2097152 --reformat lustre-mgs/mgs /dev/lvm-Role_MDS/P1
19:31:39:shadow-50vm3: 
19:31:40:shadow-50vm3: mkfs.lustre FATAL: unhandled fs type 5 &apos;zfs&apos;
19:31:40:shadow-50vm3: 
19:31:40:shadow-50vm3: mkfs.lustre FATAL: unable to prepare backend (22)
19:31:40:shadow-50vm3: mkfs.lustre: exiting with 22 (Invalid argument)
20:31:24:********** Timeout by autotest system **********
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Looks like a TEI issue.&lt;/p&gt;</comment>
                            <comment id="128040" author="pjones" created="Tue, 22 Sep 2015 03:26:26 +0000"  >&lt;p&gt;This testing is complete and tickets were created for the issues found&lt;/p&gt;</comment>
                            <comment id="128042" author="adilger" created="Tue, 22 Sep 2015 04:03:12 +0000"  >&lt;p&gt;I&apos;d rather this not be closed until we are actually testing DNE+ZFS automatically via autotest. &lt;/p&gt;

&lt;p&gt;Per the original comment, if there are tests that are already passing them they can be moved into an autotest config file separate from the tests that are currently failing and/or disabled. If enough tests are passing, we could just rearrange review-zfs-part-&lt;/p&gt;
{1,2}
&lt;p&gt; and change the config for one of them to start testing with MDSCOUNT=2.&lt;/p&gt;

&lt;p&gt;It makes sense to start doing this as soon as possible to avoid regressions while the existing bugs are fixed or disabled. &lt;/p&gt;</comment>
                            <comment id="128046" author="pjones" created="Tue, 22 Sep 2015 04:14:55 +0000"  >&lt;p&gt;That&apos;s a good point Andreas. We&apos;d need to land either fixes for the bugs or else temporarily disable the affected tests while the fix was being worked on and then make the TEI request. Should we start by introducing that change on full test runs first and then rolling out to master after a few successful runs?&lt;/p&gt;</comment>
                            <comment id="128048" author="adilger" created="Tue, 22 Sep 2015 05:14:48 +0000"  >&lt;p&gt;I&apos;m reluctant to break &quot;full&quot; runs more than they already are today.  Also, history shows that tests running on master get far less attention than those running on review.  Rather, I think the existing testing patch can be expanded until it starts passing or skipping (for only the ZFS+DNE config) all the tests currently being run for review-zfs-part-&lt;/p&gt;
{1,2}
&lt;p&gt; and then landed like any other patch.&lt;/p&gt;

&lt;p&gt;Since it would only be adding exceptions for the ZFS+DNE tests it couldn&apos;t cause breakage when it lands. It should be including full runs of those tests so should be OK to enable the DNE config afterward, as long as we leave a non-DNE test running for the excepted or skipped tests, as we do with review-ldiskfs today. &lt;/p&gt;</comment>
                            <comment id="188794" author="jamesanunez" created="Fri, 17 Mar 2017 16:42:35 +0000"  >&lt;p&gt;We are running the &apos;full&apos; test group with ZFS + DNE. Some recent results are at:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://testing.hpdd.intel.com/test_sessions/9803525a-f453-11e6-b655-5254006e85c2&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.hpdd.intel.com/test_sessions/9803525a-f453-11e6-b655-5254006e85c2&lt;/a&gt; &lt;br/&gt;
&lt;a href=&quot;https://testing.hpdd.intel.com/test_sessions/d38b0f30-0262-11e7-a4d2-5254006e85c2&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.hpdd.intel.com/test_sessions/d38b0f30-0262-11e7-a4d2-5254006e85c2&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://testing.hpdd.intel.com/test_sessions/d5adbff2-096e-11e7-b5b2-5254006e85c2&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.hpdd.intel.com/test_sessions/d5adbff2-096e-11e7-b5b2-5254006e85c2&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;There are many failures, but there are a few that consistently fail:&lt;br/&gt;
sanity-lfsck test_5e (&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-8840&quot; title=&quot;sanity-lfsck test_2e: @@@@@@ FAIL: (5) Fail to repair crashed linkEA: 0&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-8840&quot;&gt;&lt;del&gt;LU-8840&lt;/del&gt;&lt;/a&gt;)&lt;br/&gt;
sanity-quota test_12b (Ticket ??)&lt;/p&gt;</comment>
                            <comment id="193846" author="adilger" created="Thu, 27 Apr 2017 23:58:24 +0000"  >&lt;p&gt;Rather than doing &quot;full&quot; test runs (which we know have permanent failures even with ldiskfs), it would be better to run &lt;tt&gt;review-dne-part-1&lt;/tt&gt; and &lt;tt&gt;review-dne-part-2&lt;/tt&gt; with &lt;tt&gt;mdtfilesystemtype=zfs&lt;/tt&gt; and &lt;tt&gt;ostfilesystemtype=zfs&lt;/tt&gt; (or &lt;tt&gt;review-zfs-part-1&lt;/tt&gt; and &lt;tt&gt;review-zfs-part-2&lt;/tt&gt; with &lt;tt&gt;mdscount=4&lt;/tt&gt;), so that we can have an apples-to-apples comparison of tests that are failing only with ZFS.&lt;/p&gt;

&lt;p&gt;I see that there is also a regular &lt;tt&gt;replay-single&lt;/tt&gt; test failure, but other than that and the two failures mentioned above, I think there are enough tests that ARE passing that we should create a new test session &lt;tt&gt;review-zfs&lt;/tt&gt; that has only the failing DNE ZFS tests, and change the main ZFS test sessions to have multiple MDTs/MDS.  It would be good to do this now that we are past the 2.10 feature freeze and before the release so that we have high confidence in this code and there are no (further?) regressions.&lt;/p&gt;</comment>
                            <comment id="231450" author="adilger" created="Sat, 4 Aug 2018 18:49:36 +0000"  >&lt;p&gt;James, not sure if this is already duplicate with another ticket you are using to track this work?&lt;/p&gt;</comment>
                            <comment id="258340" author="adilger" created="Fri, 15 Nov 2019 01:28:49 +0000"  >&lt;p&gt;James, I think this work is done now?  We are always testing review-dne-zfs-partX for each patch.  I think this can be closed.&lt;/p&gt;</comment>
                            <comment id="258386" author="jamesanunez" created="Fri, 15 Nov 2019 15:14:14 +0000"  >&lt;p&gt;There are multiple test groups that run in a DNE environment namely review-dne-*&lt;del&gt;part&lt;/del&gt;&lt;/p&gt;
{1,2,3,4}
&lt;p&gt;. Thus, this ticket can be closed.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="32253">LU-7191</issuekey>
        </issuelink>
                            </outwardlinks>
                                                                <inwardlinks description="is related to">
                                        <issuelink>
            <issuekey id="31033">LU-6831</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="32254">LU-7192</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzxkkn:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>