<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:37:19 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-10686] sanity-pfl test 9 fails with &#8220;[0x100010000:0x6025:0x0] !=  &#8220;</title>
                <link>https://jira.whamcloud.com/browse/LU-10686</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Since lustre-master build # 3703, 2.10.57.57, on 2018-01-31 we see sanity-pfl test_9 failing to get and compare the FID of the file&#8217;s second component with the error&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[0x100010000:0x6025:0x0] !=  
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;The FID of the second component of the file after MDS failover should be on the right hand side of the &#8220;!=&#8221;.&lt;/p&gt;

&lt;p&gt;Looking at the suite_log, we see that there is some issue writing to the file&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;dd: error writing &apos;/mnt/lustre/d9.sanity-pfl/f9.sanity-pfl&apos;: No data available
1+0 records in
0+0 records out
0 bytes copied, 0.000605975 s, 0.0 kB/s
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;We know the file system isn&#8217;t full because, earlier in the test, &#8216;lfs df&#8217; is printed and shows the file system only 2% full. I see this &apos;No data available&apos; message when trying to reproduce this issue outside of autotest even without the replay-barrier and when the test succeeds. So, this is probably not the cause of the failure.&lt;/p&gt;

&lt;p&gt;Right after the failed write and prior to the MDS failover, we get the FID of the second component&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;before MDS recovery, the ost fid of 2nd component is [0x100010000:0x6025:0x0]
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;We then failover the MDS and, it looks like it is back on-line, we can&#8217;t get the FID of the second component&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;onyx-32vm2: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 16 sec
onyx-32vm1: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 16 sec
after MDS recovery, the ost fid of 2nd component is 
 sanity-pfl test_9: @@@@@@ FAIL: [0x100010000:0x6025:0x0] !=  
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;There doesn&#8217;t seem to be anything enlightening in the console and dmesg logs. Looking at the MDS log, we see the second component created&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;00020000:01000000:0.0:1518721834.723830:0:29881:0:(lod_pool.c:919:lod_find_pool()) lustre-MDT0000-osd: request for an unknown pool (test_85b)
00000004:00080000:0.0:1518721834.723866:0:29881:0:(osp_object.c:1546:osp_create()) lustre-OST0001-osc-MDT0000: Wrote last used FID: [0x100010000:0x6025:0x0], index 1: 0
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Logs for this failure are at &lt;br/&gt;
&lt;a href=&quot;https://testing.hpdd.intel.com/test_sets/2df1628e-0736-11e8-a6ad-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.hpdd.intel.com/test_sets/2df1628e-0736-11e8-a6ad-52540065bddc&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://testing.hpdd.intel.com/test_sets/8c95d88c-0732-11e8-a7cd-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.hpdd.intel.com/test_sets/8c95d88c-0732-11e8-a7cd-52540065bddc&lt;/a&gt;&lt;/p&gt;</description>
                <environment></environment>
        <key id="50877">LU-10686</key>
            <summary>sanity-pfl test 9 fails with &#8220;[0x100010000:0x6025:0x0] !=  &#8220;</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="3">Duplicate</resolution>
                                        <assignee username="bobijam">Zhenyu Xu</assignee>
                                    <reporter username="jamesanunez">James Nunez</reporter>
                        <labels>
                    </labels>
                <created>Tue, 20 Feb 2018 20:22:54 +0000</created>
                <updated>Mon, 26 Aug 2019 23:46:21 +0000</updated>
                            <resolved>Tue, 25 Sep 2018 10:36:58 +0000</resolved>
                                    <version>Lustre 2.11.0</version>
                    <version>Lustre 2.12.0</version>
                    <version>Lustre 2.10.4</version>
                    <version>Lustre 2.10.5</version>
                    <version>Lustre 2.10.6</version>
                    <version>Lustre 2.10.7</version>
                                    <fixVersion>Lustre 2.12.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>8</watches>
                                                                            <comments>
                            <comment id="221964" author="pjones" created="Wed, 28 Feb 2018 19:41:07 +0000"  >&lt;p&gt;Bobijam&lt;/p&gt;

&lt;p&gt;Could you please investigate?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="222859" author="jamesanunez" created="Thu, 8 Mar 2018 18:22:02 +0000"  >&lt;p&gt;We only see this during full test session testing and not in review testing; the testing we do for every patch.&lt;/p&gt;
</comment>
                            <comment id="225571" author="mdiep" created="Mon, 9 Apr 2018 23:17:58 +0000"  >&lt;p&gt;+1 on 2.10 &lt;a href=&quot;https://testing.hpdd.intel.com/test_sets/816b22e2-3aa3-11e8-8f8a-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.hpdd.intel.com/test_sets/816b22e2-3aa3-11e8-8f8a-52540065bddc&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="226581" author="sarah" created="Mon, 23 Apr 2018 20:42:35 +0000"  >&lt;p&gt;+1 on master &lt;a href=&quot;https://testing.hpdd.intel.com/test_sets/d761fec6-471b-11e8-95c0-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.hpdd.intel.com/test_sets/d761fec6-471b-11e8-95c0-52540065bddc&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;I did some search on Maloo, it seems the failure only seen on single MDT config, which is why review testing pass since sanity-pfl is run with dne config&lt;/p&gt;</comment>
                            <comment id="231550" author="gerrit" created="Mon, 6 Aug 2018 21:32:31 +0000"  >&lt;p&gt;James Nunez (jnunez@whamcloud.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/32945&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/32945&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10686&quot; title=&quot;sanity-pfl test 9 fails with &#8220;[0x100010000:0x6025:0x0] !=  &#8220;&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10686&quot;&gt;&lt;del&gt;LU-10686&lt;/del&gt;&lt;/a&gt; tests: stop running sanity-pfl test 9&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: bee808d14011fb74aa5847674cf90b3a406afbc6&lt;/p&gt;</comment>
                            <comment id="232189" author="gerrit" created="Sat, 18 Aug 2018 02:24:01 +0000"  >&lt;p&gt;Oleg Drokin (green@whamcloud.com) merged in patch &lt;a href=&quot;https://review.whamcloud.com/32945/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/32945/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10686&quot; title=&quot;sanity-pfl test 9 fails with &#8220;[0x100010000:0x6025:0x0] !=  &#8220;&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10686&quot;&gt;&lt;del&gt;LU-10686&lt;/del&gt;&lt;/a&gt; tests: stop running sanity-pfl test 9&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 1ca1da79a9e6b2af9f89a6c237d40b0333f64965&lt;/p&gt;</comment>
                            <comment id="233331" author="paf" created="Tue, 11 Sep 2018 15:34:43 +0000"  >&lt;p&gt;Well, I&apos;m not sure why it doesn&apos;t pass in the single MDT config, but the problem with the test is pretty simple - We&apos;re writing beyond the defined layout for the file.&#160; This test doesn&apos;t actually instantiate the layout.&#160; The layout goes to 2 MiB, but the dd write is &lt;b&gt;at&lt;/b&gt; 2 MiB.&#160; That gets ENODATA because it&apos;s beyond the end of the layout.&lt;/p&gt;

&lt;p&gt;I&apos;ll push a patch.&lt;/p&gt;</comment>
                            <comment id="233332" author="gerrit" created="Tue, 11 Sep 2018 15:37:34 +0000"  >&lt;p&gt;Patrick Farrell (paf@cray.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/33137&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/33137&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10686&quot; title=&quot;sanity-pfl test 9 fails with &#8220;[0x100010000:0x6025:0x0] !=  &#8220;&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10686&quot;&gt;&lt;del&gt;LU-10686&lt;/del&gt;&lt;/a&gt; tests: correct layout in sanity-pfl 9&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 18d21ed1e69ccdff0d87da8b5fa58fa188e673cb&lt;/p&gt;</comment>
                            <comment id="233334" author="bobijam" created="Tue, 11 Sep 2018 16:44:39 +0000"  >&lt;p&gt;I&apos;ve verified that &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-11158&quot; title=&quot;PFL component instantiation is not replayed properly&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-11158&quot;&gt;&lt;del&gt;LU-11158&lt;/del&gt;&lt;/a&gt; fixed this issue.&lt;/p&gt;</comment>
                            <comment id="233339" author="paf" created="Tue, 11 Sep 2018 17:11:16 +0000"  >&lt;p&gt;Have you confirmed the write is no longer returning ENODATA?&lt;/p&gt;

&lt;p&gt;Given that the write in the test is beyond the end of the specified layout, I think we&apos;ve still got a problem.&lt;/p&gt;</comment>
                            <comment id="233375" author="bobijam" created="Wed, 12 Sep 2018 01:44:18 +0000"  >&lt;p&gt;yes, the write returns ENODATA, while the test just intends to instantiate the 2nd component, and verify the recovery replay the 2nd component instantiation.&lt;/p&gt;</comment>
                            <comment id="233378" author="paf" created="Wed, 12 Sep 2018 03:55:18 +0000"  >&lt;p&gt;If the write returns ENODATA, how is it getting the component instantiated? &#160;Does it still hit with the first byte...?&lt;/p&gt;</comment>
                            <comment id="233382" author="bobijam" created="Wed, 12 Sep 2018 06:58:42 +0000"  >&lt;p&gt;Even it hit the ENODATA, MDS will still instantiate the available component. I don&apos;t mean that change the 2nd component end to EOF is not right, the essential issue here is that the component instantiation replay has bug, and &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-11158&quot; title=&quot;PFL component instantiation is not replayed properly&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-11158&quot;&gt;&lt;del&gt;LU-11158&lt;/del&gt;&lt;/a&gt; patch can fix it.&lt;/p&gt;</comment>
                            <comment id="233967" author="pjones" created="Tue, 25 Sep 2018 10:36:58 +0000"  >&lt;p&gt;It sounds like this is believed to be a duplicate of &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-11158&quot; title=&quot;PFL component instantiation is not replayed properly&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-11158&quot;&gt;&lt;del&gt;LU-11158&lt;/del&gt;&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="253633" author="adilger" created="Mon, 26 Aug 2019 23:46:21 +0000"  >&lt;p&gt;This test was removed from &lt;tt&gt;ALWAYS_EXCEPT&lt;/tt&gt; by patch &lt;a href=&quot;https://review.whamcloud.com/32847&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/32847&lt;/a&gt; &quot;&lt;tt&gt;&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-11158&quot; title=&quot;PFL component instantiation is not replayed properly&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-11158&quot;&gt;&lt;del&gt;LU-11158&lt;/del&gt;&lt;/a&gt; mdt: grow lvb buffer to hold layout&lt;/tt&gt;&quot;.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="52752">LU-11158</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzzt1b:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>