<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:59:17 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-13205] sanity-pfl test 16a fails with &#8220;setstripe /mnt/lustre/d16.sanity-pfl/f16.sanity-pfl.copy failed&#8221;</title>
                <link>https://jira.whamcloud.com/browse/LU-13205</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;In a recent failure, &lt;a href=&quot;https://testing.whamcloud.com/test_sets/52073ba0-4715-11ea-b69a-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/52073ba0-4715-11ea-b69a-52540065bddc&lt;/a&gt;, sanity-pfl test 16a fails with the following in the client test_log:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;== sanity-pfl test 16a: Verify setstripe/getstripe with YAML config file ============================= 05:25:02 (1580793902)
CMD: trevis-4vm12 dumpe2fs -h /dev/mapper/mds1_flakey 2&amp;gt;&amp;amp;1 |
		grep -E -q &apos;(ea_inode|large_xattr)&apos;
1. PFL file
getstripe --yaml /mnt/lustre/d16a.sanity-pfl/f16a.sanity-pfl
setstripe --yaml=/mnt/lustre/d16a.sanity-pfl/template /mnt/lustre/d16a.sanity-pfl/f16a.sanity-pfl.copy
Set stripe size 4096 failed: Invalid argument
lfs setstripe: cannot build layout from YAML file /mnt/lustre/d16a.sanity-pfl/template.
error: setstripe: can&apos;t create composite layout from template file /mnt/lustre/d16a.sanity-pfl/template
 sanity-pfl test_16a: @@@@@@ FAIL: setstripe /mnt/lustre/d16a.sanity-pfl/f16a.sanity-pfl.copy failed 
  Trace dump:
  = /usr/lib64/lustre/tests/test-framework.sh:6121:error()
  = /usr/lib64/lustre/tests/test-framework.sh:9710:verify_yaml_layout()
  = /usr/lib64/lustre/tests/sanity-pfl.sh:761:test_16a()
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;The only hint of a problem in the console logs is on client 2 (vm9), where we see:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[ 1212.789487] LustreError: 1758:0:(pack_generic.c:2447:lustre_swab_lov_comp_md_v1()) Invalid magic 0x1
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;We&#8217;ve seen this same error message in sanity-pfl test 14 failures in &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-13186&quot; title=&quot;sanity-pfl test 14 fails with &amp;#39;/mnt/lustre/d14.sanity-pfl/f14.sanity-pfl: component 4 doesn&amp;#39;t have poolname pool2&amp;#39;&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-13186&quot;&gt;LU-13186&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Patch &lt;a href=&quot;https://review.whamcloud.com/#/c/28425/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/#/c/28425/&lt;/a&gt; for &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9846&quot; title=&quot;Overstriping - more than stripe per OST per component&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9846&quot;&gt;&lt;del&gt;LU-9846&lt;/del&gt;&lt;/a&gt; landed in Lustre 2.12.54 on 01 JUNE 2019 and was not backported to b2_12. This patch renamed test_16 to test_16a and added a new test_16b.&lt;/p&gt;

&lt;p&gt;sanity-pfl test 16a started failing with this error message on 30 JULY 2019 with Lustre 2.12.56.72, and test 16 started failing on b2_12 on 13 AUG 2019 with Lustre 2.12.2.115.&lt;/p&gt;

&lt;p&gt;sanity-pfl test 16a fails only on PPC clients, where it fails 100% of the time.&lt;/p&gt;

&lt;p&gt;Logs for recent failures are at&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/fd407770-4706-11ea-a1c8-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/fd407770-4706-11ea-a1c8-52540065bddc&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/07588ab8-2592-11ea-80b4-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/07588ab8-2592-11ea-80b4-52540065bddc&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/a2a89ace-1fdb-11ea-adca-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/a2a89ace-1fdb-11ea-adca-52540065bddc&lt;/a&gt;&lt;/p&gt;</description>
                <environment>PPC clients</environment>
        <key id="58001">LU-13205</key>
            <summary>sanity-pfl test 16a fails with &#8220;setstripe /mnt/lustre/d16.sanity-pfl/f16.sanity-pfl.copy failed&#8221;</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="1" iconUrl="https://jira.whamcloud.com/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="default"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="jamesanunez">James Nunez</reporter>
                        <labels>
                            <label>always_except</label>
                            <label>ppc</label>
                    </labels>
                <created>Wed, 5 Feb 2020 19:03:06 +0000</created>
                <updated>Mon, 24 Feb 2020 02:25:31 +0000</updated>
                                            <version>Lustre 2.13.0</version>
                    <version>Lustre 2.14.0</version>
                    <version>Lustre 2.12.4</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>4</watches>
                                                                            <comments>
                            <comment id="262979" author="adilger" created="Mon, 10 Feb 2020 07:16:16 +0000"  >&lt;p&gt;Based on when the failures started and the area affected, this was very likely caused by the landing of:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;commit 9d17996766e0fa93b1029d2422d45d087edde389
CommitDate: Sat Jul 27 00:21:20 2019 +0000

    LU-10100 llite: swab LOV EA user data
    
    Many sub-tests failed with &quot;Invalid argument&quot; failures
    on PPC client because of the endianness issue.
    
    This patch fixes the issue by adding a common function
    lustre_swab_lov_user_md() to swab the LOV EA user data.
    
    Test-Parameters: clientarch=ppc64 envdefinitions=ONLY=27 testlist=sanity

    Reviewed-on: https://review.whamcloud.com/35291
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="262981" author="adilger" created="Mon, 10 Feb 2020 07:26:10 +0000"  >&lt;p&gt;It seems that &quot;caused&quot; might be too strong a word.  Prior to the landing of the &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10100&quot; title=&quot;sanity test_27a: setstripe failed with &amp;quot;error on ioctl 0x8008669a for &amp;#39;*&amp;#39; (3): Invalid argument&amp;quot;&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10100&quot;&gt;&lt;del&gt;LU-10100&lt;/del&gt;&lt;/a&gt; patch, sanity-pfl just failed every test.  After the landing of &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10100&quot; title=&quot;sanity test_27a: setstripe failed with &amp;quot;error on ioctl 0x8008669a for &amp;#39;*&amp;#39; (3): Invalid argument&amp;quot;&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10100&quot;&gt;&lt;del&gt;LU-10100&lt;/del&gt;&lt;/a&gt;, many of the tests started passing, and test_16a, test_16b, and test_17 began crashing, but only because they actually started doing something rather than exiting as soon as the &quot;&lt;tt&gt;lfs setstripe&lt;/tt&gt;&quot; command failed.  That means reverting &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10100&quot; title=&quot;sanity test_27a: setstripe failed with &amp;quot;error on ioctl 0x8008669a for &amp;#39;*&amp;#39; (3): Invalid argument&amp;quot;&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10100&quot;&gt;&lt;del&gt;LU-10100&lt;/del&gt;&lt;/a&gt; would not &lt;em&gt;improve&lt;/em&gt; the situation; it would only return us to a state where 100% of the tests are broken on ppc64, rather than only the PFL tests.&lt;/p&gt;

&lt;p&gt;I&apos;m going to push a debug patch to see if it can identify where the missing swabbing is, which will hopefully allow us to solve the problem.  The current issue is that the machine state is so fatally corrupted that the machine reboots without actually printing any stack traces of what is being run.&lt;/p&gt;</comment>
                            <comment id="262982" author="gerrit" created="Mon, 10 Feb 2020 07:33:05 +0000"  >&lt;p&gt;Andreas Dilger (adilger@whamcloud.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/37494&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/37494&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-13205&quot; title=&quot;sanity-pfl test 16a fails with &#8220;setstripe /mnt/lustre/d16.sanity-pfl/f16.sanity-pfl.copy failed&#8221;&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-13205&quot;&gt;LU-13205&lt;/a&gt; lov: debug lov buffer swabbing crash&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: e17fe2322008ce7520f4496932ea8638018a4261&lt;/p&gt;</comment>
                            <comment id="263882" author="adilger" created="Mon, 24 Feb 2020 02:25:31 +0000"  >&lt;p&gt;I&apos;ve captured a stack trace from the crash.  It looks like it is calling &lt;tt&gt;DEBUG_REQ()&lt;/tt&gt; in &lt;tt&gt;mdc_close()&lt;/tt&gt; with a bad message buffer:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[ 1188.641162] Lustre: DEBUG MARKER: == sanity-pfl test 16b: Verify setstripe/getstripe with YAML config file + overstriping == 23:01:06 (1581548466)
[ 1189.084215] LustreError: 3150:0:(pack_generic.c:2488:lustre_swab_lov_comp_md_v1()) lov: invalid magic 0x1 for component 3
[ 1189.087595] LustreError: 3150:0:(pack_generic.c:1199:lustre_msg_get_transno()) incorrect message magic: 00000200
[ 1189.087680] LustreError: 3150:0:(pack_generic.c:1089:lustre_msg_get_opc()) incorrect message magic: 00000200 (msg:c0000000049cb800)
[ 1189.087807] ------------[ cut here ]------------
[ 1189.087904] WARNING: CPU: 1 PID: 3150 at lustre/ptlrpc/pack_generic.c:1090 .lustre_msg_get_opc+0x108/0x330 [ptlrpc]
 CPU: 1 PID: 3150 Comm: lfs Kdump: loaded Tainted: G           OE  ------------   3.10.0-1062.9.1.el7.ppc64 #1
 Call Trace:
 .lustre_msg_get_opc+0xf8/0x330 [ptlrpc] (unreliable)
 ._debug_req+0x674/0x830 [ptlrpc]
 .mdc_close+0x568/0x1150 [mdc]
 .lmv_close+0x238/0x580 [lmv]
 .ll_close_inode_openhandle+0xb7c/0x1380 [lustre]
 .ll_md_real_close+0x190/0x330 [lustre]
 .ll_file_release+0x728/0xcf0 [lustre]
 .ll_dir_release+0x15c/0x210 [lustre]
 .__fput+0xe8/0x330
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                                                <inwardlinks description="is duplicated by">
                                        <issuelink>
            <issuekey id="58003">LU-13207</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="48641">LU-10100</issuekey>
        </issuelink>
                            </outwardlinks>
                                                                <inwardlinks description="is related to">
                                        <issuelink>
            <issuekey id="58022">LU-13215</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="57963">LU-13186</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                    <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i00t8v:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                </customfields>
    </item>
</channel>
</rss>