<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:02:55 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-13634] racer test 1 failed with &#8216;Illegal instruction     (core dumped)&#8217;</title>
                <link>https://jira.whamcloud.com/browse/LU-13634</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;racer test_1 fails with the error message &apos;test_1 failed with 8&apos;. Looking at the failure for an ARM client test, at &lt;a href=&quot;https://testing.whamcloud.com/test_sets/e8f119a9-57ca-45e3-b8f3-246009fe1b75&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/e8f119a9-57ca-45e3-b8f3-246009fe1b75&lt;/a&gt;, in the suite_log we see many &#8216;illegal instructions&#8217; errors like&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;./file_exec.sh: line 16: 25000 Illegal instruction     (core dumped) $DIR/$file 0.$((RANDOM % 5 + 1)) 2&amp;gt; /dev/null
racer cleanup
sleeping 5 sec ...
racer cleanup
sleeping 5 sec ...
Waited 5, rc=3 sleeping 10 sec ...
Waited 5, rc=3 sleeping 10 sec ...
Waited 20, rc=3 sleeping 20 sec ...
Waited 20, rc=3 sleeping 20 sec ...
Waited 50, rc=3 sleeping 40 sec ...
Waited 50, rc=3 sleeping 40 sec ...
Waited 110, rc=3 sleeping 80 sec ...
Waited 110, rc=3 sleeping 80 sec ...
Waited 230, rc=3 sleeping 160 sec ...
Waited 230, rc=3 sleeping 160 sec ...
Waited 470, rc=3 sleeping 320 sec ...
Waited 470, rc=3 sleeping 320 sec ...
Waited 950, rc=3 USER       PID %CPU %MEM    VSZ   RSS TTY      STAT START   TIME COMMAND
Filesystem             1K-blocks   Used Available Use% Mounted on
10.2.8.103@tcp:/lustre  15466208 221764  14268332   2% /mnt/lustre
Waited 950, rc=2 USER       PID %CPU %MEM    VSZ   RSS TTY      STAT START   TIME COMMAND
Filesystem             1K-blocks   Used Available Use% Mounted on
10.2.8.103@tcp:/lustre  15466208 221764  14268332   2% /mnt/lustre
Running /usr/lib64/lustre/tests/racer/racer.sh for 900 seconds. CTRL-C to exit
Running /usr/lib64/lustre/tests/racer/racer.sh for 900 seconds. CTRL-C to exit
layout: raid0 raid0 pfl pfl pfl dom dom dom flr flr flr
layout: raid0 raid0 pfl pfl pfl dom dom dom flr flr flr
layout: raid0 raid0 pfl pfl pfl dom dom dom flr flr flr
layout: raid0 raid0 pfl pfl pfl dom dom dom flr flr flr
layout: raid0 raid0 pfl pfl pfl dom dom dom flr flr flr
layout: raid0 raid0 pfl pfl pfl dom dom dom flr flr flr
./file_exec.sh: line 16: 29938 Illegal instruction     (core dumped) $DIR/$file 0.$((RANDOM % 5 + 1)) 2&amp;gt; /dev/null
racer cleanup
./file_exec.sh: line 16: 26172 Illegal instruction     (core dumped) $DIR/$file 0.$((RANDOM % 5 + 1)) 2&amp;gt; /dev/null
sleeping 5 sec ...
racer cleanup
&#8230;
./file_exec.sh: line 16: 22980 Illegal instruction     (core dumped) $DIR/$file 0.$((RANDOM % 5 + 1)) 2&amp;gt; /dev/null
./file_exec.sh: line 16:  7946 Terminated              $DIR/$file 0.$((RANDOM % 5 + 1)) 2&amp;gt; /dev/null
sleeping 5 sec ...
racer cleanup
sleeping 5 sec ...
Waited 5, rc=3 sleeping 10 sec ...
Waited 5, rc=3 sleeping 10 sec ...
Waited 20, rc=3 sleeping 20 sec ...
Waited 20, rc=3 sleeping 20 sec ...
Waited 50, rc=3 sleeping 40 sec ...
Waited 50, rc=3 sleeping 40 sec ...
Waited 110, rc=3 sleeping 80 sec ...
Waited 110, rc=3 sleeping 80 sec ...
Waited 230, rc=3 sleeping 160 sec ...
Waited 230, rc=3 sleeping 160 sec ...
Waited 470, rc=3 sleeping 320 sec ...
Waited 470, rc=3 sleeping 320 sec ...
Waited 950, rc=2 USER       PID %CPU %MEM    VSZ   RSS TTY      STAT START   TIME COMMAND
Filesystem             1K-blocks   Used Available Use% Mounted on
10.2.8.103@tcp:/lustre  15466208 221764  14268332   2% /mnt/lustre2
Waited 950, rc=3 USER       PID %CPU %MEM    VSZ   RSS TTY      STAT START   TIME COMMAND
Filesystem             1K-blocks   Used Available Use% Mounted on
10.2.8.103@tcp:/lustre  15466208 221764  14268332   2% /mnt/lustre2
pid=14004 rc=1
pid=14006 rc=1
pid=14007 rc=1
 racer test_1: @@@@@@ FAIL: test_1 failed with 8 
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;We&#8217;ve seen these errors in at least three test sessions in the past four months starting with the Lustre version 2.12.4.61 only on the b2_12 branch starting, all during ARM client testing:&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/849d0827-b6ca-46ec-b0be-4727c4bda504&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/849d0827-b6ca-46ec-b0be-4727c4bda504&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/58789911-1a3e-4c3c-ba7f-d1e7216f0ead&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/58789911-1a3e-4c3c-ba7f-d1e7216f0ead&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;There is at least one earlier failure like this for 2.13.51.37 on 27 JAN 2020:&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/a554c12e-41ca-11ea-9847-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/a554c12e-41ca-11ea-9847-52540065bddc&lt;/a&gt;&lt;/p&gt;</description>
                <environment>ARM clients</environment>
        <key id="59455">LU-13634</key>
            <summary>racer test 1 failed with &#8216;Illegal instruction     (core dumped)&#8217;</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="1" iconUrl="https://jira.whamcloud.com/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="default"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="jamesanunez">James Nunez</reporter>
                        <labels>
                    </labels>
                <created>Thu, 4 Jun 2020 22:24:18 +0000</created>
                <updated>Thu, 10 Sep 2020 17:21:39 +0000</updated>
                                            <version>Lustre 2.12.5</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>5</watches>
                                                                            <comments>
                            <comment id="272062" author="adilger" created="Fri, 5 Jun 2020 12:58:17 +0000"  >&lt;p&gt;AFAIK, this is fairly normal for racer, as it is doing crazy things to the files. The main goal is not to crash, and to be able to clean up afterward. &lt;/p&gt;

&lt;p&gt;We should see if syzbot &lt;img class=&quot;emoticon&quot; src=&quot;https://jira.whamcloud.com/images/icons/emoticons/help_16.png&quot; height=&quot;16&quot; width=&quot;16&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt; from the upstream kernel is publicly available and consider running that as well for a whole new level of painful race conditions, if racer isn&apos;t failing enough anymore. &lt;/p&gt;</comment>
                            <comment id="272074" author="jhammond" created="Fri, 5 Jun 2020 15:30:09 +0000"  >&lt;p&gt;Two things here:&lt;/p&gt;

&lt;ol&gt;
	&lt;li&gt;In &lt;tt&gt;file_exec.sh&lt;/tt&gt; we use &lt;tt&gt;egrep&lt;/tt&gt; to filter out shell error messages matching &quot;Segmentation fault&quot; and &quot;Bus error&quot;. We should also filter &quot;Illegal instruction&quot;.&lt;/li&gt;
	&lt;li&gt;I do not think that the illegal instruction messages are necessarily connected to the test failures. Instead I suspect that the test failures are because &lt;tt&gt;racer/racer.sh&lt;/tt&gt; cannot clean up all of its children.&lt;/li&gt;
&lt;/ol&gt;
</comment>
                            <comment id="279261" author="gerrit" created="Thu, 10 Sep 2020 17:21:39 +0000"  >&lt;p&gt;Vitaly Fertman (vitaly.fertman@hpe.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/39878&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/39878&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-13634&quot; title=&quot;racer test 1 failed with &#8216;Illegal instruction     (core dumped)&#8217;&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-13634&quot;&gt;LU-13634&lt;/a&gt; ldlm: extra checks for DOM locks&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 404f823c22d0c23ddc8d740bda69a0a8e9505d83&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i01213:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>