<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:44:45 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-11538] replay-single test 80g fails with  &apos;/usr/bin/lfs getstripe -m /mnt/lustre/d80g.replay-single/remote_dir failed&apos;</title>
                <link>https://jira.whamcloud.com/browse/LU-11538</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;replay-single test_80g fails for ZFS with DNE Lustre configurations. Looking at a recent failure, &lt;a href=&quot;https://testing.whamcloud.com/test_sets/6c06f67c-cf6a-11e8-82f2-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/6c06f67c-cf6a-11e8-82f2-52540065bddc&lt;/a&gt; , we see &#8216;lfs getstripe&#8217; fails&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;onyx-42vm6: CMD: onyx-42vm6.onyx.whamcloud.com lctl get_param -n at_max
onyx-42vm7: CMD: onyx-42vm7.onyx.whamcloud.com lctl get_param -n at_max
onyx-42vm6: mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec
onyx-42vm7: mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec
lfs getstripe: cannot open &apos;/mnt/lustre/d80g.replay-single/remote_dir&apos;: No such file or directory (2)
error: getstripe failed for /mnt/lustre/d80g.replay-single/remote_dir.
 replay-single test_80g: @@@@@@ FAIL: /usr/bin/lfs getstripe -m /mnt/lustre/d80g.replay-single/remote_dir failed 
  Trace dump:
  = /usr/lib64/lustre/tests/test-framework.sh:5788:error()
  = /usr/lib64/lustre/tests/replay-single.sh:2580:remote_dir_check_80()
  = /usr/lib64/lustre/tests/replay-single.sh:2792:test_80g()
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Comparing the console log from this failed test session to one where test 80g passes, we see a few errors in the MDS2, MDS4 (vm10) log:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[75477.742299] Lustre: DEBUG MARKER: umount -d /mnt/lustre-mds2
[75477.926652] Lustre: Failing over lustre-MDT0001
[75477.946482] LustreError: 6854:0:(client.c:1175:ptlrpc_import_delay_req()) @@@ IMP_CLOSED   req@ffff8f07870a0f00 x1614211128795744/t0(0) o1000-&amp;gt;lustre-MDT0000-osp-MDT0001@10.2.8.153@tcp:24/4 lens 304/4320 e 0 to 0 dl 0 ref 2 fl Rpc:/0/ffffffff rc 0/-1
[75477.948593] LustreError: 6854:0:(client.c:1175:ptlrpc_import_delay_req()) Skipped 2 previous similar messages
[75477.949570] LustreError: 6854:0:(osp_object.c:582:osp_attr_get()) lustre-MDT0000-osp-MDT0001:osp_attr_get update error [0x200000401:0x1:0x0]: rc = -5
[75478.049796] Lustre: lustre-MDT0001: Not available for connect from 10.2.8.153@tcp (stopping)
[75478.605896] Lustre: DEBUG MARKER: lsmod | grep lnet &amp;gt; /dev/null &amp;amp;&amp;amp;
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;This test fails almost 100% of the time for a DNE wth ZFS configuration. Frequently, replay-single test 80g fails after test 80f fails, but this is not always true.&lt;/p&gt;

&lt;p&gt;Some other recent failures are at&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/121a90c6-c6e4-11e8-82f2-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/121a90c6-c6e4-11e8-82f2-52540065bddc&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/cb442ad8-d17c-11e8-b589-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/cb442ad8-d17c-11e8-b589-52540065bddc&lt;/a&gt;&lt;/p&gt;</description>
                <environment>DNE/ZFS</environment>
        <key id="53632">LU-11538</key>
            <summary>replay-single test 80g fails with  &apos;/usr/bin/lfs getstripe -m /mnt/lustre/d80g.replay-single/remote_dir failed&apos;</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="3">Duplicate</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="jamesanunez">James Nunez</reporter>
                        <labels>
                            <label>DNE</label>
                            <label>zfs</label>
                    </labels>
                <created>Wed, 17 Oct 2018 16:57:20 +0000</created>
                <updated>Wed, 3 Aug 2022 21:12:10 +0000</updated>
                            <resolved>Tue, 26 Feb 2019 07:49:16 +0000</resolved>
                                    <version>Lustre 2.12.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                            <comments>
                            <comment id="241392" author="sarah" created="Tue, 5 Feb 2019 17:54:33 +0000"  >&lt;p&gt;seeing similar error on soak which is running b2_10-ib build 98&lt;br/&gt;
on MDS 0, ldiskfs&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[13029.868775] Lustre: 12363:0:(client.c:2114:ptlrpc_expire_one_request()) Skipped 1 previous similar message
[13052.013064] LNet: 12352:0:(o2iblnd_cb.c:3192:kiblnd_check_conns()) Timed out tx for 192.168.1.111@o2ib: 4 seconds
[13112.234837] Lustre: MGS: Connection restored to 192.168.1.111@o2ib (at 192.168.1.111@o2ib)
[13112.244106] Lustre: Skipped 1 previous similar message
[13112.621426] Lustre: soaked-MDT0000: Received new LWP connection from 192.168.1.111@o2ib, removing former export from same NID
[13173.841531] LustreError: 167-0: soaked-MDT0003-osp-MDT0000: This client was evicted by soaked-MDT0003; in progress operations using this service will fail.
[13173.857375] LustreError: 19073:0:(osp_object.c:582:osp_attr_get()) soaked-MDT0003-osp-MDT0000:osp_attr_get update error [0x2c000ee60:0x1:0x0]: rc = -5
[13173.862277] Lustre: soaked-MDT0003-osp-MDT0000: Connection restored to 192.168.1.111@o2ib (at 192.168.1.111@o2ib)
[13173.862280] Lustre: Skipped 2 previous similar messages
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="242786" author="adilger" created="Tue, 26 Feb 2019 07:49:16 +0000"  >&lt;p&gt;Issue was fixed via patch &lt;a href=&quot;https://review.whamcloud.com/34069&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/34069&lt;/a&gt; &quot;&lt;tt&gt;&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10143&quot; title=&quot;LBUG dt_object.h:2166:dt_declare_record_write&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10143&quot;&gt;&lt;del&gt;LU-10143&lt;/del&gt;&lt;/a&gt; osd-zfs: allocate sequence in advance&lt;/tt&gt;&quot;&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                            <outwardlinks description="duplicates">
                                        <issuelink>
            <issuekey id="48828">LU-10143</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="53280">LU-11366</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i004ef:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>