<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:42:59 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-11336] replay-single test 80d hangs on MDT unmount</title>
                <link>https://jira.whamcloud.com/browse/LU-11336</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;replay-single test_80d hangs on MDT unmount. Looking at a recent hang, at &lt;a href=&quot;https://testing.whamcloud.com/test_sets/ffb8585c-a706-11e8-80f7-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/ffb8585c-a706-11e8-80f7-52540065bddc&lt;/a&gt;, the last thing we see in the test log is&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;CMD: trevis-44vm10 /usr/sbin/lctl --device lustre-MDT0001 notransno
CMD: trevis-44vm10 /usr/sbin/lctl --device lustre-MDT0001 readonly
CMD: trevis-44vm10 /usr/sbin/lctl mark mds2 REPLAY BARRIER on lustre-MDT0001
Failing mds1 on trevis-44vm9
CMD: trevis-44vm9 grep -c /mnt/lustre-mds1&apos; &apos; /proc/mounts || true
Stopping /mnt/lustre-mds1 (opts:) on trevis-44vm9
CMD: trevis-44vm9 umount -d /mnt/lustre-mds1
CMD: trevis-44vm9 lsmod | grep lnet &amp;gt; /dev/null &amp;amp;&amp;amp;
lctl dl | grep &apos; ST &apos; || true
CMD: trevis-44vm9 ! zpool list -H lustre-mdt1 &amp;gt;/dev/null 2&amp;gt;&amp;amp;1 ||
			grep -q ^lustre-mdt1/ /proc/mounts ||
			zpool export  lustre-mdt1
Failing mds2 on trevis-44vm10
CMD: trevis-44vm10 grep -c /mnt/lustre-mds2&apos; &apos; /proc/mounts || true
Stopping /mnt/lustre-mds2 (opts:) on trevis-44vm10
CMD: trevis-44vm10 umount -d /mnt/lustre-mds2
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Looking at the console log for the MDS (vm10), we see the some errors and a call trace&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[ 5775.673375] Lustre: DEBUG MARKER: /usr/sbin/lctl --device lustre-MDT0001 notransno
[ 5776.031453] Lustre: DEBUG MARKER: /usr/sbin/lctl --device lustre-MDT0001 readonly
[ 5776.390963] Lustre: DEBUG MARKER: /usr/sbin/lctl mark mds2 REPLAY BARRIER on lustre-MDT0001
[ 5776.569652] Lustre: DEBUG MARKER: mds2 REPLAY BARRIER on lustre-MDT0001
[ 5778.383410] Lustre: DEBUG MARKER: grep -c /mnt/lustre-mds2&apos; &apos; /proc/mounts || true
[ 5778.731629] Lustre: DEBUG MARKER: umount -d /mnt/lustre-mds2
[ 5778.917429] LustreError: 8724:0:(ldlm_resource.c:1146:ldlm_resource_complain()) lustre-MDT0000-osp-MDT0001: namespace resource [0x20000abf8:0xa:0x0].0x0 (ffff96bab691e6c0) refcount nonzero (1) after lock cleanup; forcing cleanup.
[ 5778.922402] LustreError: 8724:0:(ldlm_resource.c:1146:ldlm_resource_complain()) lustre-MDT0000-osp-MDT0001: namespace resource [0x20000abf8:0xa:0x0].0x0 (ffff96bab691e6c0) refcount nonzero (1) after lock cleanup; forcing cleanup.
[ 5783.926373] LustreError: 0-0: Forced cleanup waiting for lustre-MDT0000-osp-MDT0001 namespace with 1 resources in use, (rc=-110)
[ 5830.783393] LNet: Service thread pid 27370 was inactive for 62.11s. The thread might be hung, or it might only be slow and will resume later. Dumping the stack trace for debugging purposes:
[ 5830.785144] Pid: 27370, comm: mdt00_000 3.10.0-862.9.1.el7_lustre.x86_64 #1 SMP Fri Aug 17 20:37:05 UTC 2018
[ 5830.786119] Call Trace:
[ 5830.786419]  [&amp;lt;ffffffffc0f14dca&amp;gt;] top_trans_wait_result+0xa6/0x155 [ptlrpc]
[ 5830.787317]  [&amp;lt;ffffffffc0ef6c0b&amp;gt;] top_trans_stop+0x42b/0x930 [ptlrpc]
[ 5830.788062]  [&amp;lt;ffffffffc131d80c&amp;gt;] lod_trans_stop+0x25c/0x340 [lod]
[ 5830.788958]  [&amp;lt;ffffffffc13ca9ae&amp;gt;] mdd_trans_stop+0x2e/0x174 [mdd]
[ 5830.789654]  [&amp;lt;ffffffffc13af627&amp;gt;] mdd_create+0x967/0x13f0 [mdd]
[ 5830.790307]  [&amp;lt;ffffffffc124cc80&amp;gt;] mdt_create+0x820/0xc40 [mdt]
[ 5830.791044]  [&amp;lt;ffffffffc124d20b&amp;gt;] mdt_reint_create+0x16b/0x350 [mdt]
[ 5830.791709]  [&amp;lt;ffffffffc124ece3&amp;gt;] mdt_reint_rec+0x83/0x210 [mdt]
[ 5830.792402]  [&amp;lt;ffffffffc122e1d2&amp;gt;] mdt_reint_internal+0x6b2/0xa80 [mdt]
[ 5830.793112]  [&amp;lt;ffffffffc12391e7&amp;gt;] mdt_reint+0x67/0x140 [mdt]
[ 5830.793705]  [&amp;lt;ffffffffc0ee627a&amp;gt;] tgt_request_handle+0xaea/0x1580 [ptlrpc]
[ 5830.794488]  [&amp;lt;ffffffffc0e8940b&amp;gt;] ptlrpc_server_handle_request+0x24b/0xab0 [ptlrpc]
[ 5830.795329]  [&amp;lt;ffffffffc0e8cc44&amp;gt;] ptlrpc_main+0xb14/0x1fb0 [ptlrpc]
[ 5830.796151]  [&amp;lt;ffffffffb88bb621&amp;gt;] kthread+0xd1/0xe0
[ 5830.796694]  [&amp;lt;ffffffffb8f205f7&amp;gt;] ret_from_fork_nospec_end+0x0/0x39
[ 5830.797409]  [&amp;lt;ffffffffffffffff&amp;gt;] 0xffffffffffffffff
[ 5830.798027] LustreError: dumping log to /tmp/lustre-log.1535036577.27370
[ 9434.653754] SysRq : Changing Loglevel
[ 9434.654263] Loglevel set to 8
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;This test only hnags in this way for DNE/ZFS testing. &lt;/p&gt;

&lt;p&gt;It&#8217;s not easy to detect when this issue started because of issues with the kernel call trace , but this test did not fail nor hang in July 2018 and the first signs of replay-single test 80d hanging is on August 6, 2018.&lt;/p&gt;

&lt;p&gt;Here are links to logs for some hangs:&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/06129818-af39-11e8-bd05-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/06129818-af39-11e8-bd05-52540065bddc&lt;/a&gt; &lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/0339a0e2-ae2a-11e8-bfed-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/0339a0e2-ae2a-11e8-bfed-52540065bddc&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/021c99b8-ad2f-11e8-bfed-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/021c99b8-ad2f-11e8-bfed-52540065bddc&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/fd1d1dd8-abe0-11e8-80f7-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/fd1d1dd8-abe0-11e8-80f7-52540065bddc&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/057c01e4-9f6e-11e8-b0aa-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/057c01e4-9f6e-11e8-b0aa-52540065bddc&lt;/a&gt;&lt;/p&gt;
</description>
                <environment>DNE/ZFS</environment>
        <key id="53217">LU-11336</key>
            <summary>replay-single test 80d hangs on MDT unmount</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="1" iconUrl="https://jira.whamcloud.com/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="default"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="jamesanunez">James Nunez</reporter>
                        <labels>
                            <label>DNE</label>
                            <label>zfs</label>
                    </labels>
                <created>Wed, 5 Sep 2018 16:06:09 +0000</created>
                <updated>Wed, 11 Mar 2020 14:12:06 +0000</updated>
                                            <version>Lustre 2.12.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>1</watches>
                                                                                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                                                <inwardlinks description="is related to">
                                        <issuelink>
            <issuekey id="51032">LU-10740</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="53392">LU-11418</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i001uf:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>