<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:29:33 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-2939] Lustre: MGS is waiting for obd_unlinked_exports more than 256 seconds. The obd refcount = 5. Is it stuck?</title>
                <link>https://jira.whamcloud.com/browse/LU-2939</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Running various tests in a loop, I see messages like this fairly regularly.&lt;br/&gt;
The latest occurrence was in replay-single test 52:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;== replay-single test 52: time out lock replay (3764) == 01:37:31 (1362638251)
Filesystem           1K-blocks      Used Available Use% Mounted on
192.168.10.216@tcp:/lustre
                        374928     50772    303616  15% /mnt/lustre
mcreate: cannot create `/mnt/lustre2/fsa-centos6-6.localnet&apos; with mode 0100644: Read-only file system
rm: cannot remove `/mnt/lustre2/fsa-centos6-6.localnet&apos;: No such file or directory
fail_loc=0x8000030c
Failing mds1 on centos6-6.localnet
Stopping /mnt/mds1 (opts:) on centos6-6.localnet
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;37981.683825&amp;#93;&lt;/span&gt; Lustre: DEBUG MARKER: == replay-single test 52: time out lock replay (3764) == 01:37:31 (1362638251)&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;37981.704645&amp;#93;&lt;/span&gt; Lustre: DEBUG MARKER: cancel_lru_locks mdc start&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;37981.735425&amp;#93;&lt;/span&gt; Lustre: DEBUG MARKER: cancel_lru_locks mdc stop&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;37981.917515&amp;#93;&lt;/span&gt; Turning device loop0 (0x700000) read-only&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;37981.942649&amp;#93;&lt;/span&gt; Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;37981.950096&amp;#93;&lt;/span&gt; Lustre: DEBUG MARKER: local REPLAY BARRIER on lustre-MDT0000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;37984.526745&amp;#93;&lt;/span&gt; LustreError: 137-5: UUID &apos;lustre-MDT0000_UUID&apos; is not available for connect (no target)&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;37984.527548&amp;#93;&lt;/span&gt; LustreError: Skipped 1 previous similar message&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;37989.523057&amp;#93;&lt;/span&gt; LustreError: 137-5: UUID &apos;lustre-MDT0000_UUID&apos; is not available for connect (no target)&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;37989.523952&amp;#93;&lt;/span&gt; LustreError: Skipped 2 previous similar messages&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;37996.136083&amp;#93;&lt;/span&gt; Lustre: MGS is waiting for obd_unlinked_exports more than 8 seconds. The obd refcount = 5. Is it stuck?&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;37999.132513&amp;#93;&lt;/span&gt; LustreError: 137-5: UUID &apos;lustre-MDT0000_UUID&apos; is not available for connect (no target)&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38012.137563&amp;#93;&lt;/span&gt; Lustre: MGS is waiting for obd_unlinked_exports more than 16 seconds. The obd refcount = 5. Is it stuck?&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38019.133096&amp;#93;&lt;/span&gt; LustreError: 137-5: UUID &apos;lustre-MDT0000_UUID&apos; is not available for connect (no target)&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38019.133919&amp;#93;&lt;/span&gt; LustreError: Skipped 11 previous similar messages&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38044.137577&amp;#93;&lt;/span&gt; Lustre: MGS is waiting for obd_unlinked_exports more than 32 seconds. The obd refcount = 5. Is it stuck?&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38050.132095&amp;#93;&lt;/span&gt; Lustre: 11212:0:(client.c:1866:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: &lt;span class=&quot;error&quot;&gt;&amp;#91;sent 1362638298/real 1362638298&amp;#93;&lt;/span&gt;  req@ffff8800b451a7f0 x1428827873041828/t0(0) o250-&amp;gt;MGC192.168.10.216@tcp@0@lo:26/25 lens 400/544 e 0 to 1 dl 1362638319 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38050.134028&amp;#93;&lt;/span&gt; Lustre: 11212:0:(client.c:1866:ptlrpc_expire_one_request()) Skipped 25 previous similar messages&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38054.136029&amp;#93;&lt;/span&gt; LustreError: 137-5: UUID &apos;lustre-MDT0000_UUID&apos; is not available for connect (no target)&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38054.136894&amp;#93;&lt;/span&gt; LustreError: Skipped 20 previous similar messages&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38108.137578&amp;#93;&lt;/span&gt; Lustre: MGS is waiting for obd_unlinked_exports more than 64 seconds. The obd refcount = 5. Is it stuck?&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38119.136041&amp;#93;&lt;/span&gt; LustreError: 137-5: UUID &apos;lustre-MDT0000_UUID&apos; is not available for connect (no target)&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38119.137021&amp;#93;&lt;/span&gt; LustreError: Skipped 38 previous similar messages&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38236.137579&amp;#93;&lt;/span&gt; Lustre: MGS is waiting for obd_unlinked_exports more than 128 seconds. The obd refcount = 5. Is it stuck?&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38249.135021&amp;#93;&lt;/span&gt; LustreError: 137-5: UUID &apos;lustre-MDT0000_UUID&apos; is not available for connect (no target)&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38249.135879&amp;#93;&lt;/span&gt; LustreError: Skipped 77 previous similar messages&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.624138&amp;#93;&lt;/span&gt; INFO: task umount:3429 blocked for more than 120 seconds.&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.624649&amp;#93;&lt;/span&gt; &quot;echo 0 &amp;gt; /proc/sys/kernel/hung_task_timeout_secs&quot; disables this message.&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.625436&amp;#93;&lt;/span&gt; umount        D 0000000000000003  2608  3429   3428 0x00000000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.626043&amp;#93;&lt;/span&gt;  ffff88006ed25a98 0000000000000086 0000000000000000 ffff88006ed25a48&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.626850&amp;#93;&lt;/span&gt;  ffff88006ed25a08 ffff88006a542bf0 ffffffffa0d9320f 0000000000000000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.627634&amp;#93;&lt;/span&gt;  ffff88007a544ab8 ffff88006ed25fd8 000000000000fba8 ffff88007a544ab8&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.628438&amp;#93;&lt;/span&gt; Call Trace:&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.628792&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814f8ad1&amp;gt;&amp;#93;&lt;/span&gt; schedule_timeout+0x191/0x2e0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.629232&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8107bcd0&amp;gt;&amp;#93;&lt;/span&gt; ? process_timeout+0x0/0x10&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.629751&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0a7f75d&amp;gt;&amp;#93;&lt;/span&gt; cfs_schedule_timeout_and_set_state+0x1d/0x20 &lt;span class=&quot;error&quot;&gt;&amp;#91;libcfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.630619&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d19670&amp;gt;&amp;#93;&lt;/span&gt; obd_exports_barrier+0xb0/0x190 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdclass&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.631133&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa05d2936&amp;gt;&amp;#93;&lt;/span&gt; mgs_device_fini+0xf6/0x5c0 &lt;span class=&quot;error&quot;&gt;&amp;#91;mgs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.631615&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d45cc7&amp;gt;&amp;#93;&lt;/span&gt; class_cleanup+0x577/0xda0 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdclass&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.633249&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d1be9c&amp;gt;&amp;#93;&lt;/span&gt; ? class_name2dev+0x7c/0xe0 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdclass&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.633799&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d475ac&amp;gt;&amp;#93;&lt;/span&gt; class_process_config+0x10bc/0x1c80 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdclass&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.634531&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d40f93&amp;gt;&amp;#93;&lt;/span&gt; ? lustre_cfg_new+0x353/0x7e0 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdclass&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.635035&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d482e9&amp;gt;&amp;#93;&lt;/span&gt; class_manual_cleanup+0x179/0x6e0 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdclass&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.635513&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814faebe&amp;gt;&amp;#93;&lt;/span&gt; ? _read_unlock+0xe/0x10&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.635993&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d1be9c&amp;gt;&amp;#93;&lt;/span&gt; ? class_name2dev+0x7c/0xe0 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdclass&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.636501&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d5415d&amp;gt;&amp;#93;&lt;/span&gt; server_put_super+0x43d/0xe60 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdclass&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.637008&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8117d6ab&amp;gt;&amp;#93;&lt;/span&gt; generic_shutdown_super+0x5b/0xe0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.637480&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8117d796&amp;gt;&amp;#93;&lt;/span&gt; kill_anon_super+0x16/0x60&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.638004&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d4a0e6&amp;gt;&amp;#93;&lt;/span&gt; lustre_kill_super+0x36/0x60 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdclass&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.638468&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8117e825&amp;gt;&amp;#93;&lt;/span&gt; deactivate_super+0x85/0xa0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.638904&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8119a89f&amp;gt;&amp;#93;&lt;/span&gt; mntput_no_expire+0xbf/0x110&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.639338&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8119b34b&amp;gt;&amp;#93;&lt;/span&gt; sys_umount+0x7b/0x3a0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38400.639764&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100b0f2&amp;gt;&amp;#93;&lt;/span&gt; system_call_fastpath+0x16/0x1b&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38492.136071&amp;#93;&lt;/span&gt; Lustre: MGS is waiting for obd_unlinked_exports more than 256 seconds. The obd refcount = 5. Is it stuck?&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38509.132648&amp;#93;&lt;/span&gt; LustreError: 137-5: UUID &apos;lustre-MDT0000_UUID&apos; is not available for connect (no target)&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;38509.133504&amp;#93;&lt;/span&gt; LustreError: Skipped 154 previous similar messages&lt;br/&gt;
...&lt;/p&gt;

&lt;p&gt;I saved a crash dump so that anyone interested can take a look: /exports/crashdumps/t2/hung-obd_unlinked_exports.dmp (modules are present too).&lt;/p&gt;</description>
                <environment></environment>
        <key id="17814">LU-2939</key>
            <summary>Lustre: MGS is waiting for obd_unlinked_exports more than 256 seconds. The obd refcount = 5. Is it stuck?</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="3">Duplicate</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="green">Oleg Drokin</reporter>
                        <labels>
                    </labels>
                <created>Sat, 9 Mar 2013 19:27:35 +0000</created>
                <updated>Fri, 31 May 2013 16:37:10 +0000</updated>
                            <resolved>Sun, 10 Mar 2013 04:07:18 +0000</resolved>
                                    <version>Lustre 2.4.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                            <comments>
                            <comment id="53654" author="bzzz" created="Sat, 9 Mar 2013 23:28:12 +0000"  >&lt;p&gt;I think Andreas created a similar bug very recently.&lt;/p&gt;</comment>
                            <comment id="53655" author="adilger" created="Sun, 10 Mar 2013 03:52:25 +0000"  >&lt;p&gt;It wasn&apos;t a new bug, but an older one for which I found a new hit with this same message: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-2015&quot; title=&quot;Test failure on test suite obdfilter-survey, subtest test_3a&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-2015&quot;&gt;&lt;del&gt;LU-2015&lt;/del&gt;&lt;/a&gt;. Please either mark this one as a duplicate or close the old one.&lt;/p&gt;</comment>
                            <comment id="53656" author="green" created="Sun, 10 Mar 2013 04:07:18 +0000"  >&lt;p&gt;Duplicate of &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-2015&quot; title=&quot;Test failure on test suite obdfilter-survey, subtest test_3a&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-2015&quot;&gt;&lt;del&gt;LU-2015&lt;/del&gt;&lt;/a&gt;&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                            <outwardlinks description="duplicates">
                                        <issuelink>
            <issuekey id="16088">LU-2015</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                                                <inwardlinks description="is related to">
                                        <issuelink>
            <issuekey id="18543">LU-3230</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzvkjj:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>7057</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>