<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:40:38 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-4206] Sanity test_120e fails with 1 blocking RPC occured.</title>
                <link>https://jira.whamcloud.com/browse/LU-4206</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Sanity test_120e seems to be failing intermittently on some tests.&lt;br/&gt;
Seeing the following on the MDT&lt;/p&gt;

&lt;p&gt;Lustre: lustre-MDT0000: Not available for connect from 10.10.16.108@tcp (stopping)&lt;br/&gt;
Lustre: lustre-MDT0000: Not available for connect from 10.10.16.108@tcp (stopping)&lt;br/&gt;
Lustre: Skipped 5 previous similar messages&lt;br/&gt;
Lustre: lustre-MDT0000: Not available for connect from 10.10.16.109@tcp (stopping)&lt;br/&gt;
LustreError: 2966:0:(client.c:1076:ptlrpc_import_delay_req()) @@@ IMP_CLOSED   req@ffff8800686f7c00 x1450597855266416/t0(0) o13-&amp;gt;lustre-OST0002-osc-MDT0000@10.10.16.108@tcp:7/4 lens 224/368 e 0 to 0 dl 0 ref 1 fl Rpc:/0/ffffffff rc 0/-1&lt;br/&gt;
LustreError: 2966:0:(client.c:1076:ptlrpc_import_delay_req()) Skipped 3 previous similar messages&lt;br/&gt;
Lustre: lustre-MDT0000: Not available for connect from 10.10.16.110@tcp (stopping)&lt;br/&gt;
Lustre: 10030:0:(client.c:1897:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: &lt;span class=&quot;error&quot;&gt;&amp;#91;sent 1383398053/real 1383398053&amp;#93;&lt;/span&gt;  req@ffff880068291c00 x1450597855266444/t0(0) o251-&amp;gt;MGC10.10.16.107@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1383398059 ref 2 fl Rpc:XN/0/ffffffff rc 0/-1&lt;br/&gt;
Lustre: server umount lustre-MDT0000 complete&lt;/p&gt;

&lt;p&gt;Might be the that mdt is being umounted while Clients are communicating with it&lt;/p&gt;

&lt;p&gt;On Client &lt;br/&gt;
LustreError: 11-0: lustre-MDT0000-mdc-ffff880037d5a400: Communicating with 10.10.16.107@tcp, operation obd_ping failed with -107.&lt;/p&gt;

</description>
                <environment></environment>
        <key id="21844">LU-4206</key>
            <summary>Sanity test_120e fails with 1 blocking RPC occured.</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="di.wang">Di Wang</assignee>
                                    <reporter username="ashehata">Amir Shehata</reporter>
                        <labels>
                    </labels>
                <created>Mon, 4 Nov 2013 19:46:39 +0000</created>
                <updated>Thu, 28 Apr 2016 23:40:17 +0000</updated>
                            <resolved>Thu, 8 May 2014 17:31:33 +0000</resolved>
                                    <version>Lustre 2.5.0</version>
                                    <fixVersion>Lustre 2.6.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>8</watches>
                                                                            <comments>
                            <comment id="72442" author="adilger" created="Wed, 27 Nov 2013 20:37:47 +0000"  >&lt;p&gt;Only found 2 failures in the past few weeks.&lt;/p&gt;</comment>
                            <comment id="83398" author="jhammond" created="Wed, 7 May 2014 15:52:59 +0000"  >&lt;p&gt;I see this occasionally from autotest and also when running locally. IIUC, when is see it, the blocking callback is from the OST (handling OST_DESTROY) for the rename onto victim. AFAICT we don&apos;t do ELC for objects of rename victims. Is that correct?&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;00000100:00100000:0.0:1399475307.568943:0:21604:0:service.c:2090:ptlrpc_server_handle_request()) Handling RPC pname:cluuid+ref:pid:xid:nid:opc ll_ost00_004:lustre-MDT0000-mdtlov_UUID+4:11360:x1467451757057912:12345-0@lo:6
...
00010000:00010000:0.0:1399475307.569026:0:21604:0:(ldlm_lock.c:715:ldlm_add_bl_work_item()) ### lock incompatible; sending blocking AST. ns: filter-lustre-OST0000_UUID lock: ffff88016dacb158/0x2e4c6046c3721790 lrc: 2/0,0 mode: PR/PR res: [0x212:0x0:0x0].0 rrc: 2 type: EXT [0-&amp;gt;18446744073709551615] (req 0-&amp;gt;4095) flags: 0x40000000000000 nid: 0@lo remote: 0x2e4c6046c3721789 expref: 4 pid: 13418 timeout: 0 lvb_type: 0
...
00010000:00010000:0.0:1399475307.569074:0:21604:0:(ldlm_lockd.c:848:ldlm_server_blocking_ast()) ### server preparing blocking AST ns: filter-lustre-OST0000_UUID lock: ffff88016dacb158/0x2e4c6046c3721790 lrc: 3/0,0 mode: PR/PR res: [0x212:0x0:0x0].0 rrc: 2 type: EXT [0-&amp;gt;18446744073709551615] (req 0-&amp;gt;4095) flags: 0x50000000010020 nid: 0@lo remote: 0x2e4c6046c3721789 expref: 4 pid: 13418 timeout: 0 lvb_type: 0
...
00010000:00010000:0.0:1399475307.569081:0:21604:0:(ldlm_lockd.c:459:ldlm_add_waiting_lock()) ### adding to wait list(timeout: 150, AT: on) ns: filter-lustre-OST0000_UUID lock: ffff88016dacb158/0x2e4c6046c3721790 lrc: 4/0,0 mode: PR/PR res: [0x212:0x0:0x0].0 rrc: 2 type: EXT [0-&amp;gt;18446744073709551615] (req 0-&amp;gt;4095) flags: 0x70000000010020 nid: 0@lo remote: 0x2e4c6046c3721789 expref: 4 pid: 13418 timeout: 4451652756 lvb_type: 0
...
00000100:00000040:0.0:1399475307.569088:0:21604:0:(lustre_net.h:3296:ptlrpc_rqphase_move()) @@@ move req &quot;New&quot; -&amp;gt; &quot;Rpc&quot;  req@ffff88012c7032f0 x1467451757057916/t0(0) o104-&amp;gt;lustre-OST0000@0@lo:15/16 lens 296/224 e 0 to 0 dl 0 ref 1 fl New:N/0/ffffffff rc 0/-1
...
00000100:00100000:0.0:1399475307.569097:0:21604:0:(client.c:1480:ptlrpc_send_new_req()) Sending RPC pname:cluuid:pid:xid:nid:opc ll_ost00_004:lustre-OST0000_UUID:21604:1467451757057916:0@lo:104
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="83424" author="di.wang" created="Wed, 7 May 2014 18:13:35 +0000"  >&lt;p&gt;As I understand, we actually do ELC on OSC as well. But after 2.4, it is the MDT who will send destroy request to the OST, then OST  revoke the lock on the client cache and destroy the object. So client does not have chance to do ELC here, so we might see some blocking RPC here.  IIRC, test_120 only suppose to check ELC for metadata object, we probably need fix the test script here. &lt;/p&gt;</comment>
                            <comment id="83425" author="jhammond" created="Wed, 7 May 2014 18:15:59 +0000"  >&lt;p&gt;Yes, from osc_destroy() but this is not until after md_rename() has returned.&lt;/p&gt;</comment>
                            <comment id="83427" author="adilger" created="Wed, 7 May 2014 18:33:36 +0000"  >&lt;p&gt;The correct solution is to have the client cancel the OST locks for the file&apos;s objects if it thinks this is the last unlink of the file and it is not open on the client.  That should avoid the blocking RPC from the OST, and also allow the page cleanup and OST RPCs to overlap with the MDT processing of the unlink.&lt;/p&gt;</comment>
                            <comment id="83428" author="di.wang" created="Wed, 7 May 2014 18:48:34 +0000"  >&lt;p&gt;&lt;a href=&quot;http://review.whamcloud.com/10250&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/10250&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="83447" author="di.wang" created="Wed, 7 May 2014 21:10:23 +0000"  >&lt;p&gt;Andreas: I just talked to jinshan, there are no proper interface for us to implement this(cancel OST locks directly from llite). And according to jinshan&apos;s idea,  there might be proper interface for us to do this after CLIO cleanup project, according to jinshan&apos;s comment, so we probably temporary fix this in script for now?&lt;/p&gt;</comment>
                            <comment id="83544" author="jlevi" created="Thu, 8 May 2014 17:31:34 +0000"  >&lt;p&gt;Patch landed to Master.&lt;/p&gt;</comment>
                            <comment id="97711" author="yujian" created="Tue, 28 Oct 2014 16:49:43 +0000"  >&lt;p&gt;The failure also occurred on Lustre b2_5 branch:&lt;br/&gt;
&lt;a href=&quot;https://testing.hpdd.intel.com/test_sets/720ecb84-5e77-11e4-bd01-5254006e85c2&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.hpdd.intel.com/test_sets/720ecb84-5e77-11e4-bd01-5254006e85c2&lt;/a&gt;&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                                                <inwardlinks description="is duplicated by">
                                        <issuelink>
            <issuekey id="34951">LU-7812</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="22593">LU-4421</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="24213">LU-4909</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                                        </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzw7xr:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>11434</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>