<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:17:28 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-15339] sanity test 208 fails with &apos;lease not broken over recovery&apos;</title>
                <link>https://jira.whamcloud.com/browse/LU-15339</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;During testing on some internal branches, we&apos;ve observed sanity test 208 failing.&#160; It turns out this is a timing issue.&lt;/p&gt;



&lt;p&gt;Looking at the test, we see:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;$MULTIOP $DIR/$tfile oO_RDWR:eR_E-eUc &amp;amp;
PID=$!
sleep 1

# open file to break lease and then recovery
$MULTIOP $DIR/$tfile oO_RDWR:c || error &quot;open file error&quot;
fail mds1&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;This is one multiop asking for the lease, then a 1 second wait, then the other trying an open which will break the lease.&lt;/p&gt;

&lt;p&gt;In the test logs, we see:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;==== test 5: lease broken can&apos;t be regained by replay
multiop: cannot get READ lease, ext 0: Device or resource busy (16)
multiop: apply/unlock lease error: Device or resource busy
Failing mds1 on onyx-50vm2 &lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;So this is the first multiop getting EBUSY, which it would get if the file were already open.&lt;/p&gt;

&lt;p&gt;Looking at the debug logs, we see two processes working on the file, 3066 and 3068.&#160; Obviously 3066 was started earlier.&lt;br/&gt;
We see 3066 opening the file, getting an intent lock, etc, then requesting the lease.&#160; Somewhere during the earlier intent lock request, we start seeing messages from 3068, meaning it has started.&#160; This ioctl is the lease request:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;00000080:00200000:0.0:1638330788.087774:0:3066:0:(file.c:3518:ll_file_ioctl()) VFS Op:inode=[0x200005222:0x27a6:0x0](ffff9fddbb55f098), cmd=c00c66f3
00000002:00010000:0.0:1638330788.087789:0:3066:0:(mdc_locks.c:1228:mdc_intent_lock()) (name: ,[0x200005222:0x27a6:0x0]) in obj [0x200005222:0x27a6:0x0], intent: open flags 01004040000001
[...]
00000100:00100000:0.0:1638330788.087812:0:3066:0:(client.c:1685:ptlrpc_send_new_req()) Sending RPC pname:cluuid:pid:xid:nid:opc multiop:e3dc4b61-17a3-a1d0-026f-41b421fc661f:3066:1717912348006336:10.240.24.89@tcp:101
00000100:00100000:0.0:1638330788.087830:0:3066:0:(client.c:2409:ptlrpc_set_wait()) set ffff9fdd603fc880 going to sleep for 13 seconds&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;But before that can complete, we see the open from 3068 complete:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;00000100:00100000:0.0:1638330788.087874:0:3068:0:(client.c:2115:ptlrpc_check_set()) Completed RPC pname:cluuid:pid:xid:nid:opc multiop:e3dc4b61-17a3-a1d0-026f-41b421fc661f:3068:1717912348006272:10.240.24.89@tcp:101
[...]
00000002:00100000:0.0:1638330788.087906:0:3068:0:(mdc_request.c:801:mdc_set_open_replay_data()) @@@ Set up open replay data &#160;req@ffff9fdd55dc0000 x1717912348006272/t51539607555(51539607555) o101-&amp;gt;lustre-MDT0000-mdc-ffff9fdd58dea800@10.240.24.89@tcp:12/10 lens 752/616 e 0 to 0 dl 1638330803 ref 2 fl Complete:RP/4/0 rc 0/0  &lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;The first message logged by 3066 is at 1638330787.082310, and we can see above, the intent lock request is starting at 1638330788.087774 - over a second later.&lt;/p&gt;

&lt;p&gt;So this is just a pure timing issue.&#160; The solution should be to increase the timeouts in this test to 2 seconds.&lt;/p&gt;</description>
                <environment></environment>
        <key id="67499">LU-15339</key>
            <summary>sanity test 208 fails with &apos;lease not broken over recovery&apos;</summary>
                <type id="4" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11310&amp;avatarType=issuetype">Improvement</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="paf0186">Patrick Farrell</assignee>
                                    <reporter username="paf0186">Patrick Farrell</reporter>
                        <labels>
                    </labels>
                <created>Tue, 7 Dec 2021 21:52:49 +0000</created>
                <updated>Tue, 18 Jan 2022 15:50:17 +0000</updated>
                            <resolved>Thu, 23 Dec 2021 14:47:08 +0000</resolved>
                                                    <fixVersion>Lustre 2.15.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>2</watches>
                                                                            <comments>
                            <comment id="320239" author="gerrit" created="Tue, 7 Dec 2021 21:56:22 +0000"  >&lt;p&gt;&quot;Patrick Farrell &amp;lt;pfarrell@whamcloud.com&amp;gt;&quot; uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/45779&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/45779&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-15339&quot; title=&quot;sanity test 208 fails with &amp;#39;lease not broken over recovery&amp;#39;&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-15339&quot;&gt;&lt;del&gt;LU-15339&lt;/del&gt;&lt;/a&gt; tests: Increase timeout in sanity 208&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: b3620920b75b25620e3570bfd7cc4c491aecacdb&lt;/p&gt;</comment>
                            <comment id="321412" author="gerrit" created="Thu, 23 Dec 2021 07:20:43 +0000"  >&lt;p&gt;&quot;Oleg Drokin &amp;lt;green@whamcloud.com&amp;gt;&quot; merged in patch &lt;a href=&quot;https://review.whamcloud.com/45779/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/45779/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-15339&quot; title=&quot;sanity test 208 fails with &amp;#39;lease not broken over recovery&amp;#39;&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-15339&quot;&gt;&lt;del&gt;LU-15339&lt;/del&gt;&lt;/a&gt; tests: Increase timeout in sanity 208&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: dc015fc0b51b95151366b0355cfc90b068d98b01&lt;/p&gt;</comment>
                            <comment id="321467" author="pjones" created="Thu, 23 Dec 2021 14:47:08 +0000"  >&lt;p&gt;Landed for 2.15&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                                        </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i02bw7:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>