<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:11:26 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-7732] check_for_next_transno()) lustre-MDT0000: waking for gap in transno, VBR is OFF </title>
                <link>https://jira.whamcloud.com/browse/LU-7732</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[ 2270.584467] LustreError: 30824:0:(ldlm_lib.c:1900:check_for_next_transno()) lustre-MDT0000: waking for gap in transno, VBR is OFF (skip: 4295194948, ql: 2, comp: 2, conn: 4, next: 4295194950, next_update 4295194960 last_committed: 4295194129)
[ 2270.587115] LustreError: 30824:0:(ldlm_lib.c:1900:check_for_next_transno()) lustre-MDT0000: waking for gap in transno, VBR is OFF (skip: 4295194951, ql: 2, comp: 2, conn: 4, next: 4295194952, next_update 4295194960 last_committed: 4295194129)
[ 2270.590338] LustreError: 30824:0:(ldlm_lib.c:1900:check_for_next_transno()) lustre-MDT0000: waking for gap in transno, VBR is OFF (skip: 4295194955, ql: 2, comp: 2, conn: 4, next: 4295194958, next_update 4295194960 last_committed: 4295194129)
[ 2270.592946] LustreError: 30824:0:(ldlm_lib.c:1900:check_for_next_transno()) lustre-MDT0000: waking for gap in transno, VBR is OFF (skip: 4295194959, ql: 2, comp: 2, conn: 4, next: 4295194960, next_update 4295194960 last_committed: 4295194129)
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;I met a lot such message during replay test.  And it turns out those losing transaction numbers are actually coming from open/close request, because the open request (no create) will be destroyed(removed from replay list) after close, no matter if the transaction is committed or not, i.e. open request will not be replayed usually.&lt;/p&gt;

&lt;p&gt;So the question here is that should we have to allocate transno to open (no create) request? If it is necessary, then we at least should turn off these annoying console message, since they are not real error.&lt;/p&gt;

</description>
                <environment></environment>
        <key id="34438">LU-7732</key>
            <summary>check_for_next_transno()) lustre-MDT0000: waking for gap in transno, VBR is OFF </summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="bogl">Bob Glossman</assignee>
                                    <reporter username="di.wang">Di Wang</reporter>
                        <labels>
                    </labels>
                <created>Tue, 2 Feb 2016 18:44:59 +0000</created>
                <updated>Wed, 26 Jun 2019 15:17:15 +0000</updated>
                            <resolved>Thu, 11 Aug 2016 12:05:31 +0000</resolved>
                                                    <fixVersion>Lustre 2.9.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>10</watches>
                                                                            <comments>
                            <comment id="149051" author="tappro" created="Fri, 15 Apr 2016 05:46:36 +0000"  >&lt;p&gt;normally open replays don&apos;t issue such error messages. These particular messages show that we have a problem here:&lt;br/&gt;
last_commited is 4295194129, replayed request is 4295194948, so it is larger than last_committed value that means there should be no gaps if all clients were connected. I see 4 clients are connected but can&apos;t see how many is expected to connect. So that is one of the reason.&lt;br/&gt;
If all clients are connected, then we have a possible problem with recovery - some replays are missing or transaction number were generated not sequentially, e.g. several transno per RPCs or so. Maybe this is result of update recovery? This can be not a problem at all, just result of changed recovery basics, e.g. if gaps become possible even after last_committed.&lt;/p&gt;</comment>
                            <comment id="149052" author="di.wang" created="Fri, 15 Apr 2016 06:03:46 +0000"  >&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;normally open replays don&apos;t issue such error messages.
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;From what I see, this is because of open replay. Note: in current implementation, open (non create) request will be destroyed (removed from the replay list) when close is done, and it does not check the committed transaction at all, that is why you will see a lot skip transno here. I do not think this is related with update recovery, at least in theory.  Even in single MDT replay, you will see such annoying message.&lt;/p&gt;</comment>
                            <comment id="149100" author="tappro" created="Fri, 15 Apr 2016 15:26:08 +0000"  >&lt;p&gt;you are right, that can be the reason of gaps after last_committed. in that case I agree that D_ERROR level is not needed for such messages&lt;/p&gt;</comment>
                            <comment id="154550" author="adilger" created="Fri, 3 Jun 2016 07:45:31 +0000"  >&lt;p&gt;Mike, is this just a matter of turning this CERROR() into a CDEBUG(), or is there some way we can limit it to specific cases where open is involved?  I guess if the RPC is freed then there is no way to know whether it was an open or not?&lt;/p&gt;

&lt;p&gt;In any case, this message is spewing to the console during replay-dual testing (e.g. &lt;a href=&quot;https://testing.hpdd.intel.com/test_logs/a6e8a554-2901-11e6-a0ce-5254006e85c2/show_text&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.hpdd.intel.com/test_logs/a6e8a554-2901-11e6-a0ce-5254006e85c2/show_text&lt;/a&gt;) and should really be fixed in some way before 2.9.0 is released.  Is there a particular bug/patch that caused this message to appear that this ticket should link to, so we might better understand why it has started to appear?&lt;/p&gt;</comment>
                            <comment id="158625" author="doug" created="Wed, 13 Jul 2016 16:00:24 +0000"  >&lt;p&gt;The patch submitted by Olaf was mis-titled and is not a fix for this ticket.  There is no fix for this yet.&lt;/p&gt;</comment>
                            <comment id="158631" author="olaf" created="Wed, 13 Jul 2016 16:17:22 +0000"  >&lt;p&gt;Thanks Doug. Apologies for the confusion caused by my typo.&lt;/p&gt;</comment>
                            <comment id="158640" author="bogl" created="Wed, 13 Jul 2016 16:54:20 +0000"  >&lt;p&gt;if the needed fix is just to eliminate use of D_ERROR in this particular error message and always use D_HA, I can easily construct such a patch.  If something more clever is required it probably needs an expert on this part of the code.&lt;/p&gt;</comment>
                            <comment id="159192" author="adilger" created="Tue, 19 Jul 2016 11:14:01 +0000"  >&lt;p&gt;Bob, could you please make a patch changing D_ERROR to D_HA.  I&apos;m not sure that is all that is needed, but it is a start.&lt;/p&gt;</comment>
                            <comment id="159201" author="gerrit" created="Tue, 19 Jul 2016 13:20:30 +0000"  >&lt;p&gt;Bob Glossman (bob.glossman@intel.com) uploaded a new patch: &lt;a href=&quot;http://review.whamcloud.com/21418&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/21418&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-7732&quot; title=&quot;check_for_next_transno()) lustre-MDT0000: waking for gap in transno, VBR is OFF &quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-7732&quot;&gt;&lt;del&gt;LU-7732&lt;/del&gt;&lt;/a&gt; ldlm: silence verbose &quot;waking for gap&quot; log messages&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 6e3e224388a4224cf68ed0666126ff541197c061&lt;/p&gt;</comment>
                            <comment id="161554" author="gerrit" created="Thu, 11 Aug 2016 05:51:39 +0000"  >&lt;p&gt;Oleg Drokin (oleg.drokin@intel.com) merged in patch &lt;a href=&quot;http://review.whamcloud.com/21418/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/21418/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-7732&quot; title=&quot;check_for_next_transno()) lustre-MDT0000: waking for gap in transno, VBR is OFF &quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-7732&quot;&gt;&lt;del&gt;LU-7732&lt;/del&gt;&lt;/a&gt; ldlm: silence verbose &quot;waking for gap&quot; log messages&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: f073f11b860fcb42707c50b63bed2b2294ceeeba&lt;/p&gt;</comment>
                            <comment id="161586" author="pjones" created="Thu, 11 Aug 2016 12:05:31 +0000"  >&lt;p&gt;Landed for 2.9&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                                                <inwardlinks description="is duplicated by">
                                                        </inwardlinks>
                                    </issuelinktype>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                                        </outwardlinks>
                                                                <inwardlinks description="is related to">
                                                        </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzy013:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>