<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:45:07 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-11581] Not all changelog entries are returned to userspace</title>
                <link>https://jira.whamcloud.com/browse/LU-11581</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;In a Lustre 2.10+ based cluster I have observed a problem where some changelog entries are not returned to userspace. Which entries are dropped is not consistent across attempts to read them.&lt;/p&gt;

&lt;p&gt;I can reproduce this by doing the following:&lt;/p&gt;
&lt;ol&gt;
	&lt;li&gt;Register a changelog reader to enable changelog&lt;/li&gt;
	&lt;li&gt;On at least two client nodes, run a file creation/deletion loop - I use a recursive copy of /usr/include to a client-specific directory&lt;/li&gt;
	&lt;li&gt;Wait until the changelog has grown to a couple million entries.&lt;/li&gt;
	&lt;li&gt;Stop the file creation/deletion loops, and ensure the filesystem is idle.&lt;/li&gt;
	&lt;li&gt;Run &lt;tt&gt;lfs changelog&lt;/tt&gt;&#160;several times on a client and redirect the output to different files.&lt;/li&gt;
	&lt;li&gt;Compare the files.&lt;/li&gt;
&lt;/ol&gt;


&lt;p&gt;What I have observed is that I got different output files from &lt;tt&gt;lfs changelog&lt;/tt&gt; every single time. Changelog records that are absent in one of the output files are present in another and vice versa. At no point were all entries that should be in the on-disk log returned.&lt;/p&gt;

&lt;p&gt;In my (admittedly CPU-starved) virtual cluster the drop rate was approximately 1 entry per 16000 records, but in a test like the one above, a few million on-disk records are required to see the problem consistently.&lt;/p&gt;

&lt;p&gt;Notes:&lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;I originally observed this with a changelog reader that had been instrumented to detect this kind of issue. The description above shows how to reproduce it without relying on a proprietary tool.&lt;/li&gt;
	&lt;li&gt;I have not been able to reproduce this in a 2.7+ based cluster. Admittedly that one does have much more capable hardware as well.&lt;/li&gt;
	&lt;li&gt;To compare the output files with tools like &lt;tt&gt;comp&lt;/tt&gt;&#160;you need to sort them first using &apos;sort -n&apos;. This is necessary because of&#160;&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-11426&quot; title=&quot;2/2 Olafs agree: changelog entries are emitted out of order&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-11426&quot;&gt;&lt;del&gt;LU-11426&lt;/del&gt;&lt;/a&gt;&lt;/li&gt;
	&lt;li&gt;This issue may in fact be caused by&#160;&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-11426&quot; title=&quot;2/2 Olafs agree: changelog entries are emitted out of order&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-11426&quot;&gt;&lt;del&gt;LU-11426&lt;/del&gt;&lt;/a&gt;&#160;interacting with the new (in 2.10) mechanism to return changelog entries to userspace.&lt;/li&gt;
&lt;/ul&gt;
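
&lt;p&gt;The comparison in the last reproduction step can be sketched as follows. This is a minimal illustration, not the instrumented reader mentioned in the notes. It assumes that every line of saved &lt;tt&gt;lfs changelog&lt;/tt&gt; output starts with the numeric record index; the function names are hypothetical. Because it compares sets of record numbers, the files do not need to be sorted first.&lt;/p&gt;

```python
import sys


def read_record_numbers(path):
    """Return the set of record numbers found in one saved dump."""
    numbers = set()
    with open(path) as dump:
        for line in dump:
            fields = line.split()
            # Assumption: the first field of each line is the record number.
            if fields and fields[0].isdigit():
                numbers.add(int(fields[0]))
    return numbers


def missing_per_dump(dumps):
    """Map each dump name to the sorted records other dumps have but it lacks."""
    union = set().union(*dumps.values())
    return {name: sorted(union - seen) for name, seen in dumps.items()}


if __name__ == "__main__":
    # Each command-line argument is the path of one saved dump.
    dumps = {path: read_record_numbers(path) for path in sys.argv[1:]}
    for name, missing in missing_per_dump(dumps).items():
        print(f"{name}: {len(missing)} records missing, first few: {missing[:10]}")
```

&lt;p&gt;Saved under a hypothetical name such as &lt;tt&gt;compare_changelogs.py&lt;/tt&gt;, it would be run with the output files as arguments. It can only report a record that appears in at least one of the files; a record dropped from every run would go unnoticed.&lt;/p&gt;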
</description>
                <environment>Lustre 2.10 based virtual cluster</environment>
        <key id="53844">LU-11581</key>
            <summary>Not all changelog entries are returned to userspace</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="3">Duplicate</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="olaf">Olaf Weber</reporter>
                        <labels>
                            <label>changelog</label>
                    </labels>
                <created>Mon, 29 Oct 2018 16:19:10 +0000</created>
                <updated>Tue, 6 Aug 2019 14:31:29 +0000</updated>
                            <resolved>Tue, 26 Mar 2019 15:35:32 +0000</resolved>
                                    <version>Lustre 2.10.1</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>7</watches>
                                                                            <comments>
                            <comment id="235814" author="jhammond" created="Mon, 29 Oct 2018 17:09:50 +0000"  >&lt;p&gt;I agree that this is likely due to &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-11426&quot; title=&quot;2/2 Olafs agree: changelog entries are emitted out of order&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-11426&quot;&gt;&lt;del&gt;LU-11426&lt;/del&gt;&lt;/a&gt;.&lt;/p&gt;</comment>
                            <comment id="235815" author="pjones" created="Mon, 29 Oct 2018 17:10:03 +0000"  >&lt;p&gt;John&lt;/p&gt;

&lt;p&gt;Can you please advise?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="252593" author="olaf" created="Tue, 6 Aug 2019 14:31:29 +0000"  >&lt;p&gt;We have now seen this issue on systems running 2.7 based code. Out of order records do seem to play a part, but the 2.10+ mechanism for returning records to userspace does not appear to be the culprit.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="53413">LU-11426</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                    <customfield id="customfield_10030" key="com.atlassian.jira.plugin.system.customfieldtypes:labels">
                        <customfieldname>Epic/Theme</customfieldname>
                        <customfieldvalues>
                                        <label>changelog</label>
    
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i0059z:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10021"><![CDATA[2]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>