<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:18:14 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-1619] Subtle and infrequent failure of test 39a in sanityn.sh noticed after ns timestamp implementation</title>
                <link>https://jira.whamcloud.com/browse/LU-1619</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;The results of a Maloo automated test of the nanosecond timestamp patches (&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1158&quot; title=&quot;nanosecond timestamp support for Lustre&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1158&quot;&gt;LU-1158&lt;/a&gt;, &lt;a href=&quot;http://review.whamcloud.com/#change,3266&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#change,3266&lt;/a&gt; and &lt;a href=&quot;http://review.whamcloud.com/#change,3271&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#change,3271&lt;/a&gt;, the latter of which includes a revised sanityn.sh for testing nanosecond times) revealed a slight time desynchronization after file renaming in test 39a of the revised sanityn.sh, on the order of 500,000 nanoseconds (about 1/2 of a millisecond) or so. This failure occurred 1 time out of 100 trials. Since the previous maximum time granularity of Lustre was at second resolution, and this bug cropped up 1% of the time during testing, it is highly unlikely that this bug would be noticed or reproducible without application of the &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1158&quot; title=&quot;nanosecond timestamp support for Lustre&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1158&quot;&gt;LU-1158&lt;/a&gt; patches; for it to be noticed, of the 1% of times the bug occurs, the 1/2 millisecond difference would have to occur across a second boundary, i.e. the second timestamps would have to be different. However, since the patches do not change any logic outside of determining when a timestamp is greater than another, it is possible that this bug exists undetected in master.&lt;/p&gt;

&lt;p&gt;Links to related Maloo test session and sanityn.sh failure:&lt;br/&gt;
&lt;a href=&quot;https://maloo.whamcloud.com/test_sessions/fd767d96-ca51-11e1-a9ff-52540035b04c&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://maloo.whamcloud.com/test_sessions/fd767d96-ca51-11e1-a9ff-52540035b04c&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://maloo.whamcloud.com/test_sets/34a6a812-ca53-11e1-a9ff-52540035b04c&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://maloo.whamcloud.com/test_sets/34a6a812-ca53-11e1-a9ff-52540035b04c&lt;/a&gt;&lt;/p&gt;</description>
                <environment>CentOS 6.2/x86_64 on Maloo</environment>
        <key id="15183">LU-1619</key>
            <summary>Subtle and infrequent failure of test 39a in sanityn.sh noticed after ns timestamp implementation</summary>
                <type id="7" iconUrl="https://jira.whamcloud.com/images/icons/issuetypes/task_agile.png">Technical task</type>
                            <parent id="13394">LU-1158</parent>
                                    <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="3">Duplicate</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="adilger">Andreas Dilger</reporter>
                        <labels>
                            <label>ReviewNeeded</label>
                    </labels>
                <created>Tue, 10 Jul 2012 17:03:53 +0000</created>
                <updated>Fri, 23 Jun 2017 18:41:42 +0000</updated>
                            <resolved>Fri, 23 Jun 2017 18:41:42 +0000</resolved>
                                    <version>Lustre 2.3.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>0</watches>
                                                                            <comments>
                            <comment id="41695" author="adilger" created="Wed, 11 Jul 2012 04:37:01 +0000"  >

&lt;p&gt;comment&lt;br/&gt;
    Isami, thanks for filing this bug. It is always useful to track issues like this, even if they do not appear to be causing any significant problems, since I suspect it is a sign that the timestamps aren&apos;t being handled correctly somewhere, and could lead to much larger skews in the timestamps. just a note about the &quot;failure rate&quot; of test results in Maloo. The percentage reported in Maloo means &quot;1 of the past 100 tests failed&quot; (i.e. the test just run and failed), so it isn&apos;t at all clear whether this means &quot;every test from now on with this patch will fail&quot; or &quot;this was a fluke and the failure rate is 1 in 10000&quot;. If you want to get statistically significant numbers on how often this is failing, please try something like: &lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt; Test-Parameters: testlist=sanityn,sanityn,sanityn,sanityn,sanityn,sanityn,sanityn envdefinitions=ONLY=39 &lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt; or similar, so that it runs sanityn test_39* several times. There typically shouldn&apos;t be any &quot;drift&quot; in the timestamps in this manner - everything should be derived only from the client clock. There are a couple of places that this might be introduced: - different nanosecond times on different CPU cores due to clock drift. I doubt this is the case anymore, but this used to happen in the past - skew between userspace and kernel timestamps. This could only be the case if the test is somehow using the clock from userspace - bug in the code allowing the MDS or OSS timestamps to affect the file. This is the most likely cause. It would be possible to test out the last hypothesis by setting wildly different times on the MDS and OSS nodes from the client (either or both in the past or future), and then tracing where the bad timestamp is coming from (which should be easy if the timestamps can clearly be distinguished as to which node they came from).&lt;/p&gt;
</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                            <outwardlinks description="duplicates">
                                        <issuelink>
            <issuekey id="29097">LU-6366</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzw1t3:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>10374</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                            <customfield id="customfield_10002" key="com.atlassian.jira.plugin.system.customfieldtypes:float">
                        <customfieldname>Story Points</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>3.0</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>