<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:36:01 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-3684] LBUG/&quot;ldlm_lock_decref_internal_nolock()) ASSERTION(lock-&gt;l_readers &gt; 0) failed&quot; running Bull&apos;s NFS locktests</title>
                <link>https://jira.whamcloud.com/browse/LU-3684</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;We need to fix the bug reported here: &lt;a href=&quot;https://jira.hpdd.intel.com/browse/LU-1126&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jira.hpdd.intel.com/browse/LU-1126&lt;/a&gt; before installing Sas Grid Manager.&lt;/p&gt;

&lt;p&gt;The Lustre filesystem is mounted on client using the -o flock option&lt;/p&gt;</description>
                <environment>Client Lustre 1.8.7  jenkins-wc1--PRISTINE-2.6.18-274.3.1.el5 RHEL 5.7&lt;br/&gt;
Server Lustre 1.8.7 jenkins-wc1--PRISTINE-2.6.18-274.3.1.el5_lustre.g9500ebf RHEL 5.7 </environment>
        <key id="20148">LU-3684</key>
            <summary>LBUG/&quot;ldlm_lock_decref_internal_nolock()) ASSERTION(lock-&gt;l_readers &gt; 0) failed&quot; running Bull&apos;s NFS locktests</summary>
                <type id="3" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11318&amp;avatarType=issuetype">Task</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="bfaccini">Bruno Faccini</assignee>
                                    <reporter username="lustre.support">Supporto Lustre Jnet2000</reporter>
                        <labels>
                    </labels>
                <created>Thu, 1 Aug 2013 14:53:31 +0000</created>
                <updated>Wed, 16 Apr 2014 09:42:40 +0000</updated>
                            <resolved>Mon, 3 Mar 2014 12:01:59 +0000</resolved>
                                    <version>Lustre 1.8.7</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>4</watches>
                                                                            <comments>
                            <comment id="63483" author="pjones" created="Thu, 1 Aug 2013 16:46:48 +0000"  >&lt;p&gt;Thanks for the report. We are looking into the best option&lt;/p&gt;</comment>
                            <comment id="65538" author="bfaccini" created="Mon, 2 Sep 2013 10:25:36 +0000"  >&lt;p&gt;Hello, &lt;br/&gt;
I wonder if you can better detail the issue you experience by running SAS Grid Manager on top of Lustre 1.8 ?? This could help me to better qualify this ticket and also to change its title accordingly!&lt;/p&gt;

&lt;p&gt;Since your feeling is that your problem is still the one originally addressed by &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1126&quot; title=&quot;Client file locking issue. Assertion triggered when decrementing a read lock on an item that has no existing read locks.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1126&quot;&gt;&lt;del&gt;LU-1126&lt;/del&gt;&lt;/a&gt; and that you may already be aware of my last work/update in it, I would like you to help me determine if it is still the original one described in &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1126&quot; title=&quot;Client file locking issue. Assertion triggered when decrementing a read lock on an item that has no existing read locks.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1126&quot;&gt;&lt;del&gt;LU-1126&lt;/del&gt;&lt;/a&gt; (wrong lock mode used versus readers/writers counters) and likely to be reproduced by flock.c provided program, or the one I reported to be still present in master (race during lock destroy when overlap detection) and easily reproducible with &quot;BULL&apos;s NFS Locktests&quot;.&lt;/p&gt;

&lt;p&gt;Do you think you can again provide the Lustre debug-log (The way Oleg described in &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1126&quot; title=&quot;Client file locking issue. Assertion triggered when decrementing a read lock on an item that has no existing read locks.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1126&quot;&gt;&lt;del&gt;LU-1126&lt;/del&gt;&lt;/a&gt; would be the best since it is for a full-trace, but at least with &quot;dlmtrace&quot; enabled) of a new occurence ??&lt;/p&gt;

&lt;p&gt;Thanks in advance for your help.&lt;/p&gt;</comment>
                            <comment id="65556" author="lustre.support" created="Mon, 2 Sep 2013 15:24:49 +0000"  >&lt;p&gt;Dear Bruno, currently we don&apos;t have SAS Grid Manager installed on our system. Before installing it, sas support team require that the bug reported here  &lt;a href=&quot;https://jira.hpdd.intel.com/browse/LU-1126&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jira.hpdd.intel.com/browse/LU-1126&lt;/a&gt;  must to be fixed.&lt;br/&gt;
Instead, we have surely the bug that can be reproduced with &quot;BULL&apos;s NFS Locktests&quot;.&lt;br/&gt;
If you wish, we can provide the lustre log of the BULL test run.&lt;br/&gt;
Regards&lt;/p&gt;</comment>
                            <comment id="65557" author="bfaccini" created="Mon, 2 Sep 2013 15:35:50 +0000"  >&lt;p&gt;That would be nice if I can get the Lustre debug-log taken during &quot;BULL&apos;s NFS Locktests&quot; run at your site! Thanks in advance.&lt;/p&gt;</comment>
                            <comment id="65921" author="lustre.support" created="Fri, 6 Sep 2013 07:42:33 +0000"  >&lt;p&gt;Dear Bruno, we have attached the log of lustre client crashed during the execution  of the BULL test. &lt;br/&gt;
Regards&lt;/p&gt;</comment>
                            <comment id="66087" author="bfaccini" created="Mon, 9 Sep 2013 16:32:10 +0000"  >&lt;p&gt;I checked the lustre-log you provided and it is definitelly the same problem triggered by &quot;BULL&apos;s NFS Locktests&quot; (and not the original one in &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1126&quot; title=&quot;Client file locking issue. Assertion triggered when decrementing a read lock on an item that has no existing read locks.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1126&quot;&gt;&lt;del&gt;LU-1126&lt;/del&gt;&lt;/a&gt; with custom reproducer) that I fixed with change &lt;a href=&quot;http://review.whamcloud.com/7134&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/7134&lt;/a&gt; in master, like already described in &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1126&quot; title=&quot;Client file locking issue. Assertion triggered when decrementing a read lock on an item that has no existing read locks.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1126&quot;&gt;&lt;del&gt;LU-1126&lt;/del&gt;&lt;/a&gt;.&lt;/p&gt;</comment>
                            <comment id="66183" author="bfaccini" created="Tue, 10 Sep 2013 14:57:52 +0000"  >&lt;p&gt;Since this ticket definitely addresses a different scenario (even if LBUG/&quot;ldlm_lock_decref_internal_nolock()) ASSERTION(lock-&amp;gt;l_readers &amp;gt; 0) failed&quot; is the same!!) than the original reported as part of &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1126&quot; title=&quot;Client file locking issue. Assertion triggered when decrementing a read lock on an item that has no existing read locks.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1126&quot;&gt;&lt;del&gt;LU-1126&lt;/del&gt;&lt;/a&gt; but where it was also/already described, I would like to set this ticket as the main/tracking one for this particular case.&lt;/p&gt;

&lt;p&gt;Just to be complete about the differences between the 2 problems for &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1126&quot; title=&quot;Client file locking issue. Assertion triggered when decrementing a read lock on an item that has no existing read locks.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1126&quot;&gt;&lt;del&gt;LU-1126&lt;/del&gt;&lt;/a&gt; and this ticket :&lt;/p&gt;

&lt;p&gt;          _ &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1126&quot; title=&quot;Client file locking issue. Assertion triggered when decrementing a read lock on an item that has no existing read locks.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1126&quot;&gt;&lt;del&gt;LU-1126&lt;/del&gt;&lt;/a&gt;, has been open for a particular race/scenario where a transient lock, result/return for a F_GETLK request, has to be destroyed due to overlap, but this occurs during the very short window where its changed mode (PR/PW) has become incompatible with its counter (l_readers/l_writers). Thus the LBUG occurs because the wrong counter will be decremented. This particular problem only show up with the custom reproducer (flock.c) provided for &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1126&quot; title=&quot;Client file locking issue. Assertion triggered when decrementing a read lock on an item that has no existing read locks.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1126&quot;&gt;&lt;del&gt;LU-1126&lt;/del&gt;&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;          _ this is different to the scenario for this ticket&apos;s problem, where a race can occur between 2 threads who want to destroy (one to finish corresponding request processing, the other due to overlap rules) the same lock, mainly during F_UNLCK multiple/concurent requests handling. Thus the LBUG occurs because 2nd thread found counter already set to 0. This particular problem show up very easily when running, as you experienced, &quot;Bull&apos;s NFS Locktests&quot;. This test is available at &lt;a href=&quot;http://nfsv4.bullopensource.org/tools/tests/locktest.php&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://nfsv4.bullopensource.org/tools/tests/locktest.php&lt;/a&gt;, and provided as &quot;locktests.tar.gz&quot; distro I attached here. Easy way to reproduce is to run in pthread mode like &quot;locktests -n 10 -T -f &amp;lt;Lustre-File&amp;gt;&quot; on a single+full Lustre node (ie, like after intalling Lustre and running &quot;llmount.sh&quot;).&lt;/p&gt;

&lt;p&gt;As I said, problem has been fixed in master with &lt;a href=&quot;http://review.whamcloud.com/7134&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/7134&lt;/a&gt;, b1_8 patch is at &lt;a href=&quot;http://review.whamcloud.com/7420&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/7420&lt;/a&gt; now.&lt;/p&gt;

&lt;p&gt;Also, I would like to change this ticket&apos;s title as it is definitely not the same problem/race than the one addressed in &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1126&quot; title=&quot;Client file locking issue. Assertion triggered when decrementing a read lock on an item that has no existing read locks.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1126&quot;&gt;&lt;del&gt;LU-1126&lt;/del&gt;&lt;/a&gt;, something like &quot;&lt;span class=&quot;error&quot;&gt;&amp;#91;ldlm_lock_decref_internal_nolock()) ASSERTION(lock-&amp;gt;l_readers &amp;gt; 0) failed&amp;#93;&lt;/span&gt; running Bull&apos;s NFS locktests&quot;.&lt;/p&gt;

&lt;p&gt;Will also add reference to this ticket in &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1126&quot; title=&quot;Client file locking issue. Assertion triggered when decrementing a read lock on an item that has no existing read locks.&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1126&quot;&gt;&lt;del&gt;LU-1126&lt;/del&gt;&lt;/a&gt; to complete split-up between the 2 different problems.&lt;/p&gt;
</comment>
                            <comment id="78101" author="gabriele.paciucci" created="Fri, 28 Feb 2014 15:50:35 +0000"  >&lt;p&gt;The customer is currently in the process to upgrade to 1.8.9+patch. So please close this ticket.&lt;/p&gt;</comment>
                            <comment id="81718" author="bfaccini" created="Wed, 16 Apr 2014 09:42:40 +0000"  >&lt;p&gt;b2_4 patch version is at &lt;a href=&quot;http://review.whamcloud.com/9968&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/9968&lt;/a&gt;.&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                            <attachment id="13450" name="locktests.tar.gz" size="12354" author="bfaccini" created="Tue, 10 Sep 2013 15:02:07 +0000"/>
                            <attachment id="13435" name="lustre_log.txt.gz" size="3089215" author="lustre.support" created="Fri, 6 Sep 2013 07:42:33 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzvx0f:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9511</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>