<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:44:08 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-4592] mdt_reint_open()) @@@ OPEN &amp; CREAT not in open replay</title>
                <link>https://jira.whamcloud.com/browse/LU-4592</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;As part of preparation testing, the customer performed a failover test. The customer rebooted the primary MDS to confirm that the standby MDS would take over without interrupting the job. The job died when the client was unable to open a file.&lt;/p&gt;

&lt;p&gt;3 files attached.&lt;br/&gt;
mds00.20140131.17  Primary MDS that was rebooted&lt;br/&gt;
mds01.20140131.17  Secondary MDS that took over when mds00 went down&lt;br/&gt;
client.807442  Client logs from the 2 compute nodes running the job (#807442) that failed. (The two nodes are mu0104 and mu0105)&lt;/p&gt;


&lt;p&gt;Error reported on MDS01 -&lt;br/&gt;
Jan 31 17:07:12 l1-mds01 kernel: : LustreError: 18626:0:(mdt_open.c:1314:mdt_reint_open()) @@@ OPEN &amp;amp; CREAT not in open replay.  req@ffff881006dda400 x1458783605491287/t0(30064772087) o101-&amp;gt;8eb15a41-9744-ff91-d294-57256d6605bc@10.11.16.104@tcp:0/0 lens 544/4552 e 0 to 0 dl 1391213274 ref 1 fl Interpret:/4/0 rc 0/0&lt;/p&gt;

&lt;p&gt;ERRORs on client -&lt;br/&gt;
Jan 31 17:07:12 mu0104 kernel: : LustreError: 2376:0:(client.c:2634:ptlrpc_replay_interpret()) @@@ status -116, old was 0  req@ffff88025258e400 x1458783605491285/t30064772084(30064772084) o35-&amp;gt;l1-MDT0000-mdc-ffff8804014a9000@10.1.15.2@o2ib5:23/10 lens 360/424 e 0 to 0 dl 1391213270 ref 2 fl Interpret:R/4/0 rc -116/-116 &lt;br/&gt;
Jan 31 17:07:13 mu0104 kernel: : LustreError: 2376:0:(client.c:2634:ptlrpc_replay_interpret()) @@@ status -116, old was 0  req@ffff88017c924400 x1458783605697158/t30064772108(30064772108) o35-&amp;gt;l1-MDT0000-mdc-ffff8804014a9000@10.1.15.2@o2ib5:23/10 lens 360/424 e 0 to 0 dl 1391213270 ref 2 fl Interpret:R/4/0 rc -116/-116 &lt;/p&gt;</description>
                <environment>Lustre 2.1.5 servers, LLNL Chaos clients</environment>
        <key id="23024">LU-4592</key>
            <summary>mdt_reint_open()) @@@ OPEN &amp; CREAT not in open replay</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="10000">Done</resolution>
                                        <assignee username="hongchao.zhang">Hongchao Zhang</assignee>
                                    <reporter username="orentas">Oz Rentas</reporter>
                        <labels>
                    </labels>
                <created>Wed, 5 Feb 2014 23:45:35 +0000</created>
                <updated>Sat, 23 Jan 2016 01:29:16 +0000</updated>
                            <resolved>Sat, 23 Jan 2016 01:29:16 +0000</resolved>
                                    <version>Lustre 2.1.5</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>8</watches>
                                                                            <comments>
                            <comment id="76354" author="pjones" created="Thu, 6 Feb 2014 15:37:42 +0000"  >&lt;p&gt;Hongchao&lt;/p&gt;

&lt;p&gt;Could you please advise on this one?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="76465" author="hongchao.zhang" created="Fri, 7 Feb 2014 14:26:05 +0000"  >&lt;p&gt;Hi Oz,&lt;/p&gt;

&lt;p&gt;Do you mount Lustre with ACL enabled and with &quot;identity_upcall&quot; disabled?&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Jan 31 17:17:55 l1-mds01 kernel: : Lustre: 22128:0:(mdt_lproc.c:414:lprocfs_wr_identity_upcall()) l1-MDT0000: disable &quot;identity_upcall&quot; with ACL enabled maybe cause unexpected &quot;EACCESS&quot;
Jan 31 17:17:55 l1-mds01 kernel: : Lustre: 22128:0:(mdt_lproc.c:416:lprocfs_wr_identity_upcall()) l1-MDT0000: identity upcall set to NONE
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;The failure in the job is just -EACCES (permission denied):&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;...
Rank 26 Host mu0104.localdomain FATAL ERROR 1391213863: Unable to open file /lustre/lscratch1/atorrez/out.1391213561.26 for read. (errno=Permission denied) (MPI_Error = 42)
Rank 28 Host mu0104.localdomain FATAL ERROR 1391213863: Unable to open file /lustre/lscratch1/atorrez/out.1391213561.28 for read. (errno=Permission denied) (MPI_Error = 42)
Rank 29 Host mu0104.localdomain FATAL ERROR 1391213863: Unable to open file /lustre/lscratch1/atorrez/out.1391213561.29 for read. (errno=Permission denied) (MPI_Error = 42)
Rank 47 Host mu0104.localdomain FATAL ERROR 1391213863: Unable to open file /lustre/lscratch1/atorrez/out.1391213561.47 for read. (errno=Permission denied) (MPI_Error = 42)
...
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Could you please test without ACL to check whether it is the cause? Thanks!&lt;/p&gt;</comment>
                            <comment id="77098" author="orentas" created="Fri, 14 Feb 2014 17:17:59 +0000"  >&lt;p&gt;The customer reports they are not mounting with ACL support, as seen here:&lt;br/&gt;
/dev/mapper/vg_l1-mdt on /lustre/l1/mdt type lustre (rw)&lt;/p&gt;

&lt;p&gt;Any other suggestions on where we can look?&lt;/p&gt;

&lt;p&gt;Side note - On my system I was able to reproduce the error they received by setting identity_upcall to NONE and mounting with ACL:&lt;br/&gt;
Feb 11 09:56:12 es0 kernel: : Lustre: 7799:0:(mdt_lproc.c:372:lprocfs_wr_identity_upcall()) testfs-MDT0000: disable &quot;identity_upcall&quot; with ACL enabled maybe cause unexpected &quot;EACCESS&quot;&lt;/p&gt;

&lt;p&gt;[root@es0 ~]# mount | grep mdt&lt;br/&gt;
/dev/mapper/vg_testfs-mdt on /lustre/testfs/mdt type lustre (rw,acl)&lt;/p&gt;

</comment>
                            <comment id="77292" author="orentas" created="Tue, 18 Feb 2014 21:43:39 +0000"  >&lt;p&gt;Any updates on this one? &lt;/p&gt;</comment>
                            <comment id="77351" author="hongchao.zhang" created="Wed, 19 Feb 2014 13:56:05 +0000"  >&lt;p&gt;Sorry for the delayed response.&lt;/p&gt;

&lt;p&gt;From the code, this warning is only printed when the MDT is mounted with ACL enabled:&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;&lt;span class=&quot;code-keyword&quot;&gt;static&lt;/span&gt; &lt;span class=&quot;code-object&quot;&gt;int&lt;/span&gt; lprocfs_wr_identity_upcall(struct file *file, &lt;span class=&quot;code-keyword&quot;&gt;const&lt;/span&gt; &lt;span class=&quot;code-object&quot;&gt;char&lt;/span&gt; *buffer,
                                      unsigned &lt;span class=&quot;code-object&quot;&gt;long&lt;/span&gt; count, void *data)
{
        ...
        &lt;span class=&quot;code-keyword&quot;&gt;if&lt;/span&gt; (strcmp(hash-&amp;gt;uc_upcall, &lt;span class=&quot;code-quote&quot;&gt;&quot;NONE&quot;&lt;/span&gt;) == 0 &amp;amp;&amp;amp; mdt-&amp;gt;mdt_opts.mo_acl)   &amp;lt;---- here, &lt;span class=&quot;code-quote&quot;&gt;&quot;mo_acl is 1&quot;&lt;/span&gt;
                CWARN(&lt;span class=&quot;code-quote&quot;&gt;&quot;%s: disable \&quot;&lt;/span&gt;identity_upcall\&lt;span class=&quot;code-quote&quot;&gt;&quot; with ACL enabled maybe &quot;&lt;/span&gt;
                      &lt;span class=&quot;code-quote&quot;&gt;&quot;cause unexpected \&quot;&lt;/span&gt;EACCESS\&lt;span class=&quot;code-quote&quot;&gt;&quot;\n&quot;&lt;/span&gt;, mdt_obd_name(mdt));
        ...
}
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Is it possible that mds00 mounts without ACL but mds01 with it?&lt;/p&gt;

&lt;p&gt;Thanks!&lt;/p&gt;</comment>
                            <comment id="80333" author="bobbielind" created="Wed, 26 Mar 2014 19:58:42 +0000"  >&lt;p&gt;After being onsite with the customer, I can confirm that, according to the output of the mount command, the system appears NOT to be mounted with ACLs:&lt;/p&gt;

&lt;p&gt;/dev/mapper/vg_l1-mdt on /lustre/l1/mdt type lustre (rw)&lt;/p&gt;

&lt;p&gt;To repeat Oz&apos;s question: is there another place where it might show as being mounted with ACLs that I can check the next time I&apos;m onsite?&lt;/p&gt;</comment>
                            <comment id="81422" author="hongchao.zhang" created="Fri, 11 Apr 2014 14:39:47 +0000"  >&lt;p&gt;Currently, the mount options are not printed in the mount info when the mount type is &quot;lustre&quot; (they are shown when the device is mounted with the &quot;ldiskfs&quot; type).&lt;/p&gt;

&lt;p&gt;The default mount options could contain &quot;ACL&quot; (that is the case on my local RHEL6.5/x86_64 node).&lt;br/&gt;
Could you please mount the MDT with &quot;-o noacl&quot; explicitly and retest? Thanks.&lt;/p&gt;</comment>
                            <comment id="139708" author="hongchao.zhang" created="Fri, 22 Jan 2016 05:33:14 +0000"  >&lt;p&gt;Hi Oz,&lt;br/&gt;
Do you need any more work on this ticket, or can we close it?&lt;br/&gt;
Thanks&lt;/p&gt;</comment>
                            <comment id="139710" author="orentas" created="Fri, 22 Jan 2016 05:37:06 +0000"  >&lt;p&gt;Yes, it can be closed. Thanks.&lt;/p&gt;</comment>
                            <comment id="139812" author="jfc" created="Sat, 23 Jan 2016 01:29:16 +0000"  >&lt;p&gt;Thanks Oz and Hongchao.&lt;/p&gt;

&lt;p&gt;~ jfc.&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                            <attachment id="14049" name="client.807442" size="27517" author="orentas" created="Wed, 5 Feb 2014 23:45:35 +0000"/>
                            <attachment id="14050" name="mds00.20140131.17" size="93987" author="orentas" created="Wed, 5 Feb 2014 23:45:35 +0000"/>
                            <attachment id="14051" name="mds01.20140131.17" size="18683" author="orentas" created="Wed, 5 Feb 2014 23:45:35 +0000"/>
                            <attachment id="14052" name="slurm-807442.out" size="22190" author="orentas" created="Wed, 5 Feb 2014 23:45:35 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzween:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>12545</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10021"><![CDATA[2]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>