<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:41:08 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-11123] LustreError in ll_xattr_list() server bug: replied size 236 &gt; 132</title>
                <link>https://jira.whamcloud.com/browse/LU-11123</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Hello,&lt;/p&gt;

&lt;p&gt;Today our users started to report intermittent file access issues on Oak. I noticed the following messages on one client (2.10.4):&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;
Jul 05 14:32:21 sh-ln01.stanford.edu kernel: LustreError: 155141:0:(xattr.c:377:ll_xattr_list()) server bug: replied size 236 &amp;gt; 132
Jul 05 14:32:41 sh-ln01.stanford.edu kernel: LustreError: 171588:0:(xattr.c:377:ll_xattr_list()) server bug: replied size 164 &amp;gt; 132
Jul 05 14:32:41 sh-ln01.stanford.edu kernel: LustreError: 171588:0:(xattr.c:377:ll_xattr_list()) Skipped 5 previous similar messages
Jul 05 14:32:47 sh-ln01.stanford.edu kernel: LustreError: 176583:0:(xattr.c:377:ll_xattr_list()) server bug: replied size 172 &amp;gt; 132
Jul 05 14:32:47 sh-ln01.stanford.edu kernel: LustreError: 176583:0:(xattr.c:377:ll_xattr_list()) Skipped 59 previous similar messages
Jul 05 14:33:23 sh-ln01.stanford.edu kernel: LustreError: 10776:0:(xattr.c:377:ll_xattr_list()) server bug: replied size 172 &amp;gt; 132
Jul 05 14:33:23 sh-ln01.stanford.edu kernel: LustreError: 10776:0:(xattr.c:377:ll_xattr_list()) Skipped 58 previous similar messages
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;These error messages are the only Lustre errors I can see on this impacted client; however, they are not very helpful, as I&apos;m not even sure whether they came from Oak or another Lustre filesystem...&lt;/p&gt;

&lt;p&gt;The impacted directories use ACLs, but only a few (fewer than 10). We have other directories with &amp;gt;32 ACLs and haven&apos;t seen this issue.&lt;/p&gt;

&lt;p&gt;The issue doesn&apos;t seem to be easily reproducible either. I&apos;m still investigating.&lt;/p&gt;

&lt;p&gt;If you have any ideas on how to troubleshoot this, please let me know.&lt;/p&gt;

&lt;p&gt;Thanks!&lt;br/&gt;
 Stephane&lt;/p&gt;

</description>
                <environment>clients: 2.10.4 clients, servers: 2.10.3 + &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10783&quot; title=&quot;kernel update [RHEL7.4 3.10.0-693.21.1.el7]&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10783&quot;&gt;&lt;strike&gt;LU-10783&lt;/strike&gt;&lt;/a&gt; (kernel update RHEL7.4)</environment>
        <key id="52647">LU-11123</key>
            <summary>LustreError in ll_xattr_list() server bug: replied size 236 &gt; 132</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="5">Cannot Reproduce</resolution>
                                        <assignee username="jhammond">John Hammond</assignee>
                                    <reporter username="sthiell">Stephane Thiell</reporter>
                        <labels>
                    </labels>
                <created>Thu, 5 Jul 2018 21:56:06 +0000</created>
                <updated>Sun, 29 Jul 2018 05:13:48 +0000</updated>
                            <resolved>Sun, 29 Jul 2018 05:11:12 +0000</resolved>
                                    <version>Lustre 2.10.3</version>
                    <version>Lustre 2.10.4</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                            <comments>
                            <comment id="229990" author="sthiell" created="Thu, 5 Jul 2018 23:23:24 +0000"  >&lt;p&gt;NOTE: I&apos;m not actually sure whether I should post here for our former Intel Oak support; please advise.&lt;/p&gt;

&lt;p&gt;But... After further investigation, it seems that these messages could be a side effect of a known&#160;limitation of nodemapping/Lustre permissions/caching, rather than the root cause of our issue, which has been identified.&lt;/p&gt;

&lt;p&gt;It would be nice to definitively fix the client inode cache on Lustre to avoid confusion, as already&#160;explained in&#160;&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-10884&quot; title=&quot;stat() on lustre mount point / limited client trust in l_getidentity&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-10884&quot;&gt;LU-10884&lt;/a&gt;.&lt;/p&gt;</comment>
                            <comment id="230016" author="pjones" created="Fri, 6 Jul 2018 17:27:51 +0000"  >&lt;p&gt;John&lt;/p&gt;

&lt;p&gt;Can you please advise?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="230109" author="jhammond" created="Tue, 10 Jul 2018 14:05:17 +0000"  >&lt;p&gt;Hi Stephane,&lt;/p&gt;

&lt;p&gt;Could you give any more detail about the file access issues? Is there any Samba or NFS export involved on this node? Some thoughts:&lt;/p&gt;

&lt;p&gt;This error message cannot be reached for the &quot;system.posix_acl_access&quot; xattr but it can be for the &quot;system.posix_acl_default&quot; and the expected and returned value sizes look right for that xattr. It&apos;s not clear who or what is asking for &quot;system.posix_acl_default&quot; since this xattr is really used on the server. Note that this message is easy to produce by creating a directory with enough default ACLs and then using setfacl or getfacl on it.&lt;/p&gt;

&lt;p&gt;The message is a bit misleading since the server does not actually consider the size of the client-side buffer. Instead, it just creates the reply with a large enough buffer and sends the value back. And the client-side getxattr code is actually handling this correctly by returning &lt;tt&gt;-ERANGE&lt;/tt&gt;.&lt;/p&gt;

&lt;p&gt;The change &lt;a href=&quot;https://review.whamcloud.com/#/c/32739/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/#/c/32739/&lt;/a&gt; (&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-11074&quot; title=&quot;Invalid argument reading file caps&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-11074&quot;&gt;&lt;del&gt;LU-11074&lt;/del&gt;&lt;/a&gt; mdc: set correct body eadatasize for getxattr()) may help you avoid this situation. Would you be willing to try it?&lt;/p&gt;</comment>
                            <comment id="231032" author="sthiell" created="Sun, 29 Jul 2018 03:20:42 +0000"  >&lt;p&gt;Hi John,&lt;br/&gt;
Thanks for your reply and detailed explanation (and sorry for the delay, all notification emails from whamcloud.com went into my Clutter mailbox...).&lt;br/&gt;
This was on a Sherlock login node, so no SMB/NFS export involved there. We haven&apos;t seen the problem again so I don&apos;t think it&apos;s worth patching just for that at this point.&lt;/p&gt;</comment>
                            <comment id="231034" author="pjones" created="Sun, 29 Jul 2018 05:11:12 +0000"  >&lt;p&gt;OK, Stephane. Meanwhile, the fix is queued up for a future LTS release, so hopefully you&apos;ll get it in due course anyway.&lt;/p&gt;</comment>
                            <comment id="231035" author="sthiell" created="Sun, 29 Jul 2018 05:13:16 +0000"  >&lt;p&gt;Awesome, thanks Peter.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="52524">LU-11074</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzzyt3:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>