<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:39:04 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-4034] Cannot allocate memory on clients with 2.4.X</title>
                <link>https://jira.whamcloud.com/browse/LU-4034</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;One of our user noticed a strange problem during metadata operations, it looks like a memory allocation issue:&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;root@XXX ~&amp;#93;&lt;/span&gt;# ls -l /mnt/lustre/scratch/people/YYYY/SPE.SPIN/050524/28/temp.a438&lt;br/&gt;
ls: cannot access /mnt/lustre/scratch/people/YYYY/SPE.SPIN/050524/28/temp.a438: Cannot allocate memory&lt;/p&gt;

&lt;p&gt;The client log says:&lt;br/&gt;
Oct  1 16:20:11 zeus kernel: LustreError: 11-0: scratch-OST0013-osc-ffff8804925f1400: Communicating with 172.16.126.4@tcp, operation ldlm_enqueue failed with -12.&lt;br/&gt;
Oct  1 16:20:11 zeus kernel: LustreError: 23207:0:(cl_lock.c:1420:cl_unuse_try()) result = -12, this is unlikely!&lt;/p&gt;

&lt;p&gt;OSS log has:&lt;br/&gt;
Oct  1 16:20:11 scratch02 kernel: LustreError: 4630:0:(ldlm_resource.c:1165:ldlm_resource_get()) scratch-OST0013: lvbo_init failed for resource 0x40d9dcf:0x0: rc = -2&lt;/p&gt;

&lt;p&gt;Of course both servers and cients still have plenty of memory available. I&apos;ve tried to look at similar issues in Jira, however I wasn&apos;t able to find a ticket with 1:1 relation to our issue.&lt;/p&gt;</description>
                <environment>SL6.4, 2.4.1 servers and clients with some patches, which have landed to b2_4 after 2.4.1 freeze.</environment>
        <key id="21212">LU-4034</key>
            <summary>Cannot allocate memory on clients with 2.4.X</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="2" iconUrl="https://jira.whamcloud.com/images/icons/priorities/critical.svg">Critical</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="5">Cannot Reproduce</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="m.magrys">Marek Magrys</reporter>
                        <labels>
                    </labels>
                <created>Tue, 1 Oct 2013 14:22:53 +0000</created>
                <updated>Fri, 21 Jan 2022 01:25:07 +0000</updated>
                            <resolved>Fri, 21 Jan 2022 01:25:07 +0000</resolved>
                                    <version>Lustre 2.4.1</version>
                                                        <due></due>
                            <votes>3</votes>
                                    <watches>13</watches>
                                                                            <comments>
                            <comment id="68052" author="jhammond" created="Tue, 1 Oct 2013 14:31:16 +0000"  >&lt;p&gt;This reminds me that the error reporting could be improved here. ldlm_resource_get() returned NULL because the resource could not be found. Then ldlm_lock_create() returns NULL. The ldlm_handle_enqueue0() misinterprets the returned NULL as being due to an allocation failure.&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;        /* The lock&apos;s callback data might be set in the policy function */
        lock = ldlm_lock_create(ns, &amp;amp;dlm_req-&amp;gt;lock_desc.l_resource.lr_name,
                                dlm_req-&amp;gt;lock_desc.l_resource.lr_type,
                                dlm_req-&amp;gt;lock_desc.l_req_mode,
                                cbs, NULL, 0, LVB_T_NONE);
        if (!lock)
                GOTO(out, rc = -ENOMEM);
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="73870" author="m.magrys" created="Thu, 19 Dec 2013 17:51:25 +0000"  >&lt;p&gt;Any ideas anyone? I totally agree on the error reporting improvement idea, but it&apos;s a side problem here I guess. It still looks like that some object cannot be located. I&apos;ve found a bunch of files with the same problem and I wonder if it&apos;s possible to find a solution other than taking the fs offline and running lfsck?&lt;/p&gt;</comment>
                            <comment id="75397" author="kitwestneat" created="Tue, 21 Jan 2014 23:05:39 +0000"  >&lt;p&gt;I create &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-4524&quot; title=&quot;ldlm_handle_enqueue0 incorrectly interprets ldlm_lock_create&amp;#39;s NULL return value&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-4524&quot;&gt;&lt;del&gt;LU-4524&lt;/del&gt;&lt;/a&gt; for the error code problem, as we have run into that too. &lt;/p&gt;

&lt;p&gt;Marek, it looks like the object is missing on the OSS. Basically you need to figure out why the objects are missing. Was there a hard crash or data corruption on the OSS? I would look for something earlier in the logs that might explain where the objects (0x40d9dcf in this case) went.&lt;/p&gt;

&lt;p&gt;AFAIK the lfsck in 2.4.1 does not do this kind of cleanup, that is going to be in a later phase of the lfsck rewrite. You should be able to do what the old lfsck would do and unlink the files.&lt;/p&gt;</comment>
                            <comment id="165246" author="alex.ku" created="Thu, 8 Sep 2016 02:50:47 +0000"  >&lt;p&gt;I&apos;m getting similar error when trying to &quot;ls&quot; file which failed to create object (lustre 2.5.3). It looks like the error during &quot;ls&quot; is addressed at &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-4524&quot; title=&quot;ldlm_handle_enqueue0 incorrectly interprets ldlm_lock_create&amp;#39;s NULL return value&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-4524&quot;&gt;&lt;del&gt;LU-4524&lt;/del&gt;&lt;/a&gt;.&lt;br/&gt;
Should this ticket &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-4034&quot; title=&quot;Cannot allocate memory on clients with 2.4.X&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-4034&quot;&gt;&lt;del&gt;LU-4034&lt;/del&gt;&lt;/a&gt; be closed?&lt;/p&gt;</comment>
                            <comment id="233253" author="simmonsja" created="Mon, 10 Sep 2018 16:34:37 +0000"  >&lt;p&gt;Can we close this?&lt;/p&gt;</comment>
                            <comment id="233294" author="lflis" created="Mon, 10 Sep 2018 19:39:07 +0000"  >&lt;p&gt;We haven&apos;t seen this problem since upgrading to&#160; 2.5 and newer releases in Cyfronet.&lt;/p&gt;

&lt;p&gt;I&apos;d vote for closing it and Marek will probably confirm&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                                                <inwardlinks description="is related to">
                                                        </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzw4i7:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>10838</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>