<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:21:04 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-1947] OST ZFS grant shortage on precreate</title>
                <link>https://jira.whamcloud.com/browse/LU-1947</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;It seems there is some kind of grant leak or shortage for precreating objects, or the ZFS estimation of how many objects can be created.  This is happening with small OSTs, but I suspect the same problem will happen when larger OSTs are full, seen during a run of sanity.sh.  The first error is seen early on:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;ep 15 03:21:13 sookie-gig kernel: Lustre: DEBUG MARKER: == sanity test 27b: create two stripe file =========================================================== 03:21:13 (1347700873)
Sep 15 03:21:23 sookie-gig kernel: LustreError: 11997:0:(lov_request.c:593:lov_update_create_set()) error creating fid 0x62 sub-object on OST idx 2/2: rc = -28
Sep 15 03:21:23 sookie-gig kernel: LustreError: 11997:0:(lov_request.c:593:lov_update_create_set()) error creating fid 0x62 sub-object on OST idx 2/2: rc = -5
Sep 15 03:25:25 sookie-gig kernel: Lustre: 4427:0:(ofd_obd.c:1168:ofd_create()) testfs-OST0001: failed to acquire grant space to precreate 0 objects
Sep 15 03:25:25 sookie-gig kernel: Lustre: 4427:0:(ofd_obd.c:1168:ofd_create()) Skipped 2293017 previous similar messages
Sep 15 03:25:25 sookie-gig kernel: Lustre: 15270:0:(ofd_obd.c:1192:ofd_create()) testfs-OST0002: unable to precreate [0x0:0xe1:0x0]: rc = -28
Sep 15 03:25:25 sookie-gig kernel: Lustre: 15270:0:(ofd_obd.c:1192:ofd_create()) Skipped 2293021 previous similar messages
Sep 15 03:33:57 sookie-gig kernel: Lustre: 4427:0:(ofd_obd.c:1168:ofd_create()) testfs-OST0001: failed to acquire grant space to precreate 0 objects
Sep 15 03:33:57 sookie-gig kernel: Lustre: 4427:0:(ofd_obd.c:1168:ofd_create()) Skipped 4886890 previous similar messages
Sep 15 03:33:57 sookie-gig kernel: Lustre: 4428:0:(ofd_obd.c:1192:ofd_create()) testfs-OST0001: unable to precreate [0x0:0x101:0x0]: rc = -28
Sep 15 03:33:57 sookie-gig kernel: Lustre: 4428:0:(ofd_obd.c:1192:ofd_create()) Skipped 4886885 previous similar messages
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;It appears that the precreate code is busy-looping in precreate, since it is calling this millions of times in a few minutes.  The ofd_create() messages are changed from CDEBUG() to CWARN() for debugging.&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[root@sookie-gig lustre-head]# df
Filesystem           1K-blocks      Used Available Use% Mounted on
testfs-ost1/ost1        999424     71296    926080   8% /mnt/ost1
testfs-ost2/ost2        999424     39296    958080   4% /mnt/ost2
testfs-ost3/ost3        999424     35200    962176   4% /mnt/ost3
[root@sookie-gig lustre-head]# df -i
Filesystem            Inodes   IUsed   IFree IUse% Mounted on
testfs-ost1/ost1        8011     760    7251   10% /mnt/ost1
testfs-ost2/ost2        7962     461    7501    6% /mnt/ost2
testfs-ost3/ost3        7931     398    7533    6% /mnt/ost3
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Only about 10% of the filesystem is full, and ZFS itself thinks that there are free inodes that could be created.  The grant statistics also don&apos;t appear to show a shortage of grant:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;# lctl get_param obdfilter.*.grant_* 
obdfilter.testfs-OST0000.grant_compat_disable=0
obdfilter.testfs-OST0000.grant_precreate=0
obdfilter.testfs-OST0000.grant_ratio=19%
obdfilter.testfs-OST0001.grant_compat_disable=0
obdfilter.testfs-OST0001.grant_precreate=0
obdfilter.testfs-OST0001.grant_ratio=19%
obdfilter.testfs-OST0002.grant_compat_disable=0
obdfilter.testfs-OST0002.grant_precreate=0
obdfilter.testfs-OST0002.grant_ratio=19%
# lctl get_param obdfilter.*.tot*
obdfilter.testfs-OST0000.tot_dirty=0
obdfilter.testfs-OST0000.tot_granted=0
obdfilter.testfs-OST0000.tot_pending=0
obdfilter.testfs-OST0001.tot_dirty=0
obdfilter.testfs-OST0001.tot_granted=0
obdfilter.testfs-OST0001.tot_pending=0
obdfilter.testfs-OST0002.tot_dirty=0
obdfilter.testfs-OST0002.tot_granted=0
obdfilter.testfs-OST0002.tot_pending=0
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Sep 15 11:23:57 sookie-gig kernel: Lustre: 15270:0:(ofd_obd.c:1168:ofd_create()) testfs-OST0002: failed to acquire grant space to precreate 0 objects
Sep 15 11:23:57 sookie-gig kernel: Lustre: 15270:0:(ofd_obd.c:1168:ofd_create()) Skipped 5756702 previous similar messages
Sep 15 11:23:57 sookie-gig kernel: Lustre: 15270:0:(ofd_obd.c:1192:ofd_create()) testfs-OST0001: unable to precreate [0x0:0x101:0x0]: rc = -28
Sep 15 11:23:57 sookie-gig kernel: Lustre: 15270:0:(ofd_obd.c:1192:ofd_create()) Skipped 5756716 previous similar messages
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;It&apos;s still going hours later...  I had started with 256MB OSTs, but hit this problem immediately.  Increasing to 1GB OSTs allowed some testing to pass, but failed on a second test (after reboot and such)&lt;/p&gt;</description>
                <environment>Single node client+MDS+OSS, with 3x 1GB OSTs, 256MB MDT, x86_64, 4GB RAM, OSTFSTYPE=zfs, USE_OFD=yes</environment>
        <key id="15990">LU-1947</key>
            <summary>OST ZFS grant shortage on precreate</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="adilger">Andreas Dilger</reporter>
                        <labels>
                            <label>zfs</label>
                    </labels>
                <created>Sat, 15 Sep 2012 13:51:56 +0000</created>
                <updated>Tue, 22 Apr 2014 18:21:33 +0000</updated>
                            <resolved>Tue, 22 Apr 2014 18:21:33 +0000</resolved>
                                    <version>Lustre 2.4.0</version>
                                    <fixVersion>Lustre 2.6.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                            <comments>
                            <comment id="44951" author="adilger" created="Sat, 15 Sep 2012 14:20:19 +0000"  >&lt;p&gt;It may be that part of the problem is the ZFS pool does not get recreated, even after reboot of the filesystem.&lt;/p&gt;</comment>
                            <comment id="46569" author="johann" created="Mon, 15 Oct 2012 07:51:49 +0000"  >&lt;blockquote&gt;
&lt;p&gt;obdfilter.testfs-OST0000.grant_precreate=0&lt;/p&gt;&lt;/blockquote&gt;

&lt;p&gt;Grant for precreate is allocated in -&amp;gt;ldo_recovery_complete which wasn&apos;t called for OFD in master. I have fixed this in  &lt;a href=&quot;http://review.whamcloud.com/4182&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/4182&lt;/a&gt;, so i think it is worth trying again now, this bug might be fixed already (on master, at least).&lt;/p&gt;</comment>
                            <comment id="62064" author="adilger" created="Wed, 10 Jul 2013 22:52:32 +0000"  >&lt;p&gt;This might have also been fixed by the recent landing of &lt;a href=&quot;http://review.whamcloud.com/6546&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/6546&lt;/a&gt; from &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-3421&quot; title=&quot;(ost_handler.c:1762:ost_blocking_ast()) Error -2 syncing data on lock cancel causes first ENOSPC client issues then MDS server locks up&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-3421&quot;&gt;&lt;del&gt;LU-3421&lt;/del&gt;&lt;/a&gt;.&lt;/p&gt;</comment>
                            <comment id="78273" author="jlevi" created="Mon, 3 Mar 2014 21:56:58 +0000"  >&lt;p&gt;Can this ticket be closed?&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="19231">LU-3421</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzvcuv:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>5675</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>