<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:14:29 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-8081] Lustre 2.8.0, osd_trans_exec_op(), osd_trans_dump_creds()</title>
                <link>https://jira.whamcloud.com/browse/LU-8081</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;On the OSS&apos;s, a few times a day, I see in the logs:&lt;/p&gt;

&lt;p&gt;Apr 27 16:00:57 oss1 kernel: Lustre: 41839:0:(osd_handler.c:1265:osd_trans_dump_creds())   create: 0/0/0, destroy: 0/0/0&lt;br/&gt;
Apr 27 16:00:57 oss1 kernel: Lustre: 41839:0:(osd_handler.c:1272:osd_trans_dump_creds())   attr_set: 1/1/0, xattr_set: 2/15/0&lt;br/&gt;
Apr 27 16:00:57 oss1 kernel: Lustre: 41839:0:(osd_handler.c:1282:osd_trans_dump_creds())   write: 2/12/0, punch: 1/4/0, quota 4/4/0&lt;br/&gt;
Apr 27 16:00:57 oss1 kernel: Lustre: 41839:0:(osd_handler.c:1289:osd_trans_dump_creds())   insert: 0/0/0, delete: 0/0/0&lt;br/&gt;
Apr 27 16:00:57 oss1 kernel: Lustre: 41839:0:(osd_handler.c:1296:osd_trans_dump_creds())   ref_add: 0/0/0, ref_del: 0/0/0&lt;br/&gt;
Apr 27 16:00:57 oss1 kernel: LustreError: 41839:0:(osd_internal.h:1073:osd_trans_exec_op()) blizzard-OST0000-osd: op = 7, rb = 7&lt;/p&gt;

&lt;p&gt;Not sure what triggers it for sure.  We have a mixed workload of mpi, and serial type jobs.&lt;/p&gt;

&lt;p&gt;The values are always the same: write 2/10/0, punch 1/4/0 , etc.  A limit I imagine.  It almost seems like debug type logs that are not super important.  &lt;/p&gt;</description>
                <environment>EL 6.7, x86_64, mellanox FDR, lustre 2.8.0 client and server.  1 combined mds/mdt, and 7 combined oss/ost.</environment>
        <key id="36454">LU-8081</key>
            <summary>Lustre 2.8.0, osd_trans_exec_op(), osd_trans_dump_creds()</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="bfaccini">Bruno Faccini</assignee>
                                    <reporter username="cbc">christopher coffey</reporter>
                        <labels>
                    </labels>
                <created>Thu, 28 Apr 2016 16:51:29 +0000</created>
                <updated>Tue, 18 Oct 2016 21:13:44 +0000</updated>
                            <resolved>Sat, 10 Sep 2016 04:18:28 +0000</resolved>
                                    <version>Lustre 2.8.0</version>
                                    <fixVersion>Lustre 2.9.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>6</watches>
                                                                            <comments>
                            <comment id="150483" author="pjones" created="Thu, 28 Apr 2016 17:38:55 +0000"  >&lt;p&gt;Bruno&lt;/p&gt;

&lt;p&gt;Could you please advise?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="150485" author="adilger" created="Thu, 28 Apr 2016 18:16:46 +0000"  >&lt;p&gt;Hi Christopher,&lt;br/&gt;
thanks for filing this report.  In addition to the above osd_trans_dump_creds() message, is there a stack trace or other debugging messages in the logs at the same time?&lt;/p&gt;

&lt;p&gt;You are correct that this is partly a debugging message, but it indicates that the internal ldiskfs transaction accounting is incorrect, and in some rare cases this might result in a failure in lower layers of the code.  In our regression testing this would trigger an immediate assertion, but for production use the Lustre-level assertion is disabled to allow the system to continue running in the vast majority of cases.&lt;/p&gt;

&lt;p&gt;If you are able to take a short outage at some point in the future, you could set &lt;tt&gt;lctl set_param osd-ldiskfs.track_declares_assert=1&lt;/tt&gt; on the OSS nodes so that the failure triggers an assertion and dumps the stack of the server process running this transaction.&lt;/p&gt;

&lt;p&gt;*&lt;b&gt;Please Note&lt;/b&gt;* that &lt;tt&gt;track_declares_assert=1&lt;/tt&gt; will require that the OSS be rebooted after the assertion has triggered, and should only be enabled if you are interested to debug this problem further, and rebooting the OSS is acceptable for your environment.  If not, then we can try and find some other way to debug this problem.&lt;/p&gt;</comment>
                            <comment id="150487" author="adilger" created="Thu, 28 Apr 2016 18:21:03 +0000"  >&lt;p&gt;Another question that comes to mind is what the journal size is on your OSTs?  This can be checked with &lt;tt&gt;dumpe2fs -h /dev/ostXXX | grep &quot;Journal size&quot;&lt;/tt&gt; on each of the OST devices.  Typically they should be 400MB in size, allowing individual transactions up to 100MB (25000 blocks), so it is a bit surprising that this debugging message is being printed with only a few tens of blocks reserved in this transaction.&lt;/p&gt;</comment>
                            <comment id="150488" author="bzzz" created="Thu, 28 Apr 2016 18:23:35 +0000"  >&lt;p&gt;Hmm, actually it looks like credits were consumed outside of our tracking mechanism:&lt;br/&gt;
we reserved 12 blocks for write, but at write we observed that &amp;lt;12 left in the transaction.&lt;br/&gt;
at the same time none of the previous operations consumed more than declared.&lt;/p&gt;</comment>
                            <comment id="150499" author="gerrit" created="Thu, 28 Apr 2016 19:08:47 +0000"  >&lt;p&gt;Andreas Dilger (andreas.dilger@intel.com) uploaded a new patch: &lt;a href=&quot;http://review.whamcloud.com/19865&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/19865&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-8081&quot; title=&quot;Lustre 2.8.0, osd_trans_exec_op(), osd_trans_dump_creds()&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-8081&quot;&gt;&lt;del&gt;LU-8081&lt;/del&gt;&lt;/a&gt; osd-ldiskfs: improve transaction debug message&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 8219dd1e5da64670ddcf84e7e779e315169e3b63&lt;/p&gt;</comment>
                            <comment id="165563" author="gerrit" created="Sat, 10 Sep 2016 03:23:41 +0000"  >&lt;p&gt;Oleg Drokin (oleg.drokin@intel.com) merged in patch &lt;a href=&quot;http://review.whamcloud.com/19865/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/19865/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-8081&quot; title=&quot;Lustre 2.8.0, osd_trans_exec_op(), osd_trans_dump_creds()&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-8081&quot;&gt;&lt;del&gt;LU-8081&lt;/del&gt;&lt;/a&gt; osd-ldiskfs: improve transaction debug message&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 0cf638409812ce2533043908073b68698fb46c84&lt;/p&gt;</comment>
                            <comment id="165577" author="pjones" created="Sat, 10 Sep 2016 04:18:28 +0000"  >&lt;p&gt;Landed for 2.9&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzy9on:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>