<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:00:31 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-6471] Unexpected Lustre Client LBUG in llog_write()</title>
                <link>https://jira.whamcloud.com/browse/LU-6471</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Some IEEL2.2.x and Lustre 2.5 installations have triggered this unexpected LBUG on Lustre Clients consuming ChangeLogs :&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;&amp;lt;3&amp;gt;LustreError: 31536:0:(llog_cat.c:192:llog_cat_id2handle()) store2-MDT0000-mdc-ffff88206515bc00: error opening log id 0x2def:1:0: rc = -2
&amp;lt;3&amp;gt;LustreError: 31536:0:(llog_cat.c:565:llog_cat_process_cb()) store2-MDT0000-mdc-ffff88206515bc00: cannot find handle for llog 0x2def:1: -2
&amp;lt;0&amp;gt;LustreError: 31536:0:(llog.c:861:llog_write()) ASSERTION( loghandle-&amp;gt;lgh_obj != ((void *)0) ) failed:
&amp;lt;0&amp;gt;LustreError: 31536:0:(llog.c:861:llog_write()) LBUG
&amp;lt;4&amp;gt;Pid: 31536, comm: mdc_clg_send_th
&amp;lt;4&amp;gt;
&amp;lt;4&amp;gt;Call Trace:
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0437895&amp;gt;] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0437e97&amp;gt;] lbug_with_loc+0x47/0xb0 [libcfs]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0553c52&amp;gt;] llog_write+0x352/0x420 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0553e0c&amp;gt;] llog_cancel_rec+0xbc/0x7c0 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0558bcc&amp;gt;] llog_cat_cleanup+0x9c/0x260 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0559fd8&amp;gt;] llog_cat_process_cb+0x588/0x630 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0554e6b&amp;gt;] llog_process_thread+0x95b/0xd70 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0559a50&amp;gt;] ? llog_cat_process_cb+0x0/0x630 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0556a67&amp;gt;] llog_process_or_fork+0x127/0x550 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0557f09&amp;gt;] llog_cat_process_or_fork+0x89/0x350 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8152aede&amp;gt;] ? mutex_lock+0x1e/0x50
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa094fe10&amp;gt;] ? changelog_kkuc_cb+0x0/0x250 [mdc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa05581e9&amp;gt;] llog_cat_process+0x19/0x20 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa09541eb&amp;gt;] mdc_changelog_send_thread+0x4cb/0x810 [mdc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0953d20&amp;gt;] ? mdc_changelog_send_thread+0x0/0x810 [mdc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8109e66e&amp;gt;] kthread+0x9e/0xc0
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8100c20a&amp;gt;] child_rip+0xa/0x20
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8109e5d0&amp;gt;] ? kthread+0x0/0xc0
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8100c200&amp;gt;] ? child_rip+0x0/0x20
&amp;lt;4&amp;gt;
&amp;lt;0&amp;gt;Kernel panic - not syncing: LBUG
&amp;lt;4&amp;gt;Pid: 31536, comm: mdc_clg_send_th Not tainted 2.6.32-504.12.2.el6.x86_64 #1
&amp;lt;4&amp;gt;Call Trace:
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8152933c&amp;gt;] ? panic+0xa7/0x16f
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0437eeb&amp;gt;] ? lbug_with_loc+0x9b/0xb0 [libcfs]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0553c52&amp;gt;] ? llog_write+0x352/0x420 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0553e0c&amp;gt;] ? llog_cancel_rec+0xbc/0x7c0 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0558bcc&amp;gt;] ? llog_cat_cleanup+0x9c/0x260 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0559fd8&amp;gt;] ? llog_cat_process_cb+0x588/0x630 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0554e6b&amp;gt;] ? llog_process_thread+0x95b/0xd70 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0559a50&amp;gt;] ? llog_cat_process_cb+0x0/0x630 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0556a67&amp;gt;] ? llog_process_or_fork+0x127/0x550 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0557f09&amp;gt;] ? llog_cat_process_or_fork+0x89/0x350 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8152aede&amp;gt;] ? mutex_lock+0x1e/0x50
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa094fe10&amp;gt;] ? changelog_kkuc_cb+0x0/0x250 [mdc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa05581e9&amp;gt;] ? llog_cat_process+0x19/0x20 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa09541eb&amp;gt;] ? mdc_changelog_send_thread+0x4cb/0x810 [mdc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0953d20&amp;gt;] ? mdc_changelog_send_thread+0x0/0x810 [mdc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8109e66e&amp;gt;] ? kthread+0x9e/0xc0
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8100c20a&amp;gt;] ? child_rip+0xa/0x20
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8109e5d0&amp;gt;] ? kthread+0x0/0xc0
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8100c200&amp;gt;] ? child_rip+0x0/0x20
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;The problem seems to have been introduced by patch for &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-5195&quot; title=&quot;HSM: mdt_hsm_cdt_actions.c:104:cdt_llog_process() failed to process HSM_ACTIONS llog&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-5195&quot;&gt;&lt;del&gt;LU-5195&lt;/del&gt;&lt;/a&gt;, which does not handle the case where code is running on a Lustre Client (ie, cathandle-&amp;gt;lgh_obj == NULL) and is only a ChangeLog/LLOG consumer/replicator.&lt;/p&gt;

&lt;p&gt;This problem had not been seen before since concerned change is only being executed when triggering some catalog inconsistency. Looks like master is also impacted, will push a patch soon.&lt;/p&gt;</description>
                <environment></environment>
        <key id="29540">LU-6471</key>
            <summary>Unexpected Lustre Client LBUG in llog_write()</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="bfaccini">Bruno Faccini</assignee>
                                    <reporter username="bfaccini">Bruno Faccini</reporter>
                        <labels>
                    </labels>
                <created>Fri, 17 Apr 2015 09:25:06 +0000</created>
                <updated>Thu, 14 Jun 2018 21:41:37 +0000</updated>
                            <resolved>Mon, 25 May 2015 23:09:15 +0000</resolved>
                                    <version>Lustre 2.5.3</version>
                                    <fixVersion>Lustre 2.8.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>13</watches>
                                                                            <comments>
                            <comment id="112297" author="gerrit" created="Fri, 17 Apr 2015 09:46:34 +0000"  >&lt;p&gt;Faccini Bruno (bruno.faccini@intel.com) uploaded a new patch: &lt;a href=&quot;http://review.whamcloud.com/14489&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/14489&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-6471&quot; title=&quot;Unexpected Lustre Client LBUG in llog_write()&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-6471&quot;&gt;&lt;del&gt;LU-6471&lt;/del&gt;&lt;/a&gt; obdclass: fix llog_cat_cleanup() usage on Client&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 9219f1ba884874f5f25df7499a13956c4eb65bea&lt;/p&gt;</comment>
                            <comment id="112299" author="apercher" created="Fri, 17 Apr 2015 14:27:21 +0000"  >&lt;p&gt;We meet the same issue on T100 system with lustre 2.5.3.90 on the robinhood node&lt;/p&gt;</comment>
                            <comment id="112307" author="bfaccini" created="Fri, 17 Apr 2015 16:41:50 +0000"  >&lt;p&gt;Having a look to old (ie, running with Lustre version not including patch from &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-5195&quot; title=&quot;HSM: mdt_hsm_cdt_actions.c:104:cdt_llog_process() failed to process HSM_ACTIONS llog&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-5195&quot;&gt;&lt;del&gt;LU-5195&lt;/del&gt;&lt;/a&gt;) ChangeLogs consumer Client&apos;s syslogs, I have found they already contained the 2 msgs preceding the LBUG now :&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;&amp;lt;3&amp;gt;LustreError: 31536:0:(llog_cat.c:192:llog_cat_id2handle()) store2-MDT0000-mdc-ffff88206515bc00: error opening log id 0x2def:1:0: rc = -2
&amp;lt;3&amp;gt;LustreError: 31536:0:(llog_cat.c:565:llog_cat_process_cb()) store2-MDT0000-mdc-ffff88206515bc00: cannot find handle for llog 0x2def:1: -2
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;This means that since Servers/MDS also did not run with &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-5195&quot; title=&quot;HSM: mdt_hsm_cdt_actions.c:104:cdt_llog_process() failed to process HSM_ACTIONS llog&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-5195&quot;&gt;&lt;del&gt;LU-5195&lt;/del&gt;&lt;/a&gt; patch in at this time, this fix is not the responsible of the LBUG by itself, but that some kind of a race possibly exists to allow a Client to work with an out-of-sync Catalog content ... This is confirmed by the lustre debug trace logs, extracted fro crash-dump upon LBUG, where it appears that the reported missing plan log has been available and used a few times before.&lt;br/&gt;
So finally, this may be a new consequence of the fact that the plain log destroy and catlog index record deletion are not atomic, already addressed for Server side (only!) in &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-5195&quot; title=&quot;HSM: mdt_hsm_cdt_actions.c:104:cdt_llog_process() failed to process HSM_ACTIONS llog&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-5195&quot;&gt;&lt;del&gt;LU-5195&lt;/del&gt;&lt;/a&gt; patch.&lt;/p&gt;


</comment>
                            <comment id="113207" author="gerrit" created="Thu, 23 Apr 2015 14:04:06 +0000"  >&lt;p&gt;Oleg Drokin (oleg.drokin@intel.com) merged in patch &lt;a href=&quot;http://review.whamcloud.com/14489/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/14489/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-6471&quot; title=&quot;Unexpected Lustre Client LBUG in llog_write()&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-6471&quot;&gt;&lt;del&gt;LU-6471&lt;/del&gt;&lt;/a&gt; obdclass: fix llog_cat_cleanup() usage on Client&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 485f3ba8743365c7fb5823893a7eb45030cdffb5&lt;/p&gt;</comment>
                            <comment id="116350" author="pjones" created="Mon, 25 May 2015 23:09:15 +0000"  >&lt;p&gt;Landed for 2.8&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="25154">LU-5195</issuekey>
        </issuelink>
                            </outwardlinks>
                                                                <inwardlinks description="is related to">
                                                        </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzxayv:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>