<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:03:34 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-81] Some JBD2 journaling deadlock at BULL</title>
                <link>https://jira.whamcloud.com/browse/LU-81</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;BULL reports at the bugzilla that there are some possible deadlock issues on MDS with jbd2 (just run away transactions?):&lt;/p&gt;

&lt;p&gt;At CEA, they have encountered several occurrences of the same scenario where all Lustre activity is&lt;br/&gt;
hung. Each time they live-debug the problem, they end up on the MDS node where all Lustre&lt;br/&gt;
operations appear to be frozen.&lt;/p&gt;

&lt;p&gt;As a consequence, the MDS has to be rebooted and the Lustre layer has to be restarted on it with recovery.&lt;/p&gt;

&lt;p&gt;The MDS threads which appear to be strongly involved in the frozen situation have the following&lt;br/&gt;
stack traces, taken from one of the forced crash-dumps:&lt;br/&gt;
==================================&lt;/p&gt;

&lt;p&gt;About 234 tasks share the following stack:&lt;/p&gt;

&lt;p&gt; PID 5250 mdt_rdpg_143&lt;br/&gt;
   schedule()&lt;br/&gt;
   start_this_handle()&lt;br/&gt;
   jbd2_journal_start()&lt;br/&gt;
   ldiskfs_journal_start_sb()&lt;br/&gt;
   osd_trans_start()&lt;br/&gt;
   mdd_trans_start()&lt;br/&gt;
   cml_close()&lt;/p&gt;

&lt;p&gt;One is with:&lt;/p&gt;

&lt;p&gt; Pid: 4990 mdt_395&lt;br/&gt;
   schedule()&lt;br/&gt;
   jbd2_log_wait_commit()&lt;br/&gt;
   jbd2_journal_stop()&lt;br/&gt;
   __ldiskfs_journal_stop()&lt;br/&gt;
   osd_trans_stop()&lt;br/&gt;
   mdd_trans_stop()&lt;br/&gt;
   mdd_attr_set()&lt;br/&gt;
   cml_attr_set() &lt;/p&gt;

&lt;p&gt;And another with:&lt;/p&gt;

&lt;p&gt; Pid: 4534 &quot;jbd2/sdd-8&quot;&lt;br/&gt;
   schedule()&lt;br/&gt;
   jbd2_journal_commit_transaction()&lt;br/&gt;
   kjournald2()&lt;br/&gt;
   kthread()&lt;br/&gt;
   kernel_thread()&lt;/p&gt;

&lt;p&gt;==================================&lt;/p&gt;

&lt;p&gt;Analyzing the crash dump shows that the task hung in jbd2_journal_commit_transaction() has been in this&lt;br/&gt;
state for a very long time.&lt;/p&gt;

&lt;p&gt;This problem looks like bug 16667, but unfortunately it is not applicable &apos;as is&apos; as it dates back&lt;br/&gt;
to 1.6. Here it seems there is a race or deadlock in Lustre/JBD2 layers.&lt;br/&gt;
As a workaround the customer deactivated the ChangeLog feature, and since then the problem has not&lt;br/&gt;
reoccurred. Sadly, ChangeLogs are required by HSM, so this workaround cannot last...&lt;/p&gt;

&lt;p&gt;Can you see the reason for this deadlock?&lt;/p&gt;

&lt;p&gt;I should point out that this bug is critical, as it blocks normal cluster operation (i.e. with HSM).&lt;/p&gt;
</description>
                <environment></environment>
        <key id="10359">LU-81</key>
            <summary>Some JBD2 journaling deadlock at BULL</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="2" iconUrl="https://jira.whamcloud.com/images/icons/priorities/critical.svg">Critical</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="niu">Niu Yawei</assignee>
                                    <reporter username="green">Oleg Drokin</reporter>
                        <labels>
                    </labels>
                <created>Wed, 9 Feb 2011 13:10:27 +0000</created>
                <updated>Fri, 24 Nov 2017 16:00:27 +0000</updated>
                            <resolved>Thu, 29 Mar 2012 09:50:28 +0000</resolved>
                                    <version>Lustre 2.0.0</version>
                                    <fixVersion>Lustre 2.2.0</fixVersion>
                    <fixVersion>Lustre 2.1.2</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>10</watches>
                                                                            <comments>
                            <comment id="10663" author="bzzz" created="Tue, 15 Feb 2011 23:03:35 +0000"  >&lt;p&gt;Is it possible to reproduce the issue and grab a crash image, so that we have access to the stacks with the offsets? Or did the customer perhaps save the crash dump?&lt;/p&gt;

&lt;p&gt;Also, was the changelog consumer (HSM userspace agent) running? It&apos;s important to understand whether the MDS was just generating records or whether records were being cancelled as well.&lt;/p&gt;</comment>
                            <comment id="10664" author="pjones" created="Wed, 16 Feb 2011 07:18:52 +0000"  >&lt;p&gt;I have added Bull to this ticket in the hope that someone there can answer Alex&apos;s question and help move this issue forward&lt;/p&gt;</comment>
                            <comment id="10665" author="sebastien.buisson" created="Wed, 16 Feb 2011 07:59:20 +0000"  >&lt;p&gt;Hi Alex, Peter,&lt;/p&gt;

&lt;p&gt;Thanks for opening this Jira ticket.&lt;/p&gt;

&lt;p&gt;I think CEA saved the crash dumps, but as the cluster is classified it is not possible to get them out. So please tell us precisely what you need, and I will have our on-site Support team send it (foreach bt? bt -a? ...).&lt;/p&gt;

&lt;p&gt;I do not know if HSM userspace agent was running, I will forward this question.&lt;/p&gt;

&lt;p&gt;Cheers,&lt;br/&gt;
Sebastien.&lt;/p&gt;</comment>
                            <comment id="10678" author="bzzz" created="Wed, 16 Feb 2011 23:49:21 +0000"  >&lt;p&gt;Hello Sebastien,&lt;/p&gt;

&lt;p&gt;I think the very first info we need is detailed stacks for all the processes.&lt;/p&gt;</comment>
                            <comment id="11090" author="pjones" created="Tue, 15 Mar 2011 05:48:17 +0000"  >&lt;p&gt;Update from Bull is that the onsite support team are working on getting this information&lt;/p&gt;</comment>
                            <comment id="11116" author="spitzcor" created="Tue, 15 Mar 2011 14:02:04 +0000"  >&lt;p&gt;Given the information presented here, I was reminded of Lustre Bug 21406.  Perhaps that ticket could be inspected to see if the conditions are similar.  Further, implementing the workaround from attachment 28496 (&lt;a href=&quot;https://bugzilla.lustre.org/attachment.cgi?id=28496&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://bugzilla.lustre.org/attachment.cgi?id=28496&lt;/a&gt;), which did not land to 2.x, may be a useful experiment if the problem can be easily reproduced.  However, I also remember that 21406 was associated with OST threads, not MDT threads, so perhaps it doesn&apos;t apply.&lt;/p&gt;</comment>
                            <comment id="11117" author="pjones" created="Tue, 15 Mar 2011 14:04:24 +0000"  >&lt;p&gt;Alex&lt;/p&gt;

&lt;p&gt;Cray observed on the 2.1 call that this seems somewhat similar to bz 21760. Does this seem plausible from the evidence available?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="11118" author="pjones" created="Tue, 15 Mar 2011 14:06:05 +0000"  >&lt;p&gt;Johann&lt;/p&gt;

&lt;p&gt;You were involved in 21760. Are you able to comment on this theory? If so, what evidence should the on-site Bull support staff look for to prove\disprove this theory? Is there a workaround\fix that could be tried out to see if it prevents this problem?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="11121" author="spitzcor" created="Tue, 15 Mar 2011 14:25:44 +0000"  >&lt;p&gt;Oops, did I say 21760?  I meant 21406, but I also missed that this issue was MDT related.  See my earlier (edited) comment.  Sorry if I caused any misdirection.&lt;/p&gt;</comment>
                            <comment id="11123" author="pjones" created="Tue, 15 Mar 2011 17:16:12 +0000"  >&lt;p&gt;Heh. Actually, my notes said 21706 so I guess the wrong transposition &lt;img class=&quot;emoticon&quot; src=&quot;https://jira.whamcloud.com/images/icons/emoticons/smile.png&quot; height=&quot;16&quot; width=&quot;16&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt; Of course, Johann was still involved in that one too so my question still stands with the corrected id...&lt;/p&gt;</comment>
                            <comment id="11130" author="johann" created="Wed, 16 Mar 2011 01:54:30 +0000"  >&lt;p&gt;I don&apos;t think that bugzilla ticket 21706 is related to this issue.&lt;br/&gt;
As Cory noted, 21706 is a problem with OSS read cache whereas this bug is a MDS deadlock.&lt;/p&gt;

&lt;p&gt;That said, I have noticed that the jbd2-commit-timer-no-jiffies-rounding.diff patch&lt;br/&gt;
is missing from the RHEL6 patch series. The round_jiffies() used to cause significant&lt;br/&gt;
delays in transaction commit, see bugzilla ticket 19321. I am not totally sure this&lt;br/&gt;
can fix this bug, but it is worth adding the missing patch and trying again IMO.&lt;/p&gt;

&lt;p&gt;HTH&lt;/p&gt;</comment>
                            <comment id="11340" author="pjones" created="Thu, 24 Mar 2011 11:49:20 +0000"  >&lt;p&gt;I think that the patch Johann mentions is &lt;a href=&quot;http://review.whamcloud.com/#change,358&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#change,358&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="12035" author="pjones" created="Mon, 4 Apr 2011 07:02:08 +0000"  >&lt;p&gt;Any word back from CEA as to whether this issue still manifests itself with the missing patch applied?&lt;/p&gt;</comment>
                            <comment id="12038" author="sebastien.buisson" created="Mon, 4 Apr 2011 07:10:05 +0000"  >&lt;p&gt;Hi Peter,&lt;/p&gt;

&lt;p&gt;Still no news from CEA on this. At least we will have more information on Thursday.&lt;/p&gt;

&lt;p&gt;Cheers,&lt;br/&gt;
Sebastien.&lt;/p&gt;</comment>
                            <comment id="12360" author="sebastien.buisson" created="Wed, 6 Apr 2011 06:57:30 +0000"  >&lt;p&gt;Hi,&lt;/p&gt;

&lt;p&gt;Bad news from CEA. They reactivated Changelogs yesterday evening, and this bug appeared this afternoon.&lt;br/&gt;
They are currently running with a patched kernel including the patch lustre-jbd2-commit-timer-no-jiffies-rounding-rhel6.patch.&lt;/p&gt;

&lt;p&gt;Any &apos;new&apos; ideas on how to tackle this issue?&lt;/p&gt;

&lt;p&gt;Sebastien.&lt;/p&gt;</comment>
                            <comment id="12375" author="johann" created="Wed, 6 Apr 2011 07:48:55 +0000"  >&lt;p&gt;Not without looking at the crash dump.&lt;br/&gt;
Have you tried to reproduce the problem on one of your internal clusters?&lt;/p&gt;</comment>
                            <comment id="12377" author="sebastien.buisson" created="Wed, 6 Apr 2011 07:56:20 +0000"  >&lt;p&gt;Hi Johann,&lt;/p&gt;

&lt;p&gt;&amp;gt; Not without looking at the crash dump.&lt;br/&gt;
Unfortunately the crash dump cannot be taken out of CEA. What crash commands would you like Bruno to launch?&lt;/p&gt;

&lt;p&gt;&amp;gt; Have you tried to reproduce the problem on one of your internal clusters?&lt;br/&gt;
If Changelogs are activated by default, then yes. Indeed, CEA hits this bug as soon as they activate the Changelogs. But internally we have never seen it.&lt;/p&gt;

&lt;p&gt;Cheers,&lt;br/&gt;
Sebastien.&lt;/p&gt;</comment>
                            <comment id="12378" author="johann" created="Wed, 6 Apr 2011 08:10:13 +0000"  >&lt;p&gt;Hi Sebastien,&lt;/p&gt;

&lt;p&gt;No, changelogs are not activated by default, you need to register a changelog user to enable it.&lt;br/&gt;
As for the crash dump, we need to look at the jbd structures to understand why everyone thinks&lt;br/&gt;
that the transaction is committing while the jbd commit thread is sitting idle.&lt;/p&gt;</comment>
                            <comment id="12379" author="bzzz" created="Wed, 6 Apr 2011 08:36:19 +0000"  >&lt;p&gt;&amp;gt; Unfortunately the crash dump cannot be taken out of CEA. What crash commands would you like Bruno to launch?&lt;/p&gt;


&lt;p&gt;list of all the threads with backtraces would be a good start.&lt;/p&gt;</comment>
                            <comment id="12497" author="pjones" created="Thu, 7 Apr 2011 06:48:31 +0000"  >&lt;p&gt;update from Bull &quot;problem reoccurred yesterday, after less than 24 hours with ChangeLogs activated.&lt;br/&gt;
 CEA will be able to send &apos;foreach bt&apos; from live debug (no dump available)&quot;&lt;/p&gt;</comment>
                            <comment id="13180" author="pjones" created="Thu, 21 Apr 2011 07:11:20 +0000"  >&lt;p&gt;As per Bull, CEA do not expect to be able to gather this data until the end of May.&lt;/p&gt;</comment>
                            <comment id="15342" author="sebastien.buisson" created="Wed, 1 Jun 2011 00:05:33 +0000"  >&lt;p&gt;Hi,&lt;/p&gt;

&lt;p&gt;Here is the long-awaited &apos;foreach bt&apos; (in fact the Alt+SysRq+T console output taken live during one occurrence of the problem).&lt;/p&gt;

&lt;p&gt;Cheers,&lt;br/&gt;
Sebastien.&lt;/p&gt;</comment>
                            <comment id="16743" author="bzzz" created="Wed, 22 Jun 2011 07:05:30 +0000"  >&lt;p&gt;PID: 26299  TASK: ffff88047d851620  CPU: 28  COMMAND: &quot;llog_process_th&quot;&lt;br/&gt;
 #0 [ffff880998a65900] schedule at ffffffff81452851&lt;br/&gt;
 #1 [ffff880998a659c8] start_this_handle at ffffffffa08ec0d7&lt;br/&gt;
 #2 [ffff880998a65a88] jbd2_journal_start at ffffffffa08ec520&lt;br/&gt;
 #3 [ffff880998a65ad8] ldiskfs_journal_start_sb at ffffffffa0936fb8&lt;br/&gt;
 #4 [ffff880998a65ae8] fsfilt_ldiskfs_write_record at ffffffffa098a0fc&lt;br/&gt;
 #5 [ffff880998a65b68] llog_lvfs_write_blob at ffffffffa050917c&lt;br/&gt;
 #6 [ffff880998a65c18] llog_lvfs_write_rec at ffffffffa050a722&lt;br/&gt;
 #7 [ffff880998a65cf8] llog_cancel_rec at ffffffffa05010a4&lt;br/&gt;
 #8 [ffff880998a65d58] llog_cat_cancel_records at ffffffffa0505de2&lt;br/&gt;
 #9 [ffff880998a65de8] llog_changelog_cancel_cb at ffffffffa099ec12&lt;br/&gt;
#10 [ffff880998a65e68] llog_process_thread at ffffffffa0503573&lt;br/&gt;
#11 [ffff880998a65f48] kernel_thread at ffffffff8100d1aa&lt;/p&gt;

&lt;p&gt;PID: 22091  TASK: ffff8808695bad90  CPU: 22  COMMAND: &quot;mdt_attr_101&quot;&lt;br/&gt;
 #0 [ffff8808695c1358] schedule at ffffffff81452851&lt;br/&gt;
 #1 [ffff8808695c1420] rwsem_down_failed_common at ffffffff81454cb5&lt;br/&gt;
 #2 [ffff8808695c1470] rwsem_down_read_failed at ffffffff81454e46&lt;br/&gt;
 #3 [ffff8808695c14b0] call_rwsem_down_read_failed at ffffffff81248024&lt;br/&gt;
 #4 [ffff8808695c1518] llog_cat_current_log.clone.0 at ffffffffa05068a5&lt;br/&gt;
 #5 [ffff8808695c15f8] llog_cat_add_rec at ffffffffa050785a&lt;br/&gt;
 #6 [ffff8808695c1678] llog_obd_origin_add at ffffffffa050e196&lt;br/&gt;
 #7 [ffff8808695c16d8] llog_add at ffffffffa050e371&lt;br/&gt;
 #8 [ffff8808695c1748] mdd_changelog_llog_write at ffffffffa09dc905&lt;br/&gt;
 #9 [ffff8808695c17c8] mdd_changelog_data_store at ffffffffa09b3c1a&lt;br/&gt;
#10 [ffff8808695c1858] mdd_attr_set at ffffffffa09bc0a2&lt;br/&gt;
#11 [ffff8808695c1968] cml_attr_set at ffffffffa0a9775f&lt;br/&gt;
#12 [ffff8808695c19c8] mdt_attr_set at ffffffffa0a256e4&lt;br/&gt;
#13 [ffff8808695c1a58] mdt_reint_setattr at ffffffffa0a25e36&lt;br/&gt;
#14 [ffff8808695c1ae8] mdt_reint_rec at ffffffffa0a2167f&lt;br/&gt;
#15 [ffff8808695c1b38] mdt_reint_internal at ffffffffa0a18a34&lt;br/&gt;
#16 [ffff8808695c1bc8] mdt_reint at ffffffffa0a18d9c&lt;br/&gt;
#17 [ffff8808695c1c18] mdt_handle_common at ffffffffa0a0da45&lt;br/&gt;
#18 [ffff8808695c1c98] mdt_regular_handle at ffffffffa0a0ea55&lt;br/&gt;
#19 [ffff8808695c1ca8] ptlrpc_server_handle_request at ffffffffa0648b11&lt;br/&gt;
#20 [ffff8808695c1de8] ptlrpc_main at ffffffffa0649f0a&lt;br/&gt;
#21 [ffff8808695c1f48] kernel_thread at ffffffff8100d1aa&lt;/p&gt;


&lt;p&gt;This seems to be a known ordering issue between journal_start() and the catalog semaphore.&lt;br/&gt;
I&apos;m looking for the bug in bugzilla ...&lt;/p&gt;</comment>
                            <comment id="17604" author="sebastien.buisson" created="Tue, 12 Jul 2011 08:56:44 +0000"  >&lt;p&gt;Hi,&lt;/p&gt;

&lt;p&gt;Any news about this?&lt;/p&gt;

&lt;p&gt;TIA,&lt;br/&gt;
Sebastien.&lt;/p&gt;</comment>
                            <comment id="17951" author="bzzz" created="Mon, 18 Jul 2011 08:33:00 +0000"  >&lt;p&gt;Hello Sebastien,&lt;/p&gt;

&lt;p&gt;The fix I was thinking of is part of the work being done for the Sequoia project. We can&apos;t land it on master because of&lt;br/&gt;
the many related changes in the code. Probably someone else can work on a different workaround for the master branch.&lt;/p&gt;

&lt;p&gt;In general, the canceling code should follow the &quot;start transaction first, then do locking in llog&quot; rule.&lt;/p&gt;</comment>
                            <comment id="18627" author="louveta" created="Tue, 2 Aug 2011 10:50:19 +0000"  >&lt;p&gt;Hi,&lt;/p&gt;

&lt;p&gt;Just wanted to report that the hit frequency of this problem has increased recently. We are now at about 2 or 3 hangs a day. Is there anything we can provide to help?&lt;/p&gt;

&lt;p&gt;Alex.&lt;/p&gt;</comment>
                            <comment id="18715" author="dmoreno" created="Thu, 4 Aug 2011 08:33:38 +0000"  >&lt;p&gt;Hi,&lt;/p&gt;

&lt;p&gt;As the priority of this issue is rising, just another question: do you think we can deploy any kind of work-around (other than just &quot;deactivate changelog&quot;, of course)?&lt;/p&gt;

&lt;p&gt;Thanks,&lt;/p&gt;</comment>
                            <comment id="18718" author="pjones" created="Thu, 4 Aug 2011 10:31:00 +0000"  >&lt;p&gt;Niu&lt;/p&gt;

&lt;p&gt;Could you please look into a workaround\fix for this issue that will work with the existing master code?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="18771" author="patrick.valentin" created="Fri, 5 Aug 2011 13:32:26 +0000"  >&lt;p&gt;Hi,&lt;br/&gt;
I have prepared a compressed tarball file containing the kernel core dump and the kernel image, but the file size is about 460 MB and I just saw that attachments are limited to 10 MB. Do you have an FTP server on which I can put this file, or is there another way to provide large files?&lt;/p&gt;

&lt;p&gt;TIA&lt;br/&gt;
Cheers&lt;br/&gt;
Patrick&lt;/p&gt;</comment>
                            <comment id="18792" author="pjones" created="Sat, 6 Aug 2011 11:11:29 +0000"  >&lt;p&gt;Patrick&lt;/p&gt;

&lt;p&gt;I have sent you information on this&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="18923" author="patrick.valentin" created="Tue, 9 Aug 2011 08:13:19 +0000"  >&lt;p&gt;The tarball containing the kernel core dump and kernel image is available on the Whamcloud FTP server.&lt;br/&gt;
The first transfer was aborted, so the corresponding file &quot;dump.tar.gz&quot; should be removed.&lt;br/&gt;
The name of the correct file is &quot;20110809_1021_dump.tar.gz&quot;. Its size is 468396374 bytes and the &quot;sum&quot; command gives:&lt;/p&gt;
&lt;p&gt;# sum 20110809_1021_dump.tar.gz&lt;br/&gt;
    65153 457419&lt;/p&gt;

&lt;p&gt;It contains:&lt;br/&gt;
    the kernel core dump: vmcore&lt;br/&gt;
    the kernel image: vmlinux_2.6.32-71.14.1.el6.Bull.23&lt;/p&gt;


&lt;p&gt;It must be analysed on a 2.6.32 kernel using the corresponding crash command (5.0.0).&lt;/p&gt;

&lt;p&gt;In case of trouble, crash_5.1.7 (&quot;crash_5.1.7.tar.gz&quot;) is also available in the tarball. To use it, you have to set the following variables:&lt;br/&gt;
    export PATH=/your/home/local/crash/5.1.7/bin:${PATH}&lt;br/&gt;
    export CRASH_EXTENSIONS=/your/home/local/crash/5.1.7/usr/lib64/extensions&lt;/p&gt;

&lt;p&gt;Let me know if you need additional information&lt;/p&gt;

&lt;p&gt;Regards,&lt;br/&gt;
Patrick&lt;/p&gt;</comment>
                            <comment id="19330" author="niu" created="Wed, 17 Aug 2011 02:48:44 +0000"  >&lt;p&gt;Hi, Alex/Johann&lt;/p&gt;

&lt;p&gt;Given that it&apos;s difficult to port the Orion llog changes onto master, I think we could introduce a simple temporary workaround for master: add a rw lock to each mdd to protect the changelog. Each changelog add takes the read lock, and changelog cancelling has to hold the write lock. Since changelog cancelling only happens when a user issues the changelog clear command, I think the performance impact will be acceptable. &lt;/p&gt;

&lt;p&gt;Considering it&apos;s just a temporary workaround, I want to minimize the code changes as much as possible, and another advantage of this approach is that it doesn&apos;t affect other llog users except the changelog.&lt;/p&gt;

&lt;p&gt;If this workaround sounds ok to you, I&apos;ll make the patch soon. Thanks&lt;/p&gt;</comment>
                            <comment id="19332" author="dmoreno" created="Wed, 17 Aug 2011 03:52:13 +0000"  >&lt;p&gt;Hi Niu,&lt;/p&gt;

&lt;p&gt;From my point of view I think this is what we are looking for. Just a work-around based on a simple lock, with a moderate impact on performance.&lt;/p&gt;</comment>
                            <comment id="19372" author="johann" created="Wed, 17 Aug 2011 16:38:26 +0000"  >&lt;p&gt;Could we start a transaction earlier, as was done in bugzilla 18030?&lt;br/&gt;
If that is too complicated/intrusive, then I&apos;m fine with the brute-force locking.&lt;/p&gt;</comment>
                            <comment id="19377" author="niu" created="Thu, 18 Aug 2011 00:33:31 +0000"  >&lt;p&gt;Ok, I tried to make a patch which starts the transaction before catalog locking in llog_cat_cancel_records(). Thanks.&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;http://review.whamcloud.com/1260&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/1260&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="25449" author="hudson" created="Wed, 4 Jan 2012 08:25:16 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-master/./arch=i686,build_type=server,distro=el6,ib_stack=inkernel/397/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-master &#187; i686,server,el6,inkernel #397&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision 4ecb94432df9f0f0866538f64b27f006562eae13)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/master&amp;amp;a=commit&amp;amp;h=4ecb94432df9f0f0866538f64b27f006562eae13&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;4ecb94432df9f0f0866538f64b27f006562eae13&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="25450" author="hudson" created="Wed, 4 Jan 2012 08:29:15 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/red.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-master/./arch=x86_64,build_type=server,distro=el5,ib_stack=ofa/397/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-master &#187; x86_64,server,el5,ofa #397&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision 4ecb94432df9f0f0866538f64b27f006562eae13)&lt;/p&gt;

&lt;p&gt;     Result = FAILURE&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/master&amp;amp;a=commit&amp;amp;h=4ecb94432df9f0f0866538f64b27f006562eae13&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;4ecb94432df9f0f0866538f64b27f006562eae13&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="25451" author="hudson" created="Wed, 4 Jan 2012 08:30:04 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-master/./arch=x86_64,build_type=client,distro=el5,ib_stack=ofa/397/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-master &#187; x86_64,client,el5,ofa #397&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision 4ecb94432df9f0f0866538f64b27f006562eae13)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/master&amp;amp;a=commit&amp;amp;h=4ecb94432df9f0f0866538f64b27f006562eae13&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;4ecb94432df9f0f0866538f64b27f006562eae13&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="25452" author="hudson" created="Wed, 4 Jan 2012 08:31:09 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-master/./arch=x86_64,build_type=server,distro=el6,ib_stack=inkernel/397/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-master &#187; x86_64,server,el6,inkernel #397&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision 4ecb94432df9f0f0866538f64b27f006562eae13)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/master&amp;amp;a=commit&amp;amp;h=4ecb94432df9f0f0866538f64b27f006562eae13&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;4ecb94432df9f0f0866538f64b27f006562eae13&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="25453" author="hudson" created="Wed, 4 Jan 2012 08:33:15 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-master/./arch=x86_64,build_type=client,distro=sles11,ib_stack=inkernel/397/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-master &#187; x86_64,client,sles11,inkernel #397&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision 4ecb94432df9f0f0866538f64b27f006562eae13)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/master&amp;amp;a=commit&amp;amp;h=4ecb94432df9f0f0866538f64b27f006562eae13&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;4ecb94432df9f0f0866538f64b27f006562eae13&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="25454" author="hudson" created="Wed, 4 Jan 2012 08:33:47 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-master/./arch=i686,build_type=client,distro=el6,ib_stack=inkernel/397/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-master &#187; i686,client,el6,inkernel #397&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision 4ecb94432df9f0f0866538f64b27f006562eae13)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/master&amp;amp;a=commit&amp;amp;h=4ecb94432df9f0f0866538f64b27f006562eae13&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;4ecb94432df9f0f0866538f64b27f006562eae13&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="25455" author="hudson" created="Wed, 4 Jan 2012 08:33:55 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-master/./arch=x86_64,build_type=client,distro=el6,ib_stack=inkernel/397/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-master &#187; x86_64,client,el6,inkernel #397&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision 4ecb94432df9f0f0866538f64b27f006562eae13)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/master&amp;amp;a=commit&amp;amp;h=4ecb94432df9f0f0866538f64b27f006562eae13&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;4ecb94432df9f0f0866538f64b27f006562eae13&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="25456" author="hudson" created="Wed, 4 Jan 2012 08:34:41 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-master/./arch=x86_64,build_type=client,distro=ubuntu1004,ib_stack=inkernel/397/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-master &#187; x86_64,client,ubuntu1004,inkernel #397&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision 4ecb94432df9f0f0866538f64b27f006562eae13)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/master&amp;amp;a=commit&amp;amp;h=4ecb94432df9f0f0866538f64b27f006562eae13&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;4ecb94432df9f0f0866538f64b27f006562eae13&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="25457" author="hudson" created="Wed, 4 Jan 2012 08:38:34 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-master/./arch=i686,build_type=server,distro=el5,ib_stack=inkernel/397/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-master &#187; i686,server,el5,inkernel #397&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision 4ecb94432df9f0f0866538f64b27f006562eae13)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/master&amp;amp;a=commit&amp;amp;h=4ecb94432df9f0f0866538f64b27f006562eae13&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;4ecb94432df9f0f0866538f64b27f006562eae13&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="25458" author="hudson" created="Wed, 4 Jan 2012 08:40:03 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-master/./arch=i686,build_type=server,distro=el5,ib_stack=ofa/397/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-master &#187; i686,server,el5,ofa #397&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision 4ecb94432df9f0f0866538f64b27f006562eae13)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/master&amp;amp;a=commit&amp;amp;h=4ecb94432df9f0f0866538f64b27f006562eae13&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;4ecb94432df9f0f0866538f64b27f006562eae13&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="25460" author="hudson" created="Wed, 4 Jan 2012 08:44:41 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-master/./arch=x86_64,build_type=server,distro=el5,ib_stack=inkernel/397/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-master &#187; x86_64,server,el5,inkernel #397&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision 4ecb94432df9f0f0866538f64b27f006562eae13)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/master&amp;amp;a=commit&amp;amp;h=4ecb94432df9f0f0866538f64b27f006562eae13&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;4ecb94432df9f0f0866538f64b27f006562eae13&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="25461" author="hudson" created="Wed, 4 Jan 2012 08:49:35 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-master/./arch=i686,build_type=client,distro=el5,ib_stack=inkernel/397/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-master &#187; i686,client,el5,inkernel #397&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision 4ecb94432df9f0f0866538f64b27f006562eae13)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/master&amp;amp;a=commit&amp;amp;h=4ecb94432df9f0f0866538f64b27f006562eae13&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;4ecb94432df9f0f0866538f64b27f006562eae13&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="25462" author="hudson" created="Wed, 4 Jan 2012 08:51:08 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-master/./arch=x86_64,build_type=client,distro=el5,ib_stack=inkernel/397/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-master &#187; x86_64,client,el5,inkernel #397&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision 4ecb94432df9f0f0866538f64b27f006562eae13)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/master&amp;amp;a=commit&amp;amp;h=4ecb94432df9f0f0866538f64b27f006562eae13&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;4ecb94432df9f0f0866538f64b27f006562eae13&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="25463" author="hudson" created="Wed, 4 Jan 2012 08:51:35 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-master/./arch=i686,build_type=client,distro=el5,ib_stack=ofa/397/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-master &#187; i686,client,el5,ofa #397&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision 4ecb94432df9f0f0866538f64b27f006562eae13)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/master&amp;amp;a=commit&amp;amp;h=4ecb94432df9f0f0866538f64b27f006562eae13&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;4ecb94432df9f0f0866538f64b27f006562eae13&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="25464" author="pjones" created="Wed, 4 Jan 2012 08:52:06 +0000"  >&lt;p&gt;Landed for 2.2&lt;/p&gt;</comment>
                            <comment id="25864" author="pjones" created="Thu, 5 Jan 2012 09:35:30 +0000"  >&lt;p&gt;Bull reports that this has recurred at CEA&lt;/p&gt;</comment>
                            <comment id="25869" author="bfaccini" created="Thu, 5 Jan 2012 10:19:09 +0000"  >&lt;p&gt;So it seems that the work-around (&quot;patch which start transaction before catlog locking in&lt;br/&gt;
llog_cat_cancel_records()&quot;) described in JIRA &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; is not sufficient and we may need a patch to implement&lt;br/&gt;
&quot;the brute-force locking&quot;, the alternate solution already described in &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;What do you think?&lt;/p&gt;</comment>
                            <comment id="25951" author="niu" created="Thu, 5 Jan 2012 20:39:23 +0000"  >&lt;p&gt;Could you provide the stack trace? If we don&apos;t know the exact reason, I&apos;m afraid that the brute-force lock won&apos;t resolve the problem either.&lt;/p&gt;</comment>
                            <comment id="26243" author="bfaccini" created="Tue, 10 Jan 2012 09:49:39 +0000"  >&lt;p&gt;This time, the thread that has been hung for a long time in jbd2_journal_commit_transaction() is named &quot;jbd2/dm-0-8&quot;, but it still has the same stack:&lt;br/&gt;
==========================&lt;br/&gt;
Pid: 23067 &quot;jbd2/dm-0-8&quot;&lt;br/&gt;
schedule()&lt;br/&gt;
jbd2_journal_commit_transaction()&lt;br/&gt;
kjournald2()&lt;br/&gt;
kthread()&lt;br/&gt;
kernel_thread()&lt;br/&gt;
==========================&lt;/p&gt;

&lt;p&gt;Then there are a number of other Lustre threads (ll_&amp;lt;...&amp;gt;, mdt_&lt;span class=&quot;error&quot;&gt;&amp;#91;rdpg_&amp;#93;&lt;/span&gt;&amp;lt;id&amp;gt;, ...) stuck with the same final stack frames, shown below:&lt;br/&gt;
====================================================================================================================================&lt;br/&gt;
schedule()&lt;br/&gt;
start_this_handle()&lt;br/&gt;
jbd2_journal_start()&lt;br/&gt;
ldiskfs_journal_start_sb()&lt;/p&gt;

&lt;p&gt;......&lt;/p&gt;

&lt;p&gt;====================================================================================================================================&lt;/p&gt;
</comment>
                            <comment id="26251" author="niu" created="Tue, 10 Jan 2012 10:51:10 +0000"  >&lt;p&gt;Hi Bruno, is the exact full stack trace available?&lt;/p&gt;</comment>
                            <comment id="32834" author="pjones" created="Thu, 29 Mar 2012 09:50:28 +0000"  >&lt;p&gt;Landed for 2.2. Bull advised separately that this issue no longer occurs with the fix.&lt;/p&gt;</comment>
                            <comment id="33986" author="hudson" created="Sun, 8 Apr 2012 13:40:02 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-b2_1/./arch=x86_64,build_type=client,distro=sles11,ib_stack=inkernel/41/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-b2_1 &#187; x86_64,client,sles11,inkernel #41&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision d68d301d065296d2769ea2274bff75b21a98f9b6)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/b2_1&amp;amp;a=commit&amp;amp;h=d68d301d065296d2769ea2274bff75b21a98f9b6&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;d68d301d065296d2769ea2274bff75b21a98f9b6&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="34009" author="hudson" created="Sun, 8 Apr 2012 13:42:30 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-b2_1/./arch=i686,build_type=client,distro=el6,ib_stack=inkernel/41/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-b2_1 &#187; i686,client,el6,inkernel #41&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision d68d301d065296d2769ea2274bff75b21a98f9b6)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/b2_1&amp;amp;a=commit&amp;amp;h=d68d301d065296d2769ea2274bff75b21a98f9b6&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;d68d301d065296d2769ea2274bff75b21a98f9b6&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="34032" author="hudson" created="Sun, 8 Apr 2012 13:44:10 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-b2_1/./arch=x86_64,build_type=server,distro=el6,ib_stack=inkernel/41/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-b2_1 &#187; x86_64,server,el6,inkernel #41&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision d68d301d065296d2769ea2274bff75b21a98f9b6)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/b2_1&amp;amp;a=commit&amp;amp;h=d68d301d065296d2769ea2274bff75b21a98f9b6&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;d68d301d065296d2769ea2274bff75b21a98f9b6&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="34055" author="hudson" created="Sun, 8 Apr 2012 13:44:12 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-b2_1/./arch=i686,build_type=client,distro=el5,ib_stack=ofa/41/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-b2_1 &#187; i686,client,el5,ofa #41&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision d68d301d065296d2769ea2274bff75b21a98f9b6)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/b2_1&amp;amp;a=commit&amp;amp;h=d68d301d065296d2769ea2274bff75b21a98f9b6&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;d68d301d065296d2769ea2274bff75b21a98f9b6&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="34078" author="hudson" created="Sun, 8 Apr 2012 13:44:52 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-b2_1/./arch=x86_64,build_type=server,distro=el5,ib_stack=ofa/41/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-b2_1 &#187; x86_64,server,el5,ofa #41&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision d68d301d065296d2769ea2274bff75b21a98f9b6)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/b2_1&amp;amp;a=commit&amp;amp;h=d68d301d065296d2769ea2274bff75b21a98f9b6&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;d68d301d065296d2769ea2274bff75b21a98f9b6&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="34101" author="hudson" created="Sun, 8 Apr 2012 13:53:01 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-b2_1/./arch=x86_64,build_type=client,distro=el6,ib_stack=inkernel/41/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-b2_1 &#187; x86_64,client,el6,inkernel #41&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision d68d301d065296d2769ea2274bff75b21a98f9b6)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/b2_1&amp;amp;a=commit&amp;amp;h=d68d301d065296d2769ea2274bff75b21a98f9b6&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;d68d301d065296d2769ea2274bff75b21a98f9b6&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="34124" author="hudson" created="Sun, 8 Apr 2012 13:54:40 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-b2_1/./arch=i686,build_type=server,distro=el6,ib_stack=inkernel/41/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-b2_1 &#187; i686,server,el6,inkernel #41&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision d68d301d065296d2769ea2274bff75b21a98f9b6)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/b2_1&amp;amp;a=commit&amp;amp;h=d68d301d065296d2769ea2274bff75b21a98f9b6&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;d68d301d065296d2769ea2274bff75b21a98f9b6&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="34147" author="hudson" created="Sun, 8 Apr 2012 13:55:35 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-b2_1/./arch=x86_64,build_type=client,distro=el5,ib_stack=inkernel/41/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-b2_1 &#187; x86_64,client,el5,inkernel #41&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision d68d301d065296d2769ea2274bff75b21a98f9b6)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/b2_1&amp;amp;a=commit&amp;amp;h=d68d301d065296d2769ea2274bff75b21a98f9b6&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;d68d301d065296d2769ea2274bff75b21a98f9b6&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="34170" author="hudson" created="Sun, 8 Apr 2012 13:57:59 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-b2_1/./arch=i686,build_type=server,distro=el5,ib_stack=inkernel/41/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-b2_1 &#187; i686,server,el5,inkernel #41&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision d68d301d065296d2769ea2274bff75b21a98f9b6)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/b2_1&amp;amp;a=commit&amp;amp;h=d68d301d065296d2769ea2274bff75b21a98f9b6&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;d68d301d065296d2769ea2274bff75b21a98f9b6&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="34193" author="hudson" created="Sun, 8 Apr 2012 14:07:52 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-b2_1/./arch=x86_64,build_type=server,distro=el5,ib_stack=inkernel/41/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-b2_1 &#187; x86_64,server,el5,inkernel #41&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision d68d301d065296d2769ea2274bff75b21a98f9b6)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/b2_1&amp;amp;a=commit&amp;amp;h=d68d301d065296d2769ea2274bff75b21a98f9b6&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;d68d301d065296d2769ea2274bff75b21a98f9b6&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="34216" author="hudson" created="Sun, 8 Apr 2012 14:13:10 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-b2_1/./arch=i686,build_type=server,distro=el5,ib_stack=ofa/41/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-b2_1 &#187; i686,server,el5,ofa #41&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision d68d301d065296d2769ea2274bff75b21a98f9b6)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/b2_1&amp;amp;a=commit&amp;amp;h=d68d301d065296d2769ea2274bff75b21a98f9b6&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;d68d301d065296d2769ea2274bff75b21a98f9b6&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="34239" author="hudson" created="Sun, 8 Apr 2012 14:17:57 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-b2_1/./arch=x86_64,build_type=client,distro=el5,ib_stack=ofa/41/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-b2_1 &#187; x86_64,client,el5,ofa #41&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision d68d301d065296d2769ea2274bff75b21a98f9b6)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/b2_1&amp;amp;a=commit&amp;amp;h=d68d301d065296d2769ea2274bff75b21a98f9b6&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;d68d301d065296d2769ea2274bff75b21a98f9b6&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="34262" author="hudson" created="Sun, 8 Apr 2012 14:24:32 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://build.whamcloud.com/job/lustre-b2_1/./arch=i686,build_type=client,distro=el5,ib_stack=inkernel/41/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-b2_1 &#187; i686,client,el5,inkernel #41&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; deadlock of changelog adding vs. changelog cancelling (Revision d68d301d065296d2769ea2274bff75b21a98f9b6)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Oleg Drokin : &lt;a href=&quot;http://git.whamcloud.com/gitweb?p=fs/lustre-release.git;a=shortlog;h=refs/heads/b2_1&amp;amp;a=commit&amp;amp;h=d68d301d065296d2769ea2274bff75b21a98f9b6&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;d68d301d065296d2769ea2274bff75b21a98f9b6&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/mds/mds_log.c&lt;/li&gt;
	&lt;li&gt;lustre/mdd/mdd_device.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="42473" author="nrutman" created="Mon, 30 Jul 2012 19:51:04 +0000"  >&lt;blockquote&gt;
&lt;p&gt;So seems that the work-around (&quot;patch which start transaction before catlog locking in&lt;br/&gt;
llog_cat_cancel_records()&quot;) described in JIRA &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; is not sufficient and we may need a patch to implement&lt;br/&gt;
&quot;the brute-force locking&quot;, the alternate solution already described in &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-81&quot; title=&quot;Some JBD2 journaling deadlock at BULL&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-81&quot;&gt;&lt;del&gt;LU-81&lt;/del&gt;&lt;/a&gt; ....&lt;/p&gt;

&lt;p&gt;What do you think ???&lt;/p&gt;&lt;/blockquote&gt;
&lt;p&gt;I believe there is a general problem here that is not resolved by simply increasing the journal credits, which really just serves to mask the problem in some cases.  We&apos;re looking at a case now where cancelling lots of unlink records results in a similar lock inversion caused by the journal restart in the llog updates.  The code really needs to be changed so that the lock inversion can&apos;t happen.&lt;/p&gt;</comment>
                            <comment id="42545" author="bfaccini" created="Wed, 1 Aug 2012 09:45:31 +0000"  >&lt;p&gt;I understand the strong assumption is that we don&apos;t actually have a definitive fix for this rather infrequent deadlock. And BTW, I just got a new occurrence of this same scenario, but this time on an OSS running Lustre 2.1.1 and kernel version 2.6.32-131.12.1, which contains the JBD2 patch jbd2-commit-timer-no-jiffies-rounding.diff.&lt;/p&gt;

&lt;p&gt;The stacks of the hung threads involved look about the same:&lt;br/&gt;
=======================================================&lt;br/&gt;
PID: 19269 TASK: ffff880470699340 CPU: 1 COMMAND: &quot;ll_ost_io_249&quot;&lt;br/&gt;
 #0 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff88047069f520&amp;#93;&lt;/span&gt; schedule at ffffffff8147bdd9&lt;br/&gt;
 #1 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff88047069f5e8&amp;#93;&lt;/span&gt; jbd2_log_wait_commit at ffffffffa00867a5&lt;br/&gt;
 #2 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff88047069f678&amp;#93;&lt;/span&gt; fsfilt_ldiskfs_commit_wait at ffffffffa07bf25e&lt;br/&gt;
 #3 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff88047069f6c8&amp;#93;&lt;/span&gt; filter_commitrw_write at ffffffffa0a794c9&lt;br/&gt;
 #4 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff88047069f908&amp;#93;&lt;/span&gt; filter_commitrw at ffffffffa0a6b33d&lt;br/&gt;
 #5 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff88047069f9c8&amp;#93;&lt;/span&gt; obd_commitrw at ffffffffa069df5a&lt;br/&gt;
 #6 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff88047069fa48&amp;#93;&lt;/span&gt; ost_brw_write at ffffffffa06a7922&lt;br/&gt;
 #7 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff88047069fbf8&amp;#93;&lt;/span&gt; ost_handle at ffffffffa06abcd5&lt;br/&gt;
 #8 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff88047069fd68&amp;#93;&lt;/span&gt; ptlrpc_main at ffffffffa07103e9&lt;br/&gt;
 #9 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff88047069ff48&amp;#93;&lt;/span&gt; kernel_thread at ffffffff810041aa&lt;/p&gt;

&lt;p&gt;PID: 15704 TASK: ffff88062c52c0c0 CPU: 4 COMMAND: &quot;jbd2/dm-5-8&quot;&lt;br/&gt;
 #0 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff8804c1467c50&amp;#93;&lt;/span&gt; schedule at ffffffff8147bdd9&lt;br/&gt;
 #1 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff8804c1467d18&amp;#93;&lt;/span&gt; jbd2_journal_commit_transaction at ffffffffa0080970&lt;br/&gt;
 #2 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff8804c1467e68&amp;#93;&lt;/span&gt; kjournald2 at ffffffffa0086b48&lt;br/&gt;
 #3 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff8804c1467ee8&amp;#93;&lt;/span&gt; kthread at ffffffff8107ad36&lt;br/&gt;
 #4 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff8804c1467f48&amp;#93;&lt;/span&gt; kernel_thread at ffffffff810041aa&lt;/p&gt;

&lt;p&gt;and many others like this one:&lt;/p&gt;

&lt;p&gt;PID: 15892 TASK: ffff88062c73f4c0 CPU: 4 COMMAND: &quot;ll_ost_io_36&quot;&lt;br/&gt;
 #0 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff8804bd81b430&amp;#93;&lt;/span&gt; schedule at ffffffff8147bdd9&lt;br/&gt;
 #1 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff8804bd81b4f8&amp;#93;&lt;/span&gt; start_this_handle at ffffffffa007f092&lt;br/&gt;
 #2 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff8804bd81b5b8&amp;#93;&lt;/span&gt; jbd2_journal_start at ffffffffa007f510&lt;br/&gt;
 #3 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff8804bd81b608&amp;#93;&lt;/span&gt; ldiskfs_journal_start_sb at ffffffffa0a13758&lt;br/&gt;
 #4 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff8804bd81b618&amp;#93;&lt;/span&gt; fsfilt_ldiskfs_brw_start at ffffffffa07bf792&lt;br/&gt;
 #5 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff8804bd81b6c8&amp;#93;&lt;/span&gt; filter_commitrw_write at ffffffffa0a78cb8&lt;br/&gt;
 #6 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff8804bd81b908&amp;#93;&lt;/span&gt; filter_commitrw at ffffffffa0a6b33d&lt;br/&gt;
 #7 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff8804bd81b9c8&amp;#93;&lt;/span&gt; obd_commitrw at ffffffffa069df5a&lt;br/&gt;
 #8 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff8804bd81ba48&amp;#93;&lt;/span&gt; ost_brw_write at ffffffffa06a7922&lt;br/&gt;
 #9 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff8804bd81bbf8&amp;#93;&lt;/span&gt; ost_handle at ffffffffa06abcd5&lt;br/&gt;
#10 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff8804bd81bd68&amp;#93;&lt;/span&gt; ptlrpc_main at ffffffffa07103e9&lt;br/&gt;
#11 &lt;span class=&quot;error&quot;&gt;&amp;#91;ffff8804bd81bf48&amp;#93;&lt;/span&gt; kernel_thread at ffffffff810041aa&lt;br/&gt;
=======================================================&lt;/p&gt;

&lt;p&gt;But since we are on an OSS, this cannot be caused by any &quot;llog&quot; activity. Can we just consider that we are back to a &quot;pure&quot; JBD2 issue here?&lt;/p&gt;</comment>
                            <comment id="42555" author="wpower" created="Wed, 1 Aug 2012 12:51:24 +0000"  >&lt;p&gt;Bruno - can you post/attach the full set of stack traces for this lockup?&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                                                <inwardlinks description="is related to">
                                                        </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                            <attachment id="10230" name="hang_changelog_foreach_bt.txt" size="1132273" author="sebastien.buisson" created="Wed, 1 Jun 2011 00:05:33 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                    <customfield id="customfield_10020" key="com.atlassian.jira.plugin.system.customfieldtypes:float">
                        <customfieldname>Bugzilla ID</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>24438.0</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzv7rr:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>4793</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10021"><![CDATA[2]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>