<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:08:46 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-7422] incorrect ENOENT handling in mdt_intent_reint causes panic on client side</title>
                <link>https://jira.whamcloud.com/browse/LU-7422</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;In the DISP_OPEN_CREATE case, the client expects a valid fid value in the reply when it_status == 0.&lt;br/&gt;
When reint_open returns ENOENT, the fid is not set and the client gets a zero-filled fid. This may cause the following panic:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;&amp;gt; LustreError: 19177:0:(llite_lib.c:2178:ll_prep_inode()) ASSERTION( fid_is_sane(&amp;amp;md.body-&amp;gt;fid1) ) failed:
&amp;gt; LustreError: 19177:0:(llite_lib.c:2178:ll_prep_inode()) LBUG
&amp;gt; Pid: 19177, comm: cp
&amp;gt; 
&amp;gt; Call Trace:
&amp;gt;  [&amp;lt;ffffffff81006109&amp;gt;] try_stack_unwind+0x169/0x1b0
&amp;gt;  [&amp;lt;ffffffff81004b99&amp;gt;] dump_trace+0x89/0x440
&amp;gt;  [&amp;lt;ffffffffa036b8c7&amp;gt;] libcfs_debug_dumpstack+0x57/0x80 [libcfs]
&amp;gt;  [&amp;lt;ffffffffa036be27&amp;gt;] lbug_with_loc+0x47/0xc0 [libcfs]
&amp;gt;  [&amp;lt;ffffffffa0a21184&amp;gt;] ll_prep_inode+0xc04/0xe40 [lustre]
&amp;gt;  [&amp;lt;ffffffffa0a34721&amp;gt;] ll_create_nd+0x3d1/0xec0 [lustre]
&amp;gt;  [&amp;lt;ffffffff81163fe9&amp;gt;] vfs_create+0xa9/0xf0
&amp;gt;  [&amp;lt;ffffffff811646aa&amp;gt;] do_last+0x67a/0x7d0
&amp;gt;  [&amp;lt;ffffffff81165358&amp;gt;] path_openat+0xc8/0x3d0
&amp;gt;  [&amp;lt;ffffffff81165788&amp;gt;] do_filp_open+0x48/0xa0
&amp;gt;  [&amp;lt;ffffffff8115651e&amp;gt;] do_sys_open+0x16e/0x240
&amp;gt;  [&amp;lt;ffffffff81156630&amp;gt;] sys_open+0x20/0x30
&amp;gt;  [&amp;lt;ffffffff815662eb&amp;gt;] system_call_fastpath+0x16/0x1b
&amp;gt;  [&amp;lt;00007fc9c3726790&amp;gt;] 0x7fc9c3726790&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;We hit this issue on a DNE setup. For an unknown reason (possibly failover), the FLDB on the master mdt didn&apos;t include the OST seq ranges.&lt;br/&gt;
We hit the above panic every time we tried to create a regular file in a directory located on mdt1.&lt;/p&gt;</description>
                <environment></environment>
        <key id="33109">LU-7422</key>
            <summary>incorrect ENOENT handling in mdt_intent_reint causes panic on client side</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="2" iconUrl="https://jira.whamcloud.com/images/icons/priorities/critical.svg">Critical</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="scherementsev">Sergey Cheremencev</reporter>
                        <labels>
                            <label>patch</label>
                    </labels>
                <created>Thu, 12 Nov 2015 17:02:53 +0000</created>
                <updated>Thu, 20 Oct 2016 18:11:19 +0000</updated>
                            <resolved>Tue, 31 May 2016 12:49:26 +0000</resolved>
                                                    <fixVersion>Lustre 2.8.0</fixVersion>
                    <fixVersion>Lustre 2.9.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>8</watches>
                                                                            <comments>
                            <comment id="133368" author="adilger" created="Thu, 12 Nov 2015 17:38:23 +0000"  >&lt;p&gt;What version of Lustre are you testing?  The fid_is_sane() check should be skipped if -ENOENT is returned:&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;&lt;span class=&quot;code-object&quot;&gt;int&lt;/span&gt; ll_prep_inode(struct inode **inode, struct ptlrpc_request *req,
                  struct super_block *sb, struct lookup_intent *it)
{
        :
        :
        rc = md_get_lustre_md(sbi-&amp;gt;ll_md_exp, req, sbi-&amp;gt;ll_dt_exp,
                              sbi-&amp;gt;ll_md_exp, &amp;amp;md);
        &lt;span class=&quot;code-keyword&quot;&gt;if&lt;/span&gt; (rc != 0)
                GOTO(cleanup, rc);
        :
        :
                /*
                 * At &lt;span class=&quot;code-keyword&quot;&gt;this&lt;/span&gt; point server returns to client&apos;s same fid as client
                 * generated &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; creating. So using -&amp;gt;fid1 is okay here.
                 */
                LASSERT(fid_is_sane(&amp;amp;md.body-&amp;gt;mbo_fid1));
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="133467" author="gerrit" created="Fri, 13 Nov 2015 17:50:40 +0000"  >&lt;p&gt;Sergey Cheremencev (sergey_cheremencev@xyratex.com) uploaded a new patch: &lt;a href=&quot;http://review.whamcloud.com/17177&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/17177&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-7422&quot; title=&quot;incorrect ENOENT handling in mdt_intent_reint causes panic on client side&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-7422&quot;&gt;&lt;del&gt;LU-7422&lt;/del&gt;&lt;/a&gt; mdt: fix ENOENT handling in mdt_intent_reint&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: d1aeb6446540ef375bcade3f7d1215ad598d0675&lt;/p&gt;</comment>
                            <comment id="133468" author="sergey" created="Fri, 13 Nov 2015 17:55:34 +0000"  >&lt;p&gt;The problem was seen on Lustre 2.5.1.&lt;/p&gt;
&lt;blockquote&gt;&lt;p&gt;The fid_is_sane() check should be skipped if -ENOENT is returned:&lt;/p&gt;&lt;/blockquote&gt;
&lt;p&gt;Yes, but the problem here is that 0 is returned instead of -ENOENT.&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;static int mdt_intent_reint(enum mdt_it_code opcode,
                            struct mdt_thread_info *info,
                            struct ldlm_lock **lockp,
                            __u64 flags)
...
        if (rep-&amp;gt;lock_policy_res2 == -ENOENT &amp;amp;&amp;amp;
            mdt_get_disposition(rep, DISP_LOOKUP_NEG))
                rep-&amp;gt;lock_policy_res2 = 0;&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="134611" author="heckes" created="Thu, 26 Nov 2015 09:13:19 +0000"  >&lt;p&gt;The error also happened during soak testing of the master branch (see &lt;a href=&quot;https://wiki.hpdd.intel.com/pages/viewpage.action?title=Soak+Testing+on+Lola&amp;amp;spaceKey=Releases#SoakTestingonLola-20151122&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://wiki.hpdd.intel.com/pages/viewpage.action?title=Soak+Testing+on+Lola&amp;amp;spaceKey=Releases#SoakTestingonLola-20151122&lt;/a&gt;). DNE is enabled. MDSes are in active-active HA failover configuration.&lt;/p&gt;

&lt;p&gt;Here the error occurred after the remount of the MDTs on the MDS (&lt;tt&gt;lola-10&lt;/tt&gt;) completed successfully (2015-11-26 00:27:36).&lt;br/&gt;
The stack trace is pasted once more, as the context seems to differ from the one above.&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Nov 26 00:29:48 lola-29 kernel: LustreError: 65535:0:(llite_lib.c:2295:ll_prep_inode()) ASSERTION( fid_is_sane(&amp;amp;md.body-&amp;gt;mbo_fid1) ) failed:
Nov 26 00:29:48 lola-29 kernel: LustreError: 65535:0:(llite_lib.c:2295:ll_prep_inode()) LBUG
Nov 26 00:29:48 lola-29 kernel: Pid: 65535, comm: pct
Nov 26 00:29:48 lola-29 kernel:
Nov 26 00:29:48 lola-29 kernel: Call Trace:
Nov 26 00:29:48 lola-29 kernel: [&amp;lt;ffffffffa050b875&amp;gt;] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
Nov 26 00:29:48 lola-29 kernel: [&amp;lt;ffffffffa050be77&amp;gt;] lbug_with_loc+0x47/0xb0 [libcfs]
Nov 26 00:29:48 lola-29 kernel: [&amp;lt;ffffffffa0abdb62&amp;gt;] ll_prep_inode+0x752/0xc40 [lustre]
Nov 26 00:29:48 lola-29 kernel: [&amp;lt;ffffffffa0802c10&amp;gt;] ? lustre_swab_mdt_body+0x0/0x130 [ptlrpc]
Nov 26 00:29:48 lola-29 kernel: [&amp;lt;ffffffffa0ad29d2&amp;gt;] ll_new_node+0x682/0x7f0 [lustre]
Nov 26 00:29:48 lola-29 kernel: [&amp;lt;ffffffffa0ad5224&amp;gt;] ll_mkdir+0x104/0x220 [lustre]
Nov 26 00:29:48 lola-29 kernel: [&amp;lt;ffffffff8122ec0f&amp;gt;] ? security_inode_permission+0x1f/0x30
Nov 26 00:29:48 lola-29 kernel: [&amp;lt;ffffffff8119d759&amp;gt;] vfs_mkdir+0xd9/0x140
Nov 26 00:29:48 lola-29 kernel: [&amp;lt;ffffffff811a04e7&amp;gt;] sys_mkdirat+0xc7/0x1b0
Nov 26 00:29:48 lola-29 kernel: [&amp;lt;ffffffff8100c6f5&amp;gt;] ? math_state_restore+0x45/0x60
Nov 26 00:29:48 lola-29 kernel: [&amp;lt;ffffffff811a05e8&amp;gt;] sys_mkdir+0x18/0x20
Nov 26 00:29:48 lola-29 kernel: [&amp;lt;ffffffff8100b0d2&amp;gt;] system_call_fastpath+0x16/0x1b
Nov 26 00:29:48 lola-29 kernel:
Nov 26 00:29:48 lola-29 kernel: LustreError: dumping log to /tmp/lustre-log.1448526588.65535
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;Chronologically this event can be correlated to the following error on &lt;tt&gt;lola-10&lt;/tt&gt;:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;lola-10.log:Nov 26 00:29:26 lola-10 kernel: Lustre: 3875:0:(client.c:1994:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1448526555/real 1448526555]  req@ffff88041c873380 x1518889781449540/t0(
0) o400-&amp;gt;soaked-MDT0004-osp-MDT0005@0@lo:24/4 lens 224/224 e 1 to 1 dl 1448526560 ref 1 fl Rpc:X/c0/ffffffff rc 0/-1
lola-10.log:Nov 26 00:29:28 lola-10 kernel: LustreError: 5620:0:(ldlm_lib.c:1883:check_for_next_transno()) soaked-MDT0004: waking for gap in transno, VBR is OFF (skip: 992140549951, ql: 10, comp: 6, conn: 16, next: 992140549952
, next_update 992140549968 last_committed: 992140549904)
lola-10.log:Nov 26 00:29:28 lola-10 kernel: LustreError: 5620:0:(ldlm_lib.c:1883:check_for_next_transno()) soaked-MDT0004: waking for gap in transno, VBR is OFF (skip: 992140549956, ql: 10, comp: 6, conn: 16, next: 992140549957
, next_update 992140549968 last_committed: 992140549904)
etc... etc...
lola-10.log:Nov 26 00:29:34 lola-10 kernel: LustreError: 5620:0:(ldlm_lib.c:1883:check_for_next_transno()) soaked-MDT0004: waking for gap in transno, VBR is OFF (skip: 992140550321, ql: 10, comp: 6, conn: 16, next: 992140550322
, next_update 992140550332 last_committed: 992140550276)
lola-10.log:Nov 26 00:29:34 lola-10 kernel: LustreError: 5620:0:(ldlm_lib.c:1883:check_for_next_transno()) soaked-MDT0004: waking for gap in transno, VBR is OFF (skip: 992140550327, ql: 10, comp: 6, conn: 16, next: 992140550328
, next_update 992140550332 last_committed: 992140550276)
lola-10.log:Nov 26 00:29:35 lola-10 kernel: LustreError: 5620:0:(ldlm_lib.c:1883:check_for_next_transno()) soaked-MDT0004: waking for gap in transno, VBR is OFF (skip: 992140550374, ql: 10, comp: 6, conn: 16, next: 992140550375
, next_update 992140550385 last_committed: 992140550276)
lola-10.log:Nov 26 00:29:35 lola-10 kernel: LustreError: 5620:0:(ldlm_lib.c:1883:check_for_next_transno()) soaked-MDT0004: waking for gap in transno, VBR is OFF (skip: 992140550379, ql: 10, comp: 6, conn: 16, next: 992140550380
, next_update 992140550385 last_committed: 992140550276)
...
etc...
...
lola-10.log:Nov 26 00:29:40 lola-10 kernel: Lustre: 5620:0:(ldlm_lib.c:1946:target_recovery_overseer()) recovery is aborted, evict exports in recovery
lola-10.log:Nov 26 00:29:40 lola-10 kernel: Lustre: 5620:0:(ldlm_lib.c:1946:target_recovery_overseer()) Skipped 1 previous similar message
lola-10.log:Nov 26 00:29:40 lola-10 kernel: Lustre: soaked-MDT0004: disconnecting 10 stale clients
lola-10.log:Nov 26 00:29:40 lola-10 kernel: Lustre: 5620:0:(ldlm_lib.c:1580:abort_req_replay_queue()) @@@ aborted:  req@ffff8808216889c0 x1518687394127176/t0(992140550777) o36-&amp;gt;a8187858-2e65-5add-6cdf-c58a76255990@192.168.1.126@o2ib100:210/0 lens 624/0 e 0 to 0 dl 1448526600 ref 1 fl Complete:/4/ffffffff rc 0/-1
lola-10.log:Nov 26 00:29:40 lola-10 kernel: Lustre: 5620:0:(ldlm_lib.c:1580:abort_req_replay_queue()) Skipped 8 previous similar messages
lola-10.log:Nov 26 00:29:40 lola-10 kernel: Lustre: soaked-MDT0004: Denying connection for new client 429ceaa9-f9a8-504b-0d85-c997f52f2036(at 192.168.1.130@o2ib100), waiting for 16 known clients (6 recovered, 0 in progress, and 10 evicted) to recover in 2:25
lola-10.log:Nov 26 00:29:40 lola-10 kernel: Lustre: soaked-MDT0004: Recovery over after 2:04, of 16 clients 6 recovered and 10 were evicted.
lola-10.log:Nov 26 00:29:40 lola-10 kernel: Lustre: Skipped 9 previous similar messages
lola-10.log:Nov 26 00:29:40 lola-10 kernel: LustreError: 5620:0:(update_records.c:72:update_records_dump()) master transno = 992140550816 batchid = 987843952824 flags = 0 ops = 5 params = 9
lola-10.log:Nov 26 00:29:40 lola-10 kernel: LustreError: 5620:0:(update_records.c:72:update_records_dump()) master transno = 992140550834 batchid = 987843952825 flags = 0 ops = 5 params = 4
lola-10.log:Nov 26 00:29:40 lola-10 kernel: LustreError: 5620:0:(update_records.c:72:update_records_dump()) master transno = 992140550870 batchid = 987843952826 flags = 0 ops = 5 params = 9
lola-10.log:Nov 26 00:29:40 lola-10 kernel: LustreError: 5620:0:(update_records.c:72:update_records_dump()) master transno = 992140550885 batchid = 987843952827 flags = 0 ops = 5 params = 4
lola-10.log:Nov 26 00:29:40 lola-10 kernel: LustreError: 5620:0:(update_records.c:72:update_records_dump()) master transno = 992140550923 batchid = 987843952828 flags = 0 ops = 6 params = 10
lola-10.log:Nov 26 00:29:40 lola-10 kernel: LustreError: 5620:0:(update_records.c:72:update_records_dump()) master transno = 992140550934 batchid = 987843952829 flags = 0 ops = 5 params = 4
lola-10.log:Nov 26 00:29:40 lola-10 kernel: LustreError: 5620:0:(update_records.c:72:update_records_dump()) master transno = 992140550960 batchid = 987843952830 flags = 0 ops = 5 params = 9
lola-10.log:Nov 26 00:29:40 lola-10 kernel: LustreError: 5620:0:(update_records.c:72:update_records_dump()) master transno = 992140550975 batchid = 987843952831 flags = 0 ops = 5 params = 4
lola-10.log:Nov 26 00:29:40 lola-10 kernel: LustreError: 5620:0:(update_records.c:72:update_records_dump()) master transno = 992140551007 batchid = 987843952832 flags = 0 ops = 5 params = 9
lola-10.log:Nov 26 00:29:40 lola-10 kernel: LustreError: 5620:0:(update_records.c:72:update_records_dump()) master transno = 992140551023 batchid = 987843952833 flags = 0 ops = 5 params = 4
lola-10.log:Nov 26 00:29:40 lola-10 kernel: LustreError: 5620:0:(update_records.c:72:update_records_dump()) master transno = 992140551054 batchid = 987843952834 flags = 0 ops = 5 params = 9
lola-10.log:Nov 26 00:29:40 lola-10 kernel: LustreError: 5620:0:(update_records.c:72:update_records_dump()) master transno = 992140551065 batchid = 987843952835 flags = 0 ops = 5 params = 4
lola-10.log:Nov 26 00:29:40 lola-10 kernel: LustreError: 5620:0:(update_records.c:72:update_records_dump()) master transno = 992140551096 batchid = 987843952836 flags = 0 ops = 5 params = 9
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;(The gap in transno is an old bug, INTL-156.)&lt;/p&gt;

&lt;p&gt;Attached files: soak.log (to show events); client (&lt;tt&gt;lola-29&lt;/tt&gt;) kernel debug log file, messages, console logs; MDS (&lt;tt&gt;lola-10&lt;/tt&gt;) messages, console logs&lt;/p&gt;</comment>
                            <comment id="137448" author="sergey" created="Fri, 25 Dec 2015 15:39:29 +0000"  >&lt;p&gt;Maloo set -1 because there are 2 test failures:&lt;br/&gt;
1. sanity 230f - it is marked as a known bug, &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-7549&quot; title=&quot;sanity test_230f: FAIL: migrate dir fails&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-7549&quot;&gt;&lt;del&gt;LU-7549&lt;/del&gt;&lt;/a&gt;&lt;br/&gt;
2. conf-sanity 51. I don&apos;t see how it can be connected with my patch:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;15:32:55:LustreError: 3731:0:(lov_object.c:278:lov_init_raid0()) ASSERTION( subdev != NULL ) failed: not init ost 1
15:32:55:LustreError: 3731:0:(lov_object.c:278:lov_init_raid0()) LBUG
15:32:55:Pid: 3731, comm: touch
15:32:55:
15:32:55:Call Trace:
15:32:55: [&amp;lt;ffffffffa03e3875&amp;gt;] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
15:32:55: [&amp;lt;ffffffffa03e3e77&amp;gt;] lbug_with_loc+0x47/0xb0 [libcfs]
15:32:55: [&amp;lt;ffffffffa0ab1239&amp;gt;] lov_init_raid0+0xde9/0x1140 [lov]
15:32:55: [&amp;lt;ffffffffa0aadea4&amp;gt;] lov_object_init+0x124/0x300 [lov]
15:32:55: [&amp;lt;ffffffffa078621c&amp;gt;] ? lu_object_add+0x2c/0x30 [obdclass]
15:32:55: [&amp;lt;ffffffffa0788e48&amp;gt;] lu_object_alloc+0xd8/0x320 [obdclass]
15:32:55: [&amp;lt;ffffffffa078a1ad&amp;gt;] lu_object_find_try+0xc03e&amp;gt;] ? do_filp_open+0xa8e/0xd20
15:32:55: [&amp;lt;ffffffff81156a00&amp;gt;] ? __vma_link_rb+0x30/0x40
15:32:55: [&amp;lt;ffffffff81156aab&amp;gt;] ? vma_link+0x9b/0xf0
15:32:55: [&amp;lt;ffffffff8129dc3a&amp;gt;] ? strncpy_from_user+0x4a/0x90
15:32:55: [&amp;lt;ffffffff811b1d52&amp;gt;] ? alloc_fd+0x92/0x160
15:32:55: [&amp;lt;ffffffff8118e847&amp;gt;] ? do_sys_open+0x67/0x130
15:32:55: [&amp;lt;ffffffff8118e950&amp;gt;] ? sys_open+0x20/0x30
15:32:55: [&amp;lt;ffffffff8100b0d2&amp;gt;] ? system_call_fastpath+0x16/0x1b
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="138057" author="heckes" created="Wed, 6 Jan 2016 09:54:24 +0000"  >&lt;p&gt;Used build &apos;20160104&apos; from branch master (see &lt;a href=&quot;https://wiki.hpdd.intel.com/pages/viewpage.action?title=Soak+Testing+on+Lola&amp;amp;spaceKey=Releases#SoakTestingonLola-20160104&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://wiki.hpdd.intel.com/pages/viewpage.action?title=Soak+Testing+on+Lola&amp;amp;spaceKey=Releases#SoakTestingonLola-20160104&lt;/a&gt;).&lt;br/&gt;
DNE is enabled and MDSes are in active-active HA configuration. MDTs have been formatted using &lt;em&gt;ldiskfs&lt;/em&gt;, OSTs using &lt;em&gt;zfs&lt;/em&gt;.&lt;/p&gt;

&lt;p&gt;The original error happened again on two client nodes at almost the same time, during failback of the MDS resources to the primary node, while soak testing the build specified above.&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;&amp;lt;0&amp;gt;LustreError: 75738:0:(llite_lib.c:2295:ll_prep_inode()) ASSERTION( fid_is_sane(&amp;amp;md.body-&amp;gt;mbo_fid1) ) failed: 
&amp;lt;0&amp;gt;LustreError: 75738:0:(llite_lib.c:2295:ll_prep_inode()) LBUG
&amp;lt;4&amp;gt;Pid: 75738, comm: mdtest
...
&amp;lt;0&amp;gt;Kernel panic - not syncing: LBUG
&amp;lt;4&amp;gt;Pid: 75738, comm: mdtest Not tainted 2.6.32-504.30.3.el6.x86_64 #1
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;Crash dump files have been written for both nodes (&lt;tt&gt;lola-26,29&lt;/tt&gt;) and have been saved to &lt;tt&gt;lola-1:/scratch/crashdumps/lu-7422/lola-26-127.0.0.1-2016-01-05-19:02:53 , lola-29-127.0.0.1-2016-01-05-19:02:56&lt;/tt&gt;. Log files can be provided on demand.&lt;/p&gt;</comment>
                            <comment id="138842" author="gerrit" created="Wed, 13 Jan 2016 20:03:15 +0000"  >&lt;p&gt;Sergey Cheremencev (sergey_cheremencev@xyratex.com) uploaded a new patch: &lt;a href=&quot;http://review.whamcloud.com/17985&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/17985&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-7422&quot; title=&quot;incorrect ENOENT handling in mdt_intent_reint causes panic on client side&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-7422&quot;&gt;&lt;del&gt;LU-7422&lt;/del&gt;&lt;/a&gt; llite: don&apos;t panic when fid is insane&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 77306b00cd1b3536f8d6b4e9b25f7b065f737ec7&lt;/p&gt;</comment>
                            <comment id="138879" author="gerrit" created="Thu, 14 Jan 2016 03:59:22 +0000"  >&lt;p&gt;Oleg Drokin (oleg.drokin@intel.com) merged in patch &lt;a href=&quot;http://review.whamcloud.com/17177/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/17177/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-7422&quot; title=&quot;incorrect ENOENT handling in mdt_intent_reint causes panic on client side&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-7422&quot;&gt;&lt;del&gt;LU-7422&lt;/del&gt;&lt;/a&gt; mdt: fix ENOENT handling in mdt_intent_reint&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: ec9078afb635e8a64a4906b09dc99a2ab90e321b&lt;/p&gt;</comment>
                            <comment id="154014" author="gerrit" created="Tue, 31 May 2016 04:54:31 +0000"  >&lt;p&gt;Oleg Drokin (oleg.drokin@intel.com) merged in patch &lt;a href=&quot;http://review.whamcloud.com/17985/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/17985/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-7422&quot; title=&quot;incorrect ENOENT handling in mdt_intent_reint causes panic on client side&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-7422&quot;&gt;&lt;del&gt;LU-7422&lt;/del&gt;&lt;/a&gt; llite: don&apos;t panic when fid is insane&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: e4eb90ecdc09740d90834cb1e95b5693e6637173&lt;/p&gt;</comment>
                            <comment id="154062" author="pjones" created="Tue, 31 May 2016 12:49:26 +0000"  >&lt;p&gt;Landed for 2.9&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                                                <inwardlinks description="is duplicated by">
                                        <issuelink>
            <issuekey id="34806">LU-7798</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                                        </outwardlinks>
                                                                <inwardlinks description="is related to">
                                                        </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                            <attachment id="19726" name="console-lola-10.log.bz2" size="395597" author="heckes" created="Thu, 26 Nov 2015 12:30:57 +0000"/>
                            <attachment id="19727" name="console-lola-29.log.bz2" size="44631" author="heckes" created="Thu, 26 Nov 2015 12:30:57 +0000"/>
                            <attachment id="19728" name="lola-29-lustre-log.1448526588.65535.bz2" size="275" author="heckes" created="Thu, 26 Nov 2015 12:30:57 +0000"/>
                            <attachment id="19729" name="messages-lola-10.log.bz2" size="455793" author="heckes" created="Thu, 26 Nov 2015 12:30:57 +0000"/>
                            <attachment id="19730" name="messages-lola-29.log.bz2" size="419070" author="heckes" created="Thu, 26 Nov 2015 12:30:57 +0000"/>
                            <attachment id="19731" name="soak.log.bz2" size="366697" author="heckes" created="Thu, 26 Nov 2015 12:30:57 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzxswn:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>