<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:21:34 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-8907] LBUG: (namei.c:816:ll_create_node()) ASSERTION( list_empty(&amp;inode-&gt;i_dentry) ) failed</title>
                <link>https://jira.whamcloud.com/browse/LU-8907</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;One Lustre client crashed right after a recovery with the following messages:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;2016-10-29 16:00:01 Lustre: DEBUG MARKER: Sat Oct 29 16:00:01 2016
 2016-10-29 16:00:01
 2016-10-29 16:01:07 Lustre: 17766:0:(client.c:1942:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1477749640/real 1477749640] req@ffff8811e1b9a400 x1547883906151520/t0(0) 
 o101-&amp;gt;scratch3-MDT0000-mdc-ffff88205ec91400@JO.BOO.AL.IL@o2ib2:12/10 lens 632/1136 e 0 to 1 dl 1477749667 ref 2 fl Rpc:XP/0/ffffffff rc 0/-1
 2016-10-29 16:01:07 Lustre: 17766:0:(client.c:1942:ptlrpc_expire_one_request()) Skipped 12 previous similar messages
 2016-10-29 16:01:07 Lustre: scratch3-MDT0000-mdc-ffff88205ec91400: Connection to scratch3-MDT0000 (at JO.BOO.AL.IL@o2ib2) was lost; in progress operations using this service will wait for recovery to complet
 e
 2016-10-29 16:01:07 Lustre: Skipped 12 previous similar messages
 2016-10-29 16:01:07 Lustre: scratch3-MDT0000-mdc-ffff88205ec91400: Connection restored to scratch3-MDT0000 (at JO.BOO.AL.IL@o2ib2)
 2016-10-29 16:01:07 Lustre: Skipped 12 previous similar messages
 2016-10-29 16:01:07 LustreError: 17766:0:(namei.c:816:ll_create_node()) ASSERTION( list_empty(&amp;amp;inode-&amp;gt;i_dentry) ) failed:
 2016-10-29 16:01:07 LustreError: 17766:0:(namei.c:816:ll_create_node()) LBUG

&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;The stack is as follows:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;PID: 17766 TASK: ffff8819ab5b0ab0 CPU: 6 COMMAND: &quot;rsync&quot;
 #0 [ffff88189c697b20] machine_kexec at ffffffff8103ff4b
 #1 [ffff88189c697b80] crash_kexec at ffffffff810cfce2
 #2 [ffff88189c697c50] panic at ffffffff81546ce9
 #3 [ffff88189c697cd0] lbug_with_loc at ffffffffa067beeb [libcfs]
 #4 [ffff88189c697cf0] ll_create_nd at ffffffffa0dbf854 [lustre]
 #5 [ffff88189c697d70] vfs_create at ffffffff811a7946
 #6 [ffff88189c697db0] do_filp_open at ffffffff811ab75e
 #7 [ffff88189c697f20] do_sys_open at ffffffff81194e87
 #8 [ffff88189c697f70] sys_open at ffffffff81194f90
 #9 [ffff88189c697f80] system_call_fastpath at ffffffff8100b0d2
 RIP: 0000003c88adb480 RSP: 00007fffd85b6bc8 RFLAGS: 00010246
 RAX: 0000000000000002 RBX: ffffffff8100b0d2 RCX: 0000000000000001
 RDX: 0000000000000180 RSI: 00000000000000c2 RDI: 00007fffd85b7d70
 RBP: 00007fffd85b7d9b R8: 7a672e7761722e30 R9: 00000000ffffffff
 R10: 0000000000000001 R11: 0000000000000246 R12: ffffffff81194f90
 R13: ffff88189c697f78 R14: 00007fffd85b7d9c R15: 0000000000000000
 ORIG_RAX: 0000000000000002 CS: 0033 SS: 002b
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;&#160;&lt;/p&gt;

&lt;p&gt;Even though the intent is to create the inode with a dentry in the filesystem namespace (/somepath/.tmpfile), the inode structure field i_dentry is populated with one dentry pointing at&#160;/.lustre/fid/&lt;span class=&quot;error&quot;&gt;&amp;#91;0x298cd542a:0x3b3d:0x0&amp;#93;&lt;/span&gt;.&lt;/p&gt;

&lt;p&gt;Strangely enough, even though the crash occured in ll_create_nd() -&amp;gt; ll_create_it() -&amp;gt; ll_create_node, before the call to d_instantiate() in ll_create_nd() -&amp;gt; ll_create_it(), the fid actually really points back to the requested location:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;# lfs fid2path /mountpoint [0x298cd542a:0x3b3d:0x0]
/somepath/.tmpfile
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;I can&apos;t upload the crashdump due to site restriction, but will be happy to give more informations as you request them.&lt;/p&gt;</description>
                <environment>RHEL 6 Bull kernel 2.6.32-642.4.2.el6.Bull.100.x86_64&lt;br/&gt;
Lustre build based on 2.5.3.90</environment>
        <key id="42193">LU-8907</key>
            <summary>LBUG: (namei.c:816:ll_create_node()) ASSERTION( list_empty(&amp;inode-&gt;i_dentry) ) failed</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="bfaccini">Bruno Faccini</assignee>
                                    <reporter username="spiechurski">Sebastien Piechurski</reporter>
                        <labels>
                            <label>p4b</label>
                    </labels>
                <created>Mon, 5 Dec 2016 16:29:22 +0000</created>
                <updated>Wed, 7 Jun 2017 12:09:57 +0000</updated>
                            <resolved>Wed, 7 Jun 2017 12:09:57 +0000</resolved>
                                    <version>Lustre 2.5.3</version>
                                    <fixVersion>Lustre 2.10.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>8</watches>
                                                                            <comments>
                            <comment id="176520" author="bfaccini" created="Mon, 5 Dec 2016 18:29:12 +0000"  >&lt;p&gt;Hello Sebastien,&lt;br/&gt;
Could it be possible for me to work on crash-dump with you on-site or when you access it from your office?&lt;/p&gt;</comment>
                            <comment id="176649" author="spiechurski" created="Tue, 6 Dec 2016 08:19:58 +0000"  >&lt;p&gt;Hi Bruno,&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;

&lt;p&gt;For sure, I&apos;d be happy to work with you on this !&lt;/p&gt;</comment>
                            <comment id="176660" author="bfaccini" created="Tue, 6 Dec 2016 11:49:16 +0000"  >&lt;p&gt;Ok, then I will try to contact you offline to see how/when/where we can schedule a RDV.&lt;/p&gt;</comment>
                            <comment id="177052" author="spiechurski" created="Thu, 8 Dec 2016 15:39:34 +0000"  >&lt;p&gt;I forgot to mention that the intent dentry which should not had time to be instantiated (since the assert crashed before the call to d_instantiate()) is really /somepath/.tmpfile (the file actually exists on the fs currently).&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;</comment>
                            <comment id="179377" author="spiechurski" created="Tue, 3 Jan 2017 15:43:27 +0000"  >&lt;p&gt;Hi Bruno, and Happy New year to you !&lt;/p&gt;

&lt;p&gt;Were you able to discuss the data we extracted from the dump with your colleagues on this issue ?&lt;/p&gt;</comment>
                            <comment id="180023" author="bfaccini" created="Mon, 9 Jan 2017 09:00:19 +0000"  >&lt;p&gt;Thank you Seb and I wish the same for you and your family!&lt;br/&gt;
About the problem, it looks definitely like an unknown single-shoot for us.&lt;br/&gt;
But I am still working on both Lustre and Kernel code in order to find a hole which can lead to it.&lt;br/&gt;
BTW, can you tell me how does your kernel config (i.e., inode.i_dentry/dentry.d_alias use h_list or double-linked list) cause ll_d_hlist_empty() macro to be converted ?&lt;/p&gt;</comment>
                            <comment id="180024" author="bfaccini" created="Mon, 9 Jan 2017 09:34:54 +0000"  >&lt;p&gt;Oops, forget my previous question, the conversion can be found in the LBUG text ...&lt;/p&gt;</comment>
                            <comment id="180150" author="bfaccini" created="Mon, 9 Jan 2017 22:29:25 +0000"  >&lt;p&gt;Hello Seb,&lt;br/&gt;
As we already have discussed off-line, can you also give us some idea about any additional patches that have been applied on top of Lustre 2.5.3.90 and Kernel 2.6.32-642.4.2 ??&lt;br/&gt;
Thanks in advance.&lt;/p&gt;</comment>
                            <comment id="180453" author="bfaccini" created="Wed, 11 Jan 2017 15:40:24 +0000"  >&lt;p&gt;Hello Seb, did you see my previous update/query?&lt;br/&gt;
On my side, I am working on a possible reproducer, based on some other known occurrences of this same LBUG. Will let you know asap on how it goes.&lt;/p&gt;</comment>
                            <comment id="180573" author="bfaccini" created="Thu, 12 Jan 2017 09:38:21 +0000"  >&lt;p&gt;Hmm first attempt to reproduce looks interesting (promising?).&lt;br/&gt;
Running a modified version (to enable it on a single-node config) of replay-single/test_70b in a loop, I just got the following LBUG at the 25th pass :&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Lustre: DEBUG MARKER: -----============= acceptance-small: replay-single ============----- Wed Jan 11 16:40:48 UTC 2017
Lustre: DEBUG MARKER: only running test 70b
Lustre: DEBUG MARKER: excepting tests: 61d 33a 33b 89 62
Lustre: DEBUG MARKER: skipping tests SLOW=no: 1 2 3 4 6 12 16 44a 44b 65 66 67 68
Lustre: DEBUG MARKER: Using TIMEOUT=20
LustreError: 27268:0:(llite_lib.c:1225:ll_clear_inode()) ASSERTION( lli-&amp;gt;u.d.d_sai == ((void *)0) ) failed: 
LustreError: 27268:0:(llite_lib.c:1225:ll_clear_inode()) LBUG
Pid: 27268, comm: rm

Call Trace:
 [&amp;lt;ffffffffa03fe895&amp;gt;] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
 [&amp;lt;ffffffffa03fee97&amp;gt;] lbug_with_loc+0x47/0xb0 [libcfs]
 [&amp;lt;ffffffffa0f64e72&amp;gt;] ll_clear_inode+0x8e2/0x920 [lustre]
 [&amp;lt;ffffffffa0f63300&amp;gt;] ? ll_delete_inode+0x0/0x1e0 [lustre]
 [&amp;lt;ffffffff811a69fc&amp;gt;] clear_inode+0xac/0x140
 [&amp;lt;ffffffffa0f6338d&amp;gt;] ll_delete_inode+0x8d/0x1e0 [lustre]
 [&amp;lt;ffffffffa0f34000&amp;gt;] ? return_if_equal+0x0/0x30 [lustre]
 [&amp;lt;ffffffff811a70fe&amp;gt;] generic_delete_inode+0xde/0x1d0
 [&amp;lt;ffffffff811a7255&amp;gt;] generic_drop_inode+0x65/0x80
 [&amp;lt;ffffffff811a60a2&amp;gt;] iput+0x62/0x70
 [&amp;lt;ffffffffa0f3590a&amp;gt;] ll_d_iput+0xfa/0x820 [lustre]
 [&amp;lt;ffffffff811a2c7c&amp;gt;] dentry_iput+0x7c/0x100
 [&amp;lt;ffffffff811a2df1&amp;gt;] d_kill+0x31/0x60
 [&amp;lt;ffffffff811a49cc&amp;gt;] dput+0x7c/0x150
 [&amp;lt;ffffffff8119b1d9&amp;gt;] do_rmdir+0x199/0x1f0
 [&amp;lt;ffffffff81196df6&amp;gt;] ? final_putname+0x26/0x50
 [&amp;lt;ffffffff810e2257&amp;gt;] ? audit_syscall_entry+0x1d7/0x200
 [&amp;lt;ffffffff8119b25d&amp;gt;] sys_unlinkat+0x2d/0x40
 [&amp;lt;ffffffff8100b072&amp;gt;] system_call_fastpath+0x16/0x1b
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;which is only reported in &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-2048&quot; title=&quot;Crash in llite_lib.c:1161:ll_clear_inode()) ASSERTION( lli-&amp;gt;u.d.d_sai == ((void *)0) ) replay-vbr 7e&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-2048&quot;&gt;&lt;del&gt;LU-2048&lt;/del&gt;&lt;/a&gt; (quite old !!!) and without any analyze/explanation !!&#8230;.&lt;/p&gt;

&lt;p&gt;But unfortunately, kdump over the LAN/nfs has failed :&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;&#8230;&#8230;&#8230;&#8230;&#8230;..
mapping eth0 to eth0
ADDRCONF(NETDEV_UP): eth0: link is not ready
udhcpc (v1.15.1) started
Sending discover...
e1000e: eth0 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: Rx/Tx
ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
Sending discover...
Sending discover...
Unable to get a DHCP address retsd 5:0:0:0: [sdf] Synchronizing SCSI cache
ry...
No lease, failing
eth0 failed to come up
Saving to remote location eagle-2.eagle.hpdd.intel.com:/export/scratch
mount: RPC: Remote system error - Network is unreachable
sd 4:0:0:0: [sde] Synchronizing SCSI cache
sd 3:0:0:0: [sdd] Synchronizing SCSI cache
sd 2:0:0:0: [sdc] Synchronizing SCSI cache
sd 1:0:0:0: [sdb] Synchronizing SCSI cache
sd 0:0:0:0: [sda] Synchronizing SCSI cache
e1000e 0000:02:00.0: PCI INT A disabled
e1000e 0000:01:00.0: PCI INT A disabled
Restarting system.
machine restart
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="180578" author="spiechurski" created="Thu, 12 Jan 2017 10:29:57 +0000"  >&lt;p&gt;Hi Bruno,&#160;&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;

&lt;p&gt;Here is the list of patches we apply to our servers kernel:&lt;/p&gt;

&lt;p&gt;0003-Applied-Bull_lmigr-x86_64.el6.0.7.patch-for-RHEL-2.6.patch&lt;br/&gt;
Patch which delays migration of pages between numa nodes&lt;/p&gt;

&lt;p&gt;0006-Applied-Bull_311740-GENERIC_ALLOCATOR-4.patch&lt;br/&gt;
Include generic memory allocator for x86 architecture.&lt;/p&gt;

&lt;p&gt;0008-Applied-Bull_Dynamic_Power_Limitation_2.6.32-573.patch&lt;br/&gt;
CState manipulation interface through /sys&lt;/p&gt;

&lt;p&gt;0011-Applied-Bull_mce75xx_2.6.32-504.diff.patch&lt;br/&gt;
Intel Xeon 7500 series corrected memory error driver&lt;/p&gt;

&lt;p&gt;0012-Applied-Bull_improve-x86_64.csum.patch.patch&lt;br/&gt;
Performance improvement on TCP/IP checksumming&lt;/p&gt;

&lt;p&gt;0017-Applied-raid5-mmp-unplug-dev-rhel6.Bull.3.patch&lt;br/&gt;
Forces sync reads on MD devices&apos; disks for MMP compatibility&lt;/p&gt;

&lt;p&gt;0018-lustre-blkdev_tunables-2.6-rhel6.patch.patch&lt;br/&gt;
Some block device parameters value modifications&lt;/p&gt;

&lt;p&gt;0021-lustre-bh_lru_size_config.patch.patch&lt;br/&gt;
Buffer head per-cpu LRU size increased from 8 to 16&lt;/p&gt;

&lt;p&gt;0022-quota-replace-dqptr-sem.patch&lt;br/&gt;
Replaces dqptr semaphore by spin locks (lustre server kernel patch)&lt;/p&gt;

&lt;p&gt;0029-Applied-proc_pid_limits.patch.patch&lt;br/&gt;
Ease use of writes to /proc/&amp;lt;pid&amp;gt;/limits file&lt;/p&gt;

&lt;p&gt;0030-Applied-patch-Bull_oomprotect.diff.patch&lt;br/&gt;
Patch to protect processes from root to be killed by oom&lt;/p&gt;

&lt;p&gt;0055-quota-avoid-dqget-calls.patch&lt;br/&gt;
(lustre server kernel patch)&lt;/p&gt;

&lt;p&gt;0058-fix-CONFIG_XEN.patch&lt;br/&gt;
Some modifications to have kvm work when XEN is not enabled&lt;/p&gt;

&lt;p&gt;0059-ocfs2-getname.patch&lt;br/&gt;
fix getname/putname changes in ocfs2&lt;/p&gt;

&lt;p&gt;0090-kernel_H-2.6.32-573.1.1-config-x86_64.patch&lt;br/&gt;
0091-kernel_H-2.6.32-573.1.1-config-x86_64.debug.patch&lt;br/&gt;
Some modifications to the default RHEL kernel configuration.&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;

&lt;p&gt;For the lustre code, this is based on 2.5.3.90 (commit&#160;35bb8577c83f787451cd9842a07ac670cb6f4d3f), with some patches that I will send you by mail.&lt;/p&gt;</comment>
                            <comment id="180983" author="bfaccini" created="Tue, 17 Jan 2017 16:56:15 +0000"  >&lt;p&gt;Thanks Seb, I don&apos;t see any Kernel/Lustre patch that could be related to the problem.&lt;/p&gt;

&lt;p&gt;Concerning the other LBUG, related to &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-2048&quot; title=&quot;Crash in llite_lib.c:1161:ll_clear_inode()) ASSERTION( lli-&amp;gt;u.d.d_sai == ((void *)0) ) replay-vbr 7e&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-2048&quot;&gt;&lt;del&gt;LU-2048&lt;/del&gt;&lt;/a&gt;, that I have triggered during my first attempts to reproduce, it is a pure problem related to statahead code where some fixes appear to be missing in 2.5.3.90.&lt;/p&gt;

&lt;p&gt;On the other hand, I have been able to reproduce your LBUG by introducing the following tempo just before the LBUG :&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[root@eagle-28 lustre-release]# git status
# On branch b2_5
# Changed but not updated:
#   (use &quot;git add &amp;lt;file&amp;gt;...&quot; to update what will be committed)
#   (use &quot;git checkout -- &amp;lt;file&amp;gt;...&quot; to discard changes in working directory)
#
#       modified:   lustre/include/obd_support.h
#       modified:   lustre/llite/namei.c
#
# Untracked files:
#   (use &quot;git add &amp;lt;file&amp;gt;...&quot; to include in what will be committed)
#
#       lustre/include/obd_support.h.bfi
#       lustre/llite/namei.c.bfi
#       lustre/scripts/make_META.pl
no changes added to commit (use &quot;git add&quot; and/or &quot;git commit -a&quot;)
[root@eagle-28 lustre-release]# 
[root@eagle-28 lustre-release]# git diff
diff --git a/lustre/include/obd_support.h b/lustre/include/obd_support.h
index 37bee22..a21f8a0 100644
--- a/lustre/include/obd_support.h
+++ b/lustre/include/obd_support.h
@@ -430,6 +430,7 @@ int obd_alloc_fail(const void *ptr, const char *name, const char *type,
 #define OBD_FAIL_MDC_GETATTR_ENQUEUE     0x803
 #define OBD_FAIL_MDC_RPCS_SEM           0x804
 #define OBD_FAIL_MDC_LIGHTWEIGHT        0x805
+#define OBD_FAIL_MDC_CREATE_PAUSE       0x805
 
 #define OBD_FAIL_MGS                     0x900
 #define OBD_FAIL_MGS_ALL_REQUEST_NET     0x901
diff --git a/lustre/llite/namei.c b/lustre/llite/namei.c
index 39cd6d5..1cf3518 100644
--- a/lustre/llite/namei.c
+++ b/lustre/llite/namei.c
@@ -813,6 +813,8 @@ static struct inode *ll_create_node(struct inode *dir, const char *name,
         if (rc)
                 GOTO(out, inode = ERR_PTR(rc));
 
+       OBD_FAIL_TIMEOUT(OBD_FAIL_MDC_CREATE_PAUSE, 5);
+
        LASSERT(ll_d_hlist_empty(&amp;amp;inode-&amp;gt;i_dentry));
 
         /* We asked for a lock on the directory, but were granted a
[root@eagle-28 lustre-release]# 
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;and by running the following snippet :&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[root@eagle-28 lustre-release]# date; lctl set_param fail_loc=0x80000805 ; echo foofour &amp;gt; /mnt/lustre/foo8 &amp;amp; sleep 2 ; date ; fid=$(lfs changelog lustre-MDT0000 | grep &apos; foo8&apos; | sed &apos;s/.* t=//&apos; | sed &apos;s/ p=.*//&apos;) ; date ; cat /mnt/lustre/.lustre/fid/$fid ; date
Tue Jan 17 16:31:09 UTC 2017
fail_loc=0x80000805
[1] 27152
Tue Jan 17 16:31:11 UTC 2017
Tue Jan 17 16:31:11 UTC 2017
Tue Jan 17 16:31:11 UTC 2017
[root@eagle-28 lustre-release]# 
Message from syslogd@eagle-28 at Jan 17 16:31:14 ...
 kernel:LustreError: 27152:0:(namei.c:818:ll_create_node()) ASSERTION( list_empty(&amp;amp;inode-&amp;gt;i_dentry) ) failed: 
                  
Message from syslogd@eagle-28 at Jan 17 16:31:14 ...
 kernel:LustreError: 27152:0:(namei.c:818:ll_create_node()) LBUG
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;So this seems to mean that, as I have suspected from the start of our joint analysis, there is a potential race during file creation (including directory/dentry populate) and concurrent access using .lustre/fid method that is allowed to a ChangeLog reader, some kind of a&lt;br/&gt;
&quot;crime of insider trading&quot;. Or may be the LBUG is finally inaccurate and could be simply removed?&lt;/p&gt;

&lt;p&gt;More to come from crash-dump/debug-log analysis, and will also check if problem is still alive (like the  same LBUG!) in master.&lt;/p&gt;
</comment>
                            <comment id="181260" author="gerrit" created="Wed, 18 Jan 2017 22:47:33 +0000"  >&lt;p&gt;Faccini Bruno (bruno.faccini@intel.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/24976&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/24976&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-8907&quot; title=&quot;LBUG: (namei.c:816:ll_create_node()) ASSERTION( list_empty(&amp;amp;inode-&amp;gt;i_dentry) ) failed&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-8907&quot;&gt;&lt;del&gt;LU-8907&lt;/del&gt;&lt;/a&gt; llite: handle client racy case during create&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: b2_5&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 376e27c4ac97d31f9729f351317891bff3f54147&lt;/p&gt;</comment>
                            <comment id="181261" author="bfaccini" created="Wed, 18 Jan 2017 22:54:27 +0000"  >&lt;p&gt;I have just pushed &lt;a href=&quot;https://review.whamcloud.com/24976&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/24976&lt;/a&gt; to b2_5 branch. It introduces sanity/test_161d sub-test in order to check that the reproducer I have identified also fails in the auto-tests environment in a first time. And then, with next patch-set that removing LBUG could fix and not introduce any regression.&lt;/p&gt;

&lt;p&gt;Next, will also check, using the same way, if problem also exists in master.&lt;/p&gt;</comment>
                            <comment id="183052" author="spiechurski" created="Thu, 2 Feb 2017 14:27:22 +0000"  >&lt;p&gt;Hi Bruno, did you see the problem reproduced in the auto tests environment ?&lt;/p&gt;</comment>
                            <comment id="183723" author="gerrit" created="Tue, 7 Feb 2017 11:38:09 +0000"  >&lt;p&gt;Faccini Bruno (bruno.faccini@intel.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/25296&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/25296&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-8907&quot; title=&quot;LBUG: (namei.c:816:ll_create_node()) ASSERTION( list_empty(&amp;amp;inode-&amp;gt;i_dentry) ) failed&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-8907&quot;&gt;&lt;del&gt;LU-8907&lt;/del&gt;&lt;/a&gt; llite: handle client racy case during create&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 3d35f58c3b6c736e5db3d22dea2451bf634cdb95&lt;/p&gt;</comment>
                            <comment id="183910" author="bfaccini" created="Wed, 8 Feb 2017 10:21:57 +0000"  >&lt;p&gt;Yes, new sanity/test_161d sub-test is also a reproducer in auto-tests environment and it also demonstrate that problem is still alive in master!&lt;br/&gt;
This has been proofed by patch-set #2 testing of &lt;a href=&quot;https://review.whamcloud.com/25296&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/25296&lt;/a&gt;, and now patch-set #3 introduces a first attempt to fix simply removing the inaccurate &lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;LASSERT(ll_d_hlist_empty(&amp;amp;inode-&amp;gt;i_dentry));&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt; in ll_create_node() and exposes it to full tests suites.&lt;/p&gt;</comment>
                            <comment id="186776" author="spiechurski" created="Thu, 2 Mar 2017 16:30:33 +0000"  >&lt;p&gt;Hello Bruno,&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;

&lt;p&gt;What is the next step for this patch ?&lt;/p&gt;

&lt;p&gt;Are we waiting for reviewers ?&lt;/p&gt;</comment>
                            <comment id="189674" author="gerrit" created="Sun, 26 Mar 2017 06:50:24 +0000"  >&lt;p&gt;Oleg Drokin (oleg.drokin@intel.com) merged in patch &lt;a href=&quot;https://review.whamcloud.com/25296/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/25296/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-8907&quot; title=&quot;LBUG: (namei.c:816:ll_create_node()) ASSERTION( list_empty(&amp;amp;inode-&amp;gt;i_dentry) ) failed&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-8907&quot;&gt;&lt;del&gt;LU-8907&lt;/del&gt;&lt;/a&gt; llite: handle client racy case during create&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: ee25babfe72378f9496a9732742984f26eb7d4a5&lt;/p&gt;</comment>
                            <comment id="189728" author="bfaccini" created="Mon, 27 Mar 2017 09:03:51 +0000"  >&lt;p&gt;Hello Seb,&lt;br/&gt;
As you can see &lt;a href=&quot;https://review.whamcloud.com/25296/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/25296/&lt;/a&gt; has just landed on master. &lt;br/&gt;
And as per its content (at least its pure fix part in core code, i.e. simply remove LBUG), it should be pretty easy for you to back-port it on top of your Lustre branch based on 2.5.3.90.&lt;/p&gt;
</comment>
                            <comment id="198411" author="spiechurski" created="Wed, 7 Jun 2017 08:00:55 +0000"  >&lt;p&gt;Thanks Bruno.&lt;/p&gt;

&lt;p&gt;The patch was backported to our branch.&lt;/p&gt;

&lt;p&gt;This ticket can be closed.&lt;/p&gt;</comment>
                            <comment id="198432" author="pjones" created="Wed, 7 Jun 2017 12:06:57 +0000"  >&lt;p&gt;Thanks&lt;/p&gt;</comment>
                            <comment id="198433" author="pjones" created="Wed, 7 Jun 2017 12:09:57 +0000"  >&lt;p&gt;Landed for 2.10&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                                        </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzyxgn:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>