<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:30:02 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-9868] dcache/namei fixes for lustre</title>
                <link>https://jira.whamcloud.com/browse/LU-9868</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;From Neil Brown that works at SuSE&lt;/p&gt;

&lt;p&gt;I was drawn to look at this code due to the tests on&lt;br/&gt;
DCACHE_DISCONNECTED which are often wrong, and it turns out&lt;br/&gt;
they are used wrongly in lustre too.  Fixing one led to some&lt;br/&gt;
clean-up.  Fixing the other is straight forward.&lt;/p&gt;

&lt;p&gt;A particular change here from the previous posting is&lt;br/&gt;
the first patch which tests for DCACHE_PAR_LOOKUP in ll_dcompare().&lt;br/&gt;
Without this patch, two threads can be looking up the same&lt;br/&gt;
name in a given directory in parallel.  This parallelism lead&lt;br/&gt;
to my concerns about needing improved locking in ll_splice_alias().&lt;br/&gt;
Instead of improving the locking, I now avoid the need for it&lt;br/&gt;
by fixing ll_dcompare.&lt;/p&gt;

&lt;p&gt;This code passes basic &quot;smoke tests&quot;.&lt;/p&gt;

&lt;p&gt;Note that the cast to &quot;struct dentry *&quot; in the first patch is because&lt;br/&gt;
we have a &quot;const struct dentry *&quot; but d_in_lookup() requires a&lt;br/&gt;
pointer to a non-const structure.  I&apos;ll send a separate patch to&lt;br/&gt;
change d_in_lookup().&lt;/p&gt;</description>
                <environment>upstream and well as any lustre 2.11 client.</environment>
        <key id="47785">LU-9868</key>
            <summary>dcache/namei fixes for lustre</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="1" iconUrl="https://jira.whamcloud.com/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="default"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="simmonsja">James A Simmons</assignee>
                                    <reporter username="simmonsja">James A Simmons</reporter>
                        <labels>
                            <label>patch</label>
                            <label>upstream</label>
                    </labels>
                <created>Fri, 11 Aug 2017 17:02:54 +0000</created>
                <updated>Thu, 24 Nov 2022 14:32:32 +0000</updated>
                                            <version>Upstream</version>
                    <version>Lustre 2.11.0</version>
                    <version>Lustre 2.12.0</version>
                    <version>Lustre 2.10.5</version>
                                    <fixVersion>Upstream</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>13</watches>
                                                                            <comments>
                            <comment id="205162" author="gerrit" created="Fri, 11 Aug 2017 17:08:30 +0000"  >&lt;p&gt;James Simmons (uja.ornl@yahoo.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/28486&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/28486&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9868&quot; title=&quot;dcache/namei fixes for lustre&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9868&quot;&gt;LU-9868&lt;/a&gt; llite: dcache/namei fixes for lustre&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: ee439ac5304343708bc344c62297749c5909d59e&lt;/p&gt;</comment>
                            <comment id="234104" author="m.magrys" created="Fri, 28 Sep 2018 15:55:41 +0000"  >&lt;p&gt;I gave it a spin, but opening a file under vim (probably a mix of stat/open/write/chmod operations) hangs the process. Trace is :&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;
Sep 28 17:50:46 p0851 kernel: INFO: rcu_sched self-detected stall on CPU { 7}  (t=60000 jiffies g=16256 c=16255 q=13733)
Sep 28 17:50:46 p0851 kernel: Task dump &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; CPU 7:
Sep 28 17:50:46 p0851 kernel: vim             R  running task        0 28301  28299 0x0000000b
Sep 28 17:50:46 p0851 kernel: Call Trace:
Sep 28 17:50:46 p0851 kernel: &amp;lt;IRQ&amp;gt;  [&amp;lt;ffffffffa6ed1268&amp;gt;] sched_show_task+0xa8/0x110
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa6ed4e39&amp;gt;] dump_cpu_task+0x39/0x70
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa6f4bb00&amp;gt;] rcu_dump_cpu_stacks+0x90/0xd0
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa6f4f1a2&amp;gt;] rcu_check_callbacks+0x442/0x730
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa6f046c0&amp;gt;] ? tick_sched_do_timer+0x50/0x50
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa6ea7a16&amp;gt;] update_process_times+0x46/0x80
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa6f044c0&amp;gt;] tick_sched_handle+0x30/0x70
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa6f046f9&amp;gt;] tick_sched_timer+0x39/0x80
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa6ec2163&amp;gt;] __hrtimer_run_queues+0xf3/0x270
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa6ec26ef&amp;gt;] hrtimer_interrupt+0xaf/0x1d0
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa6e596ab&amp;gt;] local_apic_timer_interrupt+0x3b/0x60
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa752a083&amp;gt;] smp_apic_timer_interrupt+0x43/0x60
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa75267b2&amp;gt;] apic_timer_interrupt+0x162/0x170
Sep 28 17:50:46 p0851 kernel: &amp;lt;EOI&amp;gt;  [&amp;lt;ffffffffa6f0bb43&amp;gt;] ? native_queued_spin_lock_slowpath+0x1d3/0x200
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa750dc5f&amp;gt;] queued_spin_lock_slowpath+0xb/0xf
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa751b750&amp;gt;] _raw_spin_lock+0x20/0x30
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa703bf3e&amp;gt;] igrab+0x1e/0x60
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffc1026040&amp;gt;] ll_inode_from_resource_lock+0xa0/0xb0 [lustre]
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffc103ed92&amp;gt;] ll_md_blocking_ast+0x52/0x730 [lustre]
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa6f9f8e4&amp;gt;] ? free_one_page+0x2e4/0x310
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffc0e1321a&amp;gt;] ldlm_cancel_callback+0x8a/0x330 [ptlrpc]
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffc0e81f22&amp;gt;] ? null_free_repbuf+0xd2/0x200 [ptlrpc]
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa6fa01fd&amp;gt;] ? __free_pages+0x1d/0x30
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa6fa0412&amp;gt;] ? __free_memcg_kmem_pages+0x22/0x50
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffc0e1e9b0&amp;gt;] ldlm_cli_cancel_local+0xa0/0x420 [ptlrpc]
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffc0e2303a&amp;gt;] ldlm_cli_cancel_list_local+0xea/0x280 [ptlrpc]
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffc0e2335b&amp;gt;] ldlm_cancel_resource_local+0x18b/0x280 [ptlrpc]
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffc0a93e92&amp;gt;] mdc_resource_get_unused+0x142/0x2a0 [mdc]
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffc0a94217&amp;gt;] mdc_setattr+0x227/0x4a0 [mdc]
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa7037779&amp;gt;] ? dput+0x29/0x160
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffc0f5ff54&amp;gt;] lmv_setattr+0x1c4/0x570 [lmv]
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffc103115b&amp;gt;] ll_setattr_raw+0x28b/0x1350 [lustre]
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffc103228c&amp;gt;] ll_setattr+0x6c/0xd0 [lustre]
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa703dc54&amp;gt;] notify_change+0x2c4/0x420
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa701c837&amp;gt;] chmod_common+0x137/0x160
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa701dac7&amp;gt;] SyS_fchmodat+0x57/0xc0
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa701db49&amp;gt;] SyS_chmod+0x19/0x20
Sep 28 17:50:46 p0851 kernel: [&amp;lt;ffffffffa7525a1b&amp;gt;] tracesys+0xa3/0xc9
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;Tested on Centos 7.5 3.10.0-862.14.4.el7.x86_64 kernel and Lustre 2.10.5.&lt;/p&gt;</comment>
                            <comment id="234105" author="simmonsja" created="Fri, 28 Sep 2018 16:08:33 +0000"  >&lt;p&gt;Yeah the original patch was based on a earlier version posted for upstream. A later patch set was done and landed. I just did a push of the new patch. My testing shows its in much better shape.&lt;/p&gt;

&lt;p&gt;Sorry about the breakage.&lt;/p&gt;</comment>
                            <comment id="234151" author="m.magrys" created="Mon, 1 Oct 2018 09:12:04 +0000"  >&lt;p&gt;We moved patchset 17 into production after some initial testing and as for now it has been running for over 3000 node days without any problems. Is there any plan to have the patchset included in b2_10 or master?&lt;/p&gt;</comment>
                            <comment id="234176" author="simmonsja" created="Mon, 1 Oct 2018 17:23:15 +0000"  >&lt;p&gt;Wow, that is awesome news. Before your report of fixing an actually production system their wasn&apos;t really a push to merge this for 2.12. Now it looks like it&apos;s an important fix for production systems.&lt;/p&gt;

&lt;p&gt;The way it works is that we have to port patches exactly from upstream to opensfs so we are looking at 5 patches instead of the one test patch I have currently. Since this fixes real problems I will push to 2.10.X once it lands to latest lustre.&lt;/p&gt;</comment>
                            <comment id="234179" author="gerrit" created="Mon, 1 Oct 2018 18:22:43 +0000"  >&lt;p&gt;&lt;del&gt;James Simmons (uja.ornl@yahoo.com) uploaded a new patch:&lt;/del&gt; &lt;a href=&quot;https://review.whamcloud.com/33258&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/33258&lt;/a&gt;&lt;br/&gt;
&lt;del&gt;Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9868&quot; title=&quot;dcache/namei fixes for lustre&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9868&quot;&gt;LU-9868&lt;/a&gt; llite: use d_splice_alias for directories.&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Project: fs/lustre-release&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Branch: master&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Current Patch Set: 1&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Commit: 025cf259d4a66b2a5de57a66891cb88f0e860e14&lt;/del&gt;&lt;/p&gt;</comment>
                            <comment id="234180" author="gerrit" created="Mon, 1 Oct 2018 18:24:53 +0000"  >&lt;p&gt;James Simmons (uja.ornl@yahoo.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/33259&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/33259&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9868&quot; title=&quot;dcache/namei fixes for lustre&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9868&quot;&gt;LU-9868&lt;/a&gt; llite: remove directory-specific code from ll_find_alias()&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 58df6511f22d17a7cbe6d8d0f13d3c6da198c060&lt;/p&gt;</comment>
                            <comment id="234181" author="gerrit" created="Mon, 1 Oct 2018 18:26:45 +0000"  >&lt;p&gt;James Simmons (uja.ornl@yahoo.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/33260&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/33260&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9868&quot; title=&quot;dcache/namei fixes for lustre&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9868&quot;&gt;LU-9868&lt;/a&gt; llite: simplify ll_find_alias()&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 808f713c96423555d57f11dd69d0a372cc30c769&lt;/p&gt;</comment>
                            <comment id="234182" author="gerrit" created="Mon, 1 Oct 2018 18:27:55 +0000"  >&lt;p&gt;James Simmons (uja.ornl@yahoo.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/33261&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/33261&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9868&quot; title=&quot;dcache/namei fixes for lustre&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9868&quot;&gt;LU-9868&lt;/a&gt; llite: refine ll_find_alias based on d_exact_alias&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 160ed51460d9ab7be92a7abf4581560f9d97093a&lt;/p&gt;</comment>
                            <comment id="234193" author="simmonsja" created="Tue, 2 Oct 2018 01:14:20 +0000"  >&lt;p&gt;I see while this bug fixes things in newer kernel it breaks ancients kernel like RHEL.&lt;/p&gt;</comment>
                            <comment id="234648" author="simmonsja" created="Tue, 9 Oct 2018 18:35:22 +0000"  >&lt;p&gt;I found a work around to the bug the breaks older kernels. I&apos;m testing it now.&lt;/p&gt;</comment>
                            <comment id="234783" author="pjones" created="Thu, 11 Oct 2018 17:53:14 +0000"  >&lt;p&gt;We think that any changes in this area would need to soak a while and get tested broadly before including into a release&lt;/p&gt;</comment>
                            <comment id="234946" author="simmonsja" created="Mon, 15 Oct 2018 20:42:14 +0000"  >&lt;p&gt;I agree. This is a complex problem so caution is required here.&lt;/p&gt;</comment>
                            <comment id="235025" author="lflis" created="Wed, 17 Oct 2018 10:25:00 +0000"  >&lt;p&gt;We are running &lt;a href=&quot;https://review.whamcloud.com/#/c/28486&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/#/c/28486&lt;/a&gt; - patch set 17 with b2_10 and as Marek said there were no major issues except few nodes with processes blocked in rename operation invoked by software called OOMMF. We are not sure it&apos;s related but i thought it&apos;s worth reporting&lt;/p&gt;

&lt;p&gt;Stacktrace:&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;
[455219.524004] oxs             D ffff984d68ba1fa0     0  3818   3806 0x00000000
[455219.558389] Call Trace:
[455219.571231]  [&amp;lt;ffffffffa3f19e59&amp;gt;] schedule_preempt_disabled+0x29/0x70
[455219.602131]  [&amp;lt;ffffffffa3f17c17&amp;gt;] __mutex_lock_slowpath+0xc7/0x1d0
[455219.631887]  [&amp;lt;ffffffffa3f16fff&amp;gt;] mutex_lock+0x1f/0x2f
[455219.656923]  [&amp;lt;ffffffffa3a2b50a&amp;gt;] lock_rename+0xda/0xe0
[455219.681973]  [&amp;lt;ffffffffa3a315bf&amp;gt;] SYSC_renameat2+0x22f/0x5a0
[455219.709742]  [&amp;lt;ffffffffa39caefd&amp;gt;] ? handle_mm_fault+0x39d/0x9b0
[455219.739183]  [&amp;lt;ffffffffa3f2056c&amp;gt;] ? __do_page_fault+0x1bc/0x4f0
[455219.767746]  [&amp;lt;ffffffffa3a327ae&amp;gt;] SyS_renameat2+0xe/0x10
[455219.793358]  [&amp;lt;ffffffffa3a327ee&amp;gt;] SyS_rename+0x1e/0x20
[455219.818174]  [&amp;lt;ffffffffa3f2579b&amp;gt;] system_call_fastpath+0x22/0x27
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;According to our logs issue has started to appear few days after deployment of client patched with &lt;a href=&quot;https://review.whamcloud.com/#/c/28486&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/#/c/28486&lt;/a&gt; - patch set 17&lt;/p&gt;

&lt;p&gt;I have noticed that current patch set is 22 - do you recommend to give upgrade a try ?&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;


&lt;p&gt;UPDATE#1: or is it better way to patch rhel kernel like&#160; SuSe did to address the issues now?&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;</comment>
                            <comment id="235035" author="simmonsja" created="Wed, 17 Oct 2018 13:11:39 +0000"  >&lt;p&gt;I broke the patch up so its more than one now. Also a bug in lustre was exposed while testing this fix. On newer kernels it reports an error but older ones it will crash the node. Once I figure everything out I will port a 2.10 backported patch. The rename issue is new.&lt;/p&gt;</comment>
                            <comment id="235038" author="lflis" created="Wed, 17 Oct 2018 13:22:22 +0000"  >&lt;p&gt;OK. If so we will revert patch 17 from &lt;a href=&quot;https://review.whamcloud.com/#/c/28486&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/#/c/28486&lt;/a&gt; and see if the rename issue is gone as OOMMF crash is easily reproducible (however we don&apos;t have a separate reproducer yet). &lt;br/&gt;
We&apos;ll update the ticket to make clear whether it&apos;s related&lt;/p&gt;</comment>
                            <comment id="235148" author="lflis" created="Fri, 19 Oct 2018 08:50:51 +0000"  >&lt;p&gt;Reverting patch 17 from &lt;a href=&quot;https://review.whamcloud.com/#/c/28486&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/#/c/28486&lt;/a&gt;&#160; &lt;b&gt;fixed the problem&lt;/b&gt; with clients hanging on rename in oommf&lt;/p&gt;

&lt;p&gt;We have four processes interacting, probably one is handling file writes, then signals another one which does rename opearations which lead to client lock-up after several dozens of such operations&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;
[pid  5643] futex(0x2b774caf1d20, FUTEX_WAKE_PRIVATE, 1) = 0
[pid  5643] sendto(17, &lt;span class=&quot;code-quote&quot;&gt;&quot;reply 1762 0 0\r\n&quot;&lt;/span&gt;, 16, 0, NULL, 0) = 16
[pid  5643] recvfrom(18, &lt;span class=&quot;code-quote&quot;&gt;&quot;query 13 datafile {/net/scratch/&quot;&lt;/span&gt;..., 4096, 0, NULL, NULL) = 305
[pid  5643] stat(&lt;span class=&quot;code-quote&quot;&gt;&quot;/net/scratch/people/plgczuchr/OOMMF/dwieBariery/./dwieBariery/freeAniz_20000_Voltage_0.1500_mtop_-1_mfree_1_mbottom_1_VProfileType_0_/dwieBariery-Oxs_TimeDriver-Magnetization-00-0044000.omf&quot;&lt;/span&gt;, {st_mode=S_IFREG|0644, st_size=388216, ...}) = 0
[pid  5643] lstat(&lt;span class=&quot;code-quote&quot;&gt;&quot;/net/scratch/people/plgczuchr/slurm_jobdir/13738101/tmp/dwieBariery-p1311.prometheus-5582.omf&quot;&lt;/span&gt;, {st_mode=S_IFREG|0644, st_size=388216, ...}) = 0
[pid  5643] lstat(&lt;span class=&quot;code-quote&quot;&gt;&quot;/net/scratch/people/plgczuchr/OOMMF/dwieBariery/./dwieBariery/freeAniz_20000_Voltage_0.1500_mtop_-1_mfree_1_mbottom_1_VProfileType_0_/dwieBariery-Oxs_TimeDriver-Magnetization-00-0044000.omf&quot;&lt;/span&gt;, {st_mode=S_IFREG|0644, st_size=388216, ...}) = 0
[pid  5643] stat(&lt;span class=&quot;code-quote&quot;&gt;&quot;/net/scratch/people/plgczuchr/OOMMF/dwieBariery/./dwieBariery/freeAniz_20000_Voltage_0.1500_mtop_-1_mfree_1_mbottom_1_VProfileType_0_/dwieBariery-Oxs_TimeDriver-Magnetization-00-0044000.omf&quot;&lt;/span&gt;, {st_mode=S_IFREG|0644, st_size=388216, ...}) = 0
[pid  5643] chmod(&lt;span class=&quot;code-quote&quot;&gt;&quot;/net/scratch/people/plgczuchr/OOMMF/dwieBariery/./dwieBariery/freeAniz_20000_Voltage_0.1500_mtop_-1_mfree_1_mbottom_1_VProfileType_0_/dwieBariery-Oxs_TimeDriver-Magnetization-00-0044000.omf&quot;&lt;/span&gt;, 0644) = 0
[pid  5643] rename(&lt;span class=&quot;code-quote&quot;&gt;&quot;/net/scratch/people/plgczuchr/slurm_jobdir/13738101/tmp/dwieBariery-p1311.prometheus-5582.omf&quot;&lt;/span&gt;, &lt;span class=&quot;code-quote&quot;&gt;&quot;/net/scratch/people/plgczuchr/OOMMF/dwieBariery/./dwieBariery/freeAniz_20000_Voltage_0.1500_mtop_-1_mfree_1_mbottom_1_VProfileType_0_/dwieBariery-Oxs_TimeDriver-Magnetization-00-0044000.omf&quot;&lt;/span&gt;) = 0
[pid  5643] write(9, &lt;span class=&quot;code-quote&quot;&gt;&quot;\0&quot;&lt;/span&gt;, 1 &amp;lt;unfinished ...&amp;gt;
[pid  5652] &amp;lt;... select resumed&amp;gt; )      = 1 (in [8])
[pid  5643] &amp;lt;... write resumed&amp;gt; )       = 1
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="235364" author="simmonsja" created="Tue, 23 Oct 2018 15:29:39 +0000"  >&lt;p&gt;Thank you for this info and also for testing out this work. I will talk with Neil about this problem to see if we can resolve it.&lt;/p&gt;</comment>
                            <comment id="235396" author="simmonsja" created="Wed, 24 Oct 2018 01:01:11 +0000"  >&lt;p&gt;Which version of lustre are you running and is it only patched with &lt;a href=&quot;https://review.whamcloud.com/#/c/28486/17&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/#/c/28486/17&lt;/a&gt;&#160;? Better yet can you&lt;/p&gt;

&lt;p&gt;share your rpms with us.&lt;/p&gt;</comment>
                            <comment id="241678" author="gerrit" created="Mon, 11 Feb 2019 03:21:46 +0000"  >&lt;p&gt;Oleg Drokin (green@whamcloud.com) merged in patch &lt;a href=&quot;https://review.whamcloud.com/28486/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/28486/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9868&quot; title=&quot;dcache/namei fixes for lustre&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9868&quot;&gt;LU-9868&lt;/a&gt; llite: handle DCACHE_PAR_LOOKUP in ll_dcompare&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 32e547aa9cb913f5736ee3d58cb79f4e63ce2c0b&lt;/p&gt;</comment>
                            <comment id="245176" author="gerrit" created="Wed, 3 Apr 2019 10:56:34 +0000"  >&lt;p&gt;Sebastien Piechurski (sebastien.piechurski@atos.net) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/34582&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/34582&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9868&quot; title=&quot;dcache/namei fixes for lustre&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9868&quot;&gt;LU-9868&lt;/a&gt; llite: handle DCACHE_PAR_LOOKUP in ll_dcompare&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: b2_12&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: fb6aa13622d441b3495287cbc2630cc47da7c4ae&lt;/p&gt;</comment>
                            <comment id="245177" author="spiechurski" created="Wed, 3 Apr 2019 10:58:16 +0000"  >&lt;p&gt;Any chances to have the patch included in 2.12.1 ?&lt;/p&gt;</comment>
                            <comment id="245221" author="pjones" created="Thu, 4 Apr 2019 01:17:13 +0000"  >&lt;p&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/ViewProfile.jspa?name=spiechurski&quot; class=&quot;user-hover&quot; rel=&quot;spiechurski&quot;&gt;spiechurski&lt;/a&gt; it would be possible but what is the driver to do so?&lt;/p&gt;</comment>
                            <comment id="245223" author="simmonsja" created="Thu, 4 Apr 2019 02:19:35 +0000"  >&lt;p&gt;Note the work is not complete. I need to solve the dcache hard link breakage issue. I have an idea on how to fix it but it will take some time to resolve it.&lt;/p&gt;</comment>
                            <comment id="255782" author="spiechurski" created="Wed, 2 Oct 2019 14:55:21 +0000"  >&lt;p&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/ViewProfile.jspa?name=pjones&quot; class=&quot;user-hover&quot; rel=&quot;pjones&quot;&gt;pjones&lt;/a&gt; : one of our customer (Cardiff University) is hitting this issue from time to time.&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/ViewProfile.jspa?name=simmonsja&quot; class=&quot;user-hover&quot; rel=&quot;simmonsja&quot;&gt;simmonsja&lt;/a&gt; : Did you progress on this topic since April ? Maybe in another ticket ?&lt;/p&gt;</comment>
                            <comment id="255787" author="simmonsja" created="Wed, 2 Oct 2019 16:46:29 +0000"  >&lt;p&gt;Oh I thought with &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9735&quot; title=&quot;Sles12Sp2 and 2.9 getcwd() sometimes fails&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9735&quot;&gt;&lt;del&gt;LU-9735&lt;/del&gt;&lt;/a&gt; being closed this was not a pressing issue. I can look at it. Just need to find a few cycles to work on this.&lt;/p&gt;</comment>
                            <comment id="255788" author="spiechurski" created="Wed, 2 Oct 2019 17:07:19 +0000"  >&lt;p&gt;Indeed with &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9735&quot; title=&quot;Sles12Sp2 and 2.9 getcwd() sometimes fails&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9735&quot;&gt;&lt;del&gt;LU-9735&lt;/del&gt;&lt;/a&gt;, the occurences have dramatically decreased, but there are still a few left.&lt;/p&gt;

&lt;p&gt;No pressure though, I was just looking for information on progress.&lt;/p&gt;

&lt;p&gt;Thanks,&lt;/p&gt;

&lt;p&gt;Seb.&lt;/p&gt;</comment>
                            <comment id="257303" author="simmonsja" created="Wed, 30 Oct 2019 01:39:40 +0000"  >&lt;p&gt;I have been looking at how to resolve these issues and thinking of some ideas. So the breakage you see is due to the way Lustre manages its dcache so the special .lustre/fid directory works. This special handling is what is causing these problems. The patches from upstream resolve the issues you see but end up making the .lustre/fid handling not work. I cover why under ticket &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-11501&quot; title=&quot;use the dcache properly with .lustre/fid&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-11501&quot;&gt;LU-11501&lt;/a&gt; what that breakage is. Additional the special .lustre/fid directory doesn&apos;t work with file sets so I really need to think of a way to handle all these conditions.&lt;/p&gt;

&lt;p&gt;Before we can land the patches under these tickets .lustre/fid needs to be resolved first. What I have been thinking of is using a overlayfs type approach which used d_real() to map a dentry to a real dentry under the hood. This way a dentry for .lustre/fid/$FID can then be mapped to the real dentry. This means moving the .lustre/fid dentry out of the dcache into its own private cache for lookup. This is needed anyways since you don&apos;t want a mounted file set being able to FID outside of its tree. Also at the same time work is being done in the tools to move away from using .lustre/fid to using fhandles. This might help to resolve some of the issues as well.&lt;/p&gt;</comment>
                            <comment id="259815" author="gerrit" created="Fri, 13 Dec 2019 18:28:01 +0000"  >&lt;p&gt;&lt;del&gt;James Simmons (jsimmons@infradead.org) uploaded a new patch:&lt;/del&gt; &lt;a href=&quot;https://review.whamcloud.com/37013&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/37013&lt;/a&gt;&lt;br/&gt;
&lt;del&gt;Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9868&quot; title=&quot;dcache/namei fixes for lustre&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9868&quot;&gt;LU-9868&lt;/a&gt; llite: dcache/namei fixes for lustre&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Project: fs/lustre-release&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Branch: master&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Current Patch Set: 1&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Commit: b148c9e3f634ecff0b766d3ae89f27440690c001&lt;/del&gt;&lt;/p&gt;</comment>
                            <comment id="264130" author="gerrit" created="Thu, 27 Feb 2020 03:32:49 +0000"  >&lt;p&gt;Neil Brown (neilb@suse.de) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/37741&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/37741&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9868&quot; title=&quot;dcache/namei fixes for lustre&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9868&quot;&gt;LU-9868&lt;/a&gt; llite: remove lld_it field of ll_dentry_data&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: c21a2a28bd71d9fe4468c211cb64dc8ca27174e1&lt;/p&gt;</comment>
                            <comment id="264709" author="gerrit" created="Thu, 5 Mar 2020 22:35:39 +0000"  >&lt;p&gt;Oleg Drokin (green@whamcloud.com) merged in patch &lt;a href=&quot;https://review.whamcloud.com/37741/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/37741/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9868&quot; title=&quot;dcache/namei fixes for lustre&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9868&quot;&gt;LU-9868&lt;/a&gt; llite: remove lld_it field of ll_dentry_data&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: f172a59c99f7c6a3398b5f590f4cf1cecabdaee4&lt;/p&gt;</comment>
                            <comment id="267504" author="gerrit" created="Tue, 14 Apr 2020 08:09:04 +0000"  >&lt;p&gt;&lt;del&gt;Oleg Drokin (green@whamcloud.com) merged in patch&lt;/del&gt; &lt;a href=&quot;https://review.whamcloud.com/24175/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/24175/&lt;/a&gt;&lt;br/&gt;
&lt;del&gt;Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9868&quot; title=&quot;dcache/namei fixes for lustre&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9868&quot;&gt;LU-9868&lt;/a&gt; llite: Get rid of ll_dcompare&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Project: fs/lustre-release&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Branch: master&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Current Patch Set:&lt;/del&gt;&lt;br/&gt;
&lt;del&gt;Commit: 787231f53ab63c72634250f8fe9d27bc66cc4e46&lt;/del&gt;&lt;/p&gt;</comment>
                            <comment id="306173" author="gerrit" created="Sat, 3 Jul 2021 21:09:37 +0000"  >&lt;p&gt;James Simmons (jsimmons@infradead.org) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/44135&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/44135&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9868&quot; title=&quot;dcache/namei fixes for lustre&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9868&quot;&gt;LU-9868&lt;/a&gt; lustre: switch to use of -&amp;gt;d_init()&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: c704d8547f63613b34c96151f9451f407f0ef664&lt;/p&gt;</comment>
                            <comment id="312525" author="gerrit" created="Sat, 11 Sep 2021 06:19:39 +0000"  >&lt;p&gt;&quot;Oleg Drokin &amp;lt;green@whamcloud.com&amp;gt;&quot; merged in patch &lt;a href=&quot;https://review.whamcloud.com/44135/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/44135/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9868&quot; title=&quot;dcache/namei fixes for lustre&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9868&quot;&gt;LU-9868&lt;/a&gt; lustre: switch to use of -&amp;gt;d_init()&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 57569871375dd857a89e5e03524d0eec093fef5d&lt;/p&gt;</comment>
                            <comment id="354092" author="gerrit" created="Thu, 24 Nov 2022 14:32:32 +0000"  >&lt;p&gt;&quot;James Simmons &amp;lt;jsimmons@infradead.org&amp;gt;&quot; uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/c/fs/lustre-release/+/49237&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/c/fs/lustre-release/+/49237&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9868&quot; title=&quot;dcache/namei fixes for lustre&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9868&quot;&gt;LU-9868&lt;/a&gt; llite: remove lld_nfs_dentry flag&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 66e550a888a89e333033b8d19e14f4e1b41ca1c9&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                                                <inwardlinks description="is duplicated by">
                                        <issuelink>
            <issuekey id="72471">LU-16179</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                                                <inwardlinks description="is related to">
                                        <issuelink>
            <issuekey id="53572">LU-11501</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="47106">LU-9735</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="57438">LU-12997</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="58912">LU-13486</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="56278">LU-12511</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzzief:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>