<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:17:29 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-15340] client stuck unable to complete eviction with &quot;still on delayed list&quot; messages printed</title>
                <link>https://jira.whamcloud.com/browse/LU-15340</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Sometimes client in eviction gets stuck, unable to complete it. The symptoms include ll_imp_inval thread printing &quot;still on delayed list&quot; for some stuck RPC(s) and an import showing in EVICTED stated in the device list.&lt;/p&gt;

&lt;p&gt;The problem seems to be a deadlock between a request on the delayed list that was signalled to complete, and ptlrpc thread that&apos;s supposed to take care of it being stuck waiting for the request to finish like this:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;PID: 1931 &#160; TASK: ffff8800d60a9110 &#160;CPU: 3 &#160; COMMAND: &quot;ptlrpcd_01_01&quot;
&#160;#0 [ffff8800c8cfb3e8] __schedule at ffffffff817e3e22
&#160;#1 [ffff8800c8cfb450] schedule at ffffffff817e4339
&#160;#2 [ffff8800c8cfb460] osc_extent_wait at ffffffffa086a0cd [osc]
&#160;#3 [ffff8800c8cfb590] osc_cache_wait_range at ffffffffa086c5ad [osc]
&#160;#4 [ffff8800c8cfb688] osc_cache_writeback_range at ffffffffa086d56e [osc]
&#160;#5 [ffff8800c8cfb7d0] osc_io_fsync_start at ffffffffa085b735 [osc]
&#160;#6 [ffff8800c8cfb810] cl_io_start at ffffffffa0325a8d [obdclass]
&#160;#7 [ffff8800c8cfb840] lov_io_call at ffffffffa08ca9f5 [lov]
&#160;#8 [ffff8800c8cfb878] lov_io_start at ffffffffa08cabc6 [lov]
&#160;#9 [ffff8800c8cfb898] cl_io_start at ffffffffa0325a8d [obdclass]
#10 [ffff8800c8cfb8c8] cl_io_loop at ffffffffa032803f [obdclass]
#11 [ffff8800c8cfb900] cl_sync_file_range at ffffffffa0e0b7eb [lustre]
#12 [ffff8800c8cfb958] ll_delete_inode at ffffffffa0e2686c [lustre]
#13 [ffff8800c8cfb970] evict at ffffffff81263a8f
#14 [ffff8800c8cfb998] iput at ffffffff81263ec5
#15 [ffff8800c8cfb9c8] __dentry_kill at ffffffff8125efc8
#16 [ffff8800c8cfb9f0] dput at ffffffff8125f78a
#17 [ffff8800c8cfba20] ll_dirty_page_discard_warn at ffffffffa0e2c205 [lustre]
#18 [ffff8800c8cfba90] vvp_page_completion_write at ffffffffa0e5a214 [lustre]
#19 [ffff8800c8cfbac0] cl_page_completion at ffffffffa03205e8 [obdclass]
#20 [ffff8800c8cfbb18] osc_ap_completion at ffffffffa08609b9 [osc]
#21 [ffff8800c8cfbb60] osc_extent_finish at ffffffffa0867792 [osc]
#22 [ffff8800c8cfbc60] brw_interpret at ffffffffa0849ee9 [osc]
#23 [ffff8800c8cfbcd8] ptlrpc_check_set at ffffffffa05fe4da [ptlrpc]
#24 [ffff8800c8cfbd90] ptlrpcd at ffffffffa062f014 [ptlrpc]
#25 [ffff8800c8cfbea8] kthread at ffffffff810ba114
#26 [ffff8800c8cfbf50] ret_from_fork_nospec_begin at ffffffff817f1e5d &lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;Of course blocking in ptlrpcd thread is a big no-no exactly for this sort of deadlocks.&lt;/p&gt;

&lt;p&gt;it sounds like we need to kick ll_dirty_page_discard_warn() call from vvp_vmpage_error() into a separate thread to ensure we are not blocking the brw_interpret where this is normally called from. or perhaps just the dput in there?&lt;/p&gt;</description>
                <environment></environment>
        <key id="67506">LU-15340</key>
            <summary>client stuck unable to complete eviction with &quot;still on delayed list&quot; messages printed</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="2" iconUrl="https://jira.whamcloud.com/images/icons/priorities/critical.svg">Critical</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="green">Oleg Drokin</assignee>
                                    <reporter username="green">Oleg Drokin</reporter>
                        <labels>
                    </labels>
                <created>Wed, 8 Dec 2021 04:14:54 +0000</created>
                <updated>Fri, 24 Jun 2022 18:04:10 +0000</updated>
                            <resolved>Mon, 31 Jan 2022 04:30:38 +0000</resolved>
                                    <version>Lustre 2.15.0</version>
                                    <fixVersion>Lustre 2.15.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                            <comments>
                            <comment id="320262" author="gerrit" created="Wed, 8 Dec 2021 04:50:08 +0000"  >&lt;p&gt;&quot;Oleg Drokin &amp;lt;green@whamcloud.com&amp;gt;&quot; uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/45784&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/45784&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-15340&quot; title=&quot;client stuck unable to complete eviction with &amp;quot;still on delayed list&amp;quot; messages printed&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-15340&quot;&gt;&lt;del&gt;LU-15340&lt;/del&gt;&lt;/a&gt; llite: Delay dput in ll_dirty_page_discard_warn&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 6bf95c9aa5506b7a13fe7148743cf64d28c54ea2&lt;/p&gt;</comment>
                            <comment id="323763" author="gerrit" created="Tue, 25 Jan 2022 03:48:59 +0000"  >&lt;p&gt;&quot;Oleg Drokin &amp;lt;green@whamcloud.com&amp;gt;&quot; uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/46296&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/46296&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-15340&quot; title=&quot;client stuck unable to complete eviction with &amp;quot;still on delayed list&amp;quot; messages printed&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-15340&quot;&gt;&lt;del&gt;LU-15340&lt;/del&gt;&lt;/a&gt; llite: Reuse existing inode for io warning print&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 1bec36589f16038d66051af5be25e8deb94a7098&lt;/p&gt;</comment>
                            <comment id="324502" author="gerrit" created="Mon, 31 Jan 2022 01:24:39 +0000"  >&lt;p&gt;&quot;Oleg Drokin &amp;lt;green@whamcloud.com&amp;gt;&quot; merged in patch &lt;a href=&quot;https://review.whamcloud.com/45784/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/45784/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-15340&quot; title=&quot;client stuck unable to complete eviction with &amp;quot;still on delayed list&amp;quot; messages printed&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-15340&quot;&gt;&lt;del&gt;LU-15340&lt;/del&gt;&lt;/a&gt; llite: Delay dput in ll_dirty_page_discard_warn&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: a1d75780ba19cfca53cbacf0d38e8d7df540b209&lt;/p&gt;</comment>
                            <comment id="324529" author="pjones" created="Mon, 31 Jan 2022 04:30:38 +0000"  >&lt;p&gt;Landed for 2.15&lt;/p&gt;</comment>
                            <comment id="329137" author="gerrit" created="Mon, 14 Mar 2022 14:49:15 +0000"  >&lt;p&gt;&quot;Etienne AUJAMES &amp;lt;eaujames@ddn.com&amp;gt;&quot; uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/46818&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/46818&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-15340&quot; title=&quot;client stuck unable to complete eviction with &amp;quot;still on delayed list&amp;quot; messages printed&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-15340&quot;&gt;&lt;del&gt;LU-15340&lt;/del&gt;&lt;/a&gt; llite: Delay dput in ll_dirty_page_discard_warn&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: b2_12&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: dc5d2593e1d85ee641ee6de72ad55437cdff75c2&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                                                <inwardlinks description="is duplicated by">
                                                        </inwardlinks>
                                    </issuelinktype>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="66730">LU-15127</issuekey>
        </issuelink>
                            </outwardlinks>
                                                                <inwardlinks description="is related to">
                                                        </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i02bxr:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>