<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:26:19 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-16356] high contention on cdt_request_lock causes clients to hang</title>
                <link>https://jira.whamcloud.com/browse/LU-16356</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;When cdt_restore_handle_list holds more than a million entries, walking the whole list takes a long time.&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt; #4 [ffff8b71d270f900] memcmp at ffffffff8ab7fe1c
 #5 [ffff8b71d270f908] cdt_restore_handle_find at ffffffffc14e7ee9 [mdt]
 #6 [ffff8b71d270f938] mdt_hsm_restore_is_running at ffffffffc14df0c2 [mdt]
 #7 [ffff8b71d270f968] mdt_getattr_internal at ffffffffc14919a1 [mdt]
 #8 [ffff8b71d270f9e0] mdt_getattr_name_lock at ffffffffc1495a7d [mdt]
 #9 [ffff8b71d270fa90] mdt_intent_getattr at ffffffffc149d5d5 [mdt]
#10 [ffff8b71d270fad0] mdt_intent_opc at ffffffffc14926ba [mdt]
#11 [ffff8b71d270fb30] mdt_intent_policy at ffffffffc149a7f4 [mdt]
#12 [ffff8b71d270fb70] ldlm_lock_enqueue at ffffffffc0ff852a [ptlrpc]
#13 [ffff8b71d270fbf0] ldlm_handle_enqueue0 at ffffffffc1020f97 [ptlrpc]
#14 [ffff8b71d270fc80] tgt_enqueue at ffffffffc10ab0f2 [ptlrpc]
#15 [ffff8b71d270fca0] tgt_request_handle at ffffffffc10aff0a [ptlrpc]
#16 [ffff8b71d270fd30] ptlrpc_server_handle_request at ffffffffc1055a56 [ptlrpc]
#17 [ffff8b71d270fde8] ptlrpc_main at ffffffffc1059b35 [ptlrpc]
#18 [ffff8b71d270fec8] kthread at ffffffff8a8c1f81
#19 [ffff8b71d270ff50] ret_from_fork_nospec_begin at ffffffff8af77c1d   &lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;The search runs with cdt_restore_lock held, so it blocks other tasks that also need cdt_restore_lock:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;crash&amp;gt; bt 38174
PID: 38174&#160; TASK: ffff8b722d5fb0c0&#160; CPU: 28&#160; COMMAND: &quot;hsm_cdtr&quot;
 #0 [ffff8b71c31af778] __schedule at ffffffff8af6ab17
 #1 [ffff8b71c31af808] schedule_preempt_disabled at ffffffff8af6bf39
 #2 [ffff8b71c31af818] __mutex_lock_slowpath at ffffffff8af69e87
 #3 [ffff8b71c31af878] mutex_lock at ffffffff8af6926f
 #4 [ffff8b71c31af890] cdt_restore_handle_del at ffffffffc14e8008 [mdt]
 #5 [ffff8b71c31af8c0] mdt_cdt_started_cb at ffffffffc14e8393 [mdt]
 #6 [ffff8b71c31af940] mdt_coordinator_cb at ffffffffc14e8659 [mdt]
 #7 [ffff8b71c31af978] llog_process_thread at ffffffffc0d1a7ff [obdclass]
 #8 [ffff8b71c31afa88] llog_process_or_fork at ffffffffc0d1bae9 [obdclass]
 #9 [ffff8b71c31afaf0] llog_cat_process_cb at ffffffffc0d211ea [obdclass]
#10 [ffff8b71c31afb40] llog_process_thread at ffffffffc0d1a7ff [obdclass]
#11 [ffff8b71c31afc50] llog_process_or_fork at ffffffffc0d1bae9 [obdclass]
#12 [ffff8b71c31afcb8] llog_cat_process_or_fork at ffffffffc0d1d961 [obdclass]
#13 [ffff8b71c31afd30] llog_cat_process at ffffffffc0d1db0e [obdclass]
#14 [ffff8b71c31afd50] cdt_llog_process at ffffffffc14da8be [mdt]
#15 [ffff8b71c31afda0] mdt_coordinator at ffffffffc14e4621 [mdt]
#16 [ffff8b71c31afec8] kthread at ffffffff8a8c1f81
#17 [ffff8b71c31aff50] ret_from_fork_nospec_begin at ffffffff8af77c1d   &lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;While investigating a case from an HPE customer, I found more than a million RESTORE entries in cdt_restore_handle_list:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;crash&amp;gt; list -o cdt_restore_handle.crh_list -H 0xffff8b71c5ede158 | wc -l
1218162  &lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;I also analyzed the entries, and there appear to be no duplicates in the list, i.e. every FID is unique.&lt;/p&gt;
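&lt;p&gt;A minimal sketch, not the Lustre code, of why the lookup is slow: a list search compares the key against every entry, while a hash table keyed by FID needs a roughly constant number of comparisons on average. The names below (Fid, find_in_list, find_in_hash) are hypothetical and only illustrate the idea of keeping the restore handles in a hash table instead of a list.&lt;/p&gt;

```python
from collections import namedtuple

# Hypothetical stand-in for a Lustre FID (sequence, object id, version).
Fid = namedtuple("Fid", ["seq", "oid", "ver"])


def find_in_list(entries, fid):
    """Linear scan: return (entry, comparisons made), like walking a linked list."""
    comparisons = 0
    for entry in entries:
        comparisons += 1
        if entry == fid:
            return entry, comparisons
    return None, comparisons


def find_in_hash(table, fid):
    """Hash lookup: on average the cost does not grow with the number of entries."""
    return table.get(fid)


# 100,000 unique FIDs; the report mentions more than a million, fewer keep this quick.
entries = [Fid(0x200000401, oid, 0) for oid in range(100000)]
table = {fid: fid for fid in entries}

# Look up a FID that is not present, which forces the list scan to visit every entry.
missing = Fid(0x200000401, 100000, 0)
found, comparisons = find_in_list(entries, missing)
print(found, comparisons)
print(find_in_hash(table, missing))
```

&lt;p&gt;In this sketch a miss costs one comparison per list entry. In the report, the corresponding scan runs with the lock held, so every other task that needs the lock waits for the full walk.&lt;/p&gt;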
</description>
                <environment></environment>
        <key id="73448">LU-16356</key>
            <summary>high contention on cdt_request_lock causes clients to hang</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="1" iconUrl="https://jira.whamcloud.com/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="default"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="scherementsev">Sergey Cheremencev</assignee>
                                    <reporter username="scherementsev">Sergey Cheremencev</reporter>
                        <labels>
                            <label>HSM</label>
                    </labels>
                <created>Wed, 30 Nov 2022 11:11:01 +0000</created>
                <updated>Thu, 18 Jan 2024 20:30:43 +0000</updated>
                                                                                <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                            <comments>
                            <comment id="354657" author="gerrit" created="Wed, 30 Nov 2022 11:25:02 +0000"  >&lt;p&gt;&quot;Sergey Cheremencev &amp;lt;sergey.cheremencev@hpe.com&amp;gt;&quot; uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/c/fs/lustre-release/+/49284&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/c/fs/lustre-release/+/49284&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-16356&quot; title=&quot;high contention on cdt_request_lock causes clients to hang&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-16356&quot;&gt;LU-16356&lt;/a&gt; hsm: store crh in rhashtable instead of list&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 22209b985bc3bd62ee078d29ac501202c2ec07c0&lt;/p&gt;</comment>
                            <comment id="356315" author="gerrit" created="Tue, 13 Dec 2022 21:54:31 +0000"  >&lt;p&gt;&quot;Nikitas Angelinas &amp;lt;nikitas.angelinas@hpe.com&amp;gt;&quot; uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/c/fs/lustre-release/+/49397&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/c/fs/lustre-release/+/49397&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-16356&quot; title=&quot;high contention on cdt_request_lock causes clients to hang&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-16356&quot;&gt;LU-16356&lt;/a&gt; hsm: initialize cdt_restore_hash at MDT startup time&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 48d00fd0c00cdd5be93a225f6e5b794ab0e3df39&lt;/p&gt;</comment>
                            <comment id="374884" author="gerrit" created="Thu, 8 Jun 2023 20:11:38 +0000"  >&lt;p&gt;&quot;Etienne AUJAMES &amp;lt;eaujames@ddn.com&amp;gt;&quot; uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/c/fs/lustre-release/+/51256&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/c/fs/lustre-release/+/51256&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-16356&quot; title=&quot;high contention on cdt_request_lock causes clients to hang&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-16356&quot;&gt;LU-16356&lt;/a&gt; hsm: for test only prototype&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 6bed4eeb2685125945a0db57115ff662d2135c8f&lt;/p&gt;</comment>
                            <comment id="392293" author="gerrit" created="Wed, 8 Nov 2023 21:59:03 +0000"  >&lt;p&gt;&quot;Oleg Drokin &amp;lt;green@whamcloud.com&amp;gt;&quot; merged in patch &lt;a href=&quot;https://review.whamcloud.com/c/fs/lustre-release/+/51256/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/c/fs/lustre-release/+/51256/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-16356&quot; title=&quot;high contention on cdt_request_lock causes clients to hang&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-16356&quot;&gt;LU-16356&lt;/a&gt; hsm: add running ref to the coordinator&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 4512347d6cda68fc8c949db521e00b485258c420&lt;/p&gt;</comment>
                            <comment id="399455" author="gerrit" created="Fri, 12 Jan 2024 12:00:37 +0000"  >&lt;p&gt;&quot;Etienne AUJAMES &amp;lt;eaujames@ddn.com&amp;gt;&quot; uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/c/fs/lustre-release/+/53661&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/c/fs/lustre-release/+/53661&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-16356&quot; title=&quot;high contention on cdt_request_lock causes clients to hang&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-16356&quot;&gt;LU-16356&lt;/a&gt; hsm: add running ref to the coordinator&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: b2_15&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 5361978eab8d95cd9bbf8e592e4d6ec2fb32f9f4&lt;/p&gt;</comment>
                            <comment id="400274" author="gerrit" created="Thu, 18 Jan 2024 20:30:43 +0000"  >&lt;p&gt;&quot;Etienne AUJAMES &amp;lt;eaujames@ddn.com&amp;gt;&quot; uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/c/fs/lustre-release/+/53735&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/c/fs/lustre-release/+/53735&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-16356&quot; title=&quot;high contention on cdt_request_lock causes clients to hang&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-16356&quot;&gt;LU-16356&lt;/a&gt; hsm: store crh in rhashtable instead of list&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: b2_15&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 47c549c029f3bf63bbe2ec420fd8dc633b8c54da&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="66751">LU-15132</issuekey>
        </issuelink>
                            </outwardlinks>
                                                                <inwardlinks description="is related to">
                                        <issuelink>
            <issuekey id="72775">LU-16235</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i036uv:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>