<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:34:22 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-17308] makes &quot;lctl pool_*&quot; more reliable for big configurations</title>
                <link>https://jira.whamcloud.com/browse/LU-17308</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Recently, at the CEA, we hit an issue to re-create pools after a --writeconf on a big configuration (a lot of targets and pools).&lt;/p&gt;

&lt;p&gt;Errors were returned when adding too quickly OST in the pool (using separate commands). The workaround is to add delays between each command.&lt;/p&gt;

&lt;p&gt;This was hit on a standalone MGS with a mounted client (I am not 100% sure) with a 2.12.9.&lt;br/&gt;
Since 2.12, there are several patches that could help:&lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-17182&quot; title=&quot;lctl pool_add is slow when using individual OST&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-17182&quot;&gt;&lt;del&gt;LU-17182&lt;/del&gt;&lt;/a&gt; utils: pool_add send OSTs in one batch&lt;/li&gt;
	&lt;li&gt;&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-15706&quot; title=&quot;&amp;quot;lctl pool_destroy&amp;quot; can&amp;#39;t work correctly due to &amp;quot;SKIP&amp;quot; records&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-15706&quot;&gt;&lt;del&gt;LU-15706&lt;/del&gt;&lt;/a&gt; llog: deal with &quot;SKIP&quot; pool llog records correctly&lt;/li&gt;
	&lt;li&gt;&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-14516&quot; title=&quot;make mgc&amp;#39;s wait-before-reprocess configurable&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-14516&quot;&gt;&lt;del&gt;LU-14516&lt;/del&gt;&lt;/a&gt; mgc: configurable wait-to-reprocess time&lt;/li&gt;
	&lt;li&gt;&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-13686&quot; title=&quot;lctl pool_add returns error randomly&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-13686&quot;&gt;&lt;del&gt;LU-13686&lt;/del&gt;&lt;/a&gt; utils: pool_add/remove error code fix&lt;/li&gt;
&lt;/ul&gt;


&lt;p&gt;But I found some issues when I tried to understand the &quot;lctl pool_*&quot; command:&lt;/p&gt;
&lt;ol&gt;
	&lt;li&gt;&lt;b&gt;with a client&lt;/b&gt; (MDT and MGT share the same node), the sanity check before touching the MGS configuration is done in userspace by checking the lov client pool parameters. But nothing guarantees those parameters are sync with the MGS. Only the MGS configuration should be trusted, otherwise this could lead to inconsistencies (e.g: adding an OST to a non-existing pool). I think those kinds of behavior is more likely to be hit when executing several commands in a row (clients have to cancel their config lock and re-read their configuration for each command).&lt;/li&gt;
	&lt;li&gt;&lt;b&gt;on a separate MGS&lt;/b&gt; (without a client mounted), the MGS configuration is checked in userspace. But there are a lot of overheads. e.g: to add an OST, the MGS client configuration (fsname-client) is read 5 times (sanity check x3 + kernel x1 + check result x1). So when the configuration is big, this take time. And this use case is not documented.&lt;/li&gt;
	&lt;li&gt;&quot;lctl pool_add/pool_remove&quot; do not check the ioctl return code (kernel).&lt;/li&gt;
	&lt;li&gt;check_pool_cmd_result() does not re-compute the client wait delay with mgc_requeue_timeout_min parameter.&lt;/li&gt;
&lt;/ol&gt;
</description>
                <environment></environment>
        <key id="79117">LU-17308</key>
            <summary>makes &quot;lctl pool_*&quot; more reliable for big configurations</summary>
                <type id="4" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11310&amp;avatarType=issuetype">Improvement</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="eaujames">Etienne Aujames</assignee>
                                    <reporter username="eaujames">Etienne Aujames</reporter>
                        <labels>
                    </labels>
                <created>Tue, 21 Nov 2023 18:31:30 +0000</created>
                <updated>Sun, 4 Feb 2024 14:37:27 +0000</updated>
                            <resolved>Sun, 4 Feb 2024 14:37:27 +0000</resolved>
                                                                        <due></due>
                            <votes>0</votes>
                                    <watches>6</watches>
                                                                            <comments>
                            <comment id="393788" author="gerrit" created="Tue, 21 Nov 2023 21:19:30 +0000"  >&lt;p&gt;&quot;Etienne AUJAMES &amp;lt;eaujames@ddn.com&amp;gt;&quot; uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/c/fs/lustre-release/+/53202&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/c/fs/lustre-release/+/53202&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-17308&quot; title=&quot;makes &amp;quot;lctl pool_*&amp;quot; more reliable for big configurations&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-17308&quot;&gt;&lt;del&gt;LU-17308&lt;/del&gt;&lt;/a&gt; mgs: move pool_cmd check to the kernel&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 641a8a68655f57205f12b5f1731efd3b7f0825f0&lt;/p&gt;</comment>
                            <comment id="402549" author="gerrit" created="Sun, 4 Feb 2024 08:28:42 +0000"  >&lt;p&gt;&quot;Oleg Drokin &amp;lt;green@whamcloud.com&amp;gt;&quot; merged in patch &lt;a href=&quot;https://review.whamcloud.com/c/fs/lustre-release/+/53202/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/c/fs/lustre-release/+/53202/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-17308&quot; title=&quot;makes &amp;quot;lctl pool_*&amp;quot; more reliable for big configurations&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-17308&quot;&gt;&lt;del&gt;LU-17308&lt;/del&gt;&lt;/a&gt; mgs: move pool_cmd check to the kernel&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: ce824977a212d243e15cf07e52a91984841f9b17&lt;/p&gt;</comment>
                            <comment id="402581" author="pjones" created="Sun, 4 Feb 2024 14:37:27 +0000"  >&lt;p&gt;Merged for 2.16&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                                                <inwardlinks description="is related to">
                                        <issuelink>
            <issuekey id="78720">LU-17250</issuekey>
        </issuelink>
            <issuelink>
            <issuekey id="42646">LU-8970</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i042f3:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>