<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:23:16 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-2207] planning upgrade to 2.3 to increase metadata performance using smp affinity.</title>
                <link>https://jira.whamcloud.com/browse/LU-2207</link>
                <project id="10000" key="LU">Lustre</project>
                    <description></description>
                <environment>We are planning to upgrade to the 2.3 lustre version to increase metadata performance expecially using the smp affinity. We have a Intel Nehalem 2 socket, 4 cores each socket as a mds and Intel Sandy Bridge 2 socket, 8 cores each.</environment>
        <key id="16395">LU-2207</key>
            <summary>planning upgrade to 2.3 to increase metadata performance using smp affinity.</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="6">Not a Bug</resolution>
                                        <assignee username="liang">Liang Zhen</assignee>
                                    <reporter username="lustre.support">Supporto Lustre Jnet2000</reporter>
                        <labels>
                    </labels>
                <created>Wed, 17 Oct 2012 20:34:15 +0000</created>
                <updated>Wed, 24 Oct 2012 17:16:30 +0000</updated>
                            <resolved>Wed, 24 Oct 2012 17:16:30 +0000</resolved>
                                    <version>Lustre 2.3.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                            <comments>
                            <comment id="46694" author="lustre.support" created="Wed, 17 Oct 2012 20:37:42 +0000"  >&lt;p&gt;Hi,&lt;br/&gt;
could you help us to planning the configuration of the lustre mds and the lustre client in term of Compute Partitions to maximize the metadata performance? Thanks in advance.&lt;/p&gt;</comment>
                            <comment id="46695" author="lustre.support" created="Wed, 17 Oct 2012 20:40:00 +0000"  >&lt;p&gt;Every node has 1 infiniband hca (ib0).&lt;/p&gt;</comment>
                            <comment id="46704" author="pjones" created="Thu, 18 Oct 2012 01:46:25 +0000"  >&lt;p&gt;Hi there&lt;/p&gt;

&lt;p&gt;When is your timeframe for this upgrade? We are still in final release testing for 2.3 and it is not GA yet...&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="46743" author="lustre.support" created="Thu, 18 Oct 2012 13:45:32 +0000"  >&lt;p&gt;Hi Peter,&lt;br/&gt;
we are planning for the end of the year,but the customer need a detailed plan for the next week to approve the project. I yust need more information on the configuration of &quot;smp affinity&quot; for our hardware because the information that I find in the &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-56&quot; title=&quot;Finish SMP scalability work&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-56&quot;&gt;&lt;del&gt;LU-56&lt;/del&gt;&lt;/a&gt; is not enough to understand well. Could you help me?&lt;/p&gt;

&lt;p&gt;Thanks in advance.&lt;/p&gt;</comment>
                            <comment id="46754" author="pjones" created="Thu, 18 Oct 2012 16:56:10 +0000"  >&lt;p&gt;ok. I am assigning this ticket to the engineer who worked on this feature. Could you please elaborate with some specifics as to what you need to know? If any action is needed then I would expect this to be covered in the latest version of the Lustre manual.&lt;/p&gt;</comment>
                            <comment id="46757" author="lustre.support" created="Thu, 18 Oct 2012 18:18:44 +0000"  >&lt;p&gt;Our goal is to increase the metadata performance.  We have a Intel Nehalem 2 socket, 4 cores each socket as a mds and Intel Sandy Bridge 2 socket, 8 cores each as client. Our network is a QDR infiniband.&lt;/p&gt;
</comment>
                            <comment id="46773" author="liang" created="Fri, 19 Oct 2012 09:27:35 +0000"  >&lt;p&gt;this is going to be covered by Lustre manual very soon.&lt;br/&gt;
A few questions: &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;is there router in your cluster?&lt;/li&gt;
	&lt;li&gt;how many clients node in the cluster?&lt;/li&gt;
	&lt;li&gt;how many OSS and OSTs in the cluster?&lt;br/&gt;
But I think it&apos;s worth a try with all defaults, of course, there are a few things can be tuned:&lt;/li&gt;
	&lt;li&gt;turne on &quot;Portal Rotor&quot; on MDS, it can be tuned on by &quot;echo on &amp;gt; /proc/sys/lnet/portal_rotor&quot; (please check define or &quot;portal rotor&quot; from here: &lt;a href=&quot;http://jira.whamcloud.com/secure/attachment/11805/opensfs_smp_demonstration.pdf&quot; class=&quot;external-link&quot; rel=&quot;nofollow&quot;&gt;http://jira.whamcloud.com/secure/attachment/11805/opensfs_smp_demonstration.pdf&lt;/a&gt;)&lt;/li&gt;
	&lt;li&gt;By default Lustre will create 2 CPU partitions on 2X4 cores MDS, you can try to set it to 4 (options libcfs cpu_npartitions=4) and choose the better one.&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="46803" author="lustre.support" created="Sat, 20 Oct 2012 03:59:38 +0000"  >&lt;p&gt;I&apos;ve setting up a small lab with 6 clients (Sandy Bridge Intel CPU) and 2 server (Nehalem Intel CPU).&lt;br/&gt;
I&apos;m trying with lustre 2.3RC2.&lt;/p&gt;

&lt;p&gt;I respond to your question for the lab that I have setting up.&lt;/p&gt;

&lt;p&gt;1) no router yust a mellanox switch, we use infiniband connections&lt;br/&gt;
2) 6 clients&lt;br/&gt;
3) 2 server. I have 4 lun for the OSS and 1 lun for the MDS. I plan to use 1 server as MDS and 1 server as OSS. &lt;br/&gt;
4) okay&lt;br/&gt;
5) I see this in the /var/log/message:&lt;/p&gt;

&lt;p&gt;Oct 20 09:25:37 gemini2 kernel: LNet: HW CPU cores: 8, npartitions: 2&lt;br/&gt;
Oct 20 09:25:37 gemini2 kernel: alg: No test for crc32 (crc32-table)&lt;br/&gt;
Oct 20 09:25:37 gemini2 kernel: alg: No test for adler32 (adler32-zlib)&lt;br/&gt;
Oct 20 09:25:41 gemini2 kernel: padlock: VIA PadLock Hash Engine not detected.&lt;br/&gt;
Oct 20 09:25:41 gemini2 modprobe: FATAL: Error inserting padlock_sha (/lib/modules/2.6.32-279.5.1.el6_lustre.g8932d49.x86_64/kernel/drivers/crypto/padlock-sha.ko): No such device&lt;br/&gt;
Oct 20 09:25:50 gemini2 kernel: Lustre: Lustre: Build Version: 2.3.0-RC2--PRISTINE-2.6.32-279.5.1.el6_lustre.g8932d49.x86_64&lt;br/&gt;
Oct 20 09:25:50 gemini2 kernel: LNet: Added LNI 192.168.3.2@o2ib &lt;span class=&quot;error&quot;&gt;&amp;#91;8/256/0/180&amp;#93;&lt;/span&gt;&lt;/p&gt;


&lt;p&gt;gemini1 and gemini2 are the servers&lt;/p&gt;

&lt;p&gt;and this on the clients:&lt;/p&gt;

&lt;p&gt;Oct 20 09:57:53 virgo1 kernel: LNet: HW CPU cores: 16, npartitions: 4&lt;br/&gt;
Oct 20 09:57:53 virgo1 kernel: alg: No test for crc32 (crc32-table)&lt;br/&gt;
Oct 20 09:57:53 virgo1 kernel: alg: No test for adler32 (adler32-zlib)&lt;br/&gt;
Oct 20 09:57:53 virgo1 kernel: alg: No test for crc32 (crc32-pclmul)&lt;br/&gt;
Oct 20 09:57:57 virgo1 kernel: padlock: VIA PadLock Hash Engine not detected.&lt;br/&gt;
Oct 20 09:57:57 virgo1 modprobe: FATAL: Error inserting padlock_sha (/lib/modules/2.6.32-279.5.1.el6.x86_64/kernel/drivers/crypto/padlock-sha.ko): No such device&lt;br/&gt;
Oct 20 09:58:06 virgo1 kernel: Lustre: Lustre: Build Version: 2.3.0-RC2--PRISTINE-2.6.32-279.5.1.el6.x86_64&lt;br/&gt;
Oct 20 09:58:06 virgo1 kernel: LNet: Added LNI 192.168.3.14@o2ib &lt;span class=&quot;error&quot;&gt;&amp;#91;8/256/0/180&amp;#93;&lt;/span&gt;&lt;/p&gt;
</comment>
                            <comment id="46821" author="liang" created="Sun, 21 Oct 2012 02:35:33 +0000"  >&lt;p&gt;yes, these should be fine, there are two concerns:&lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;I&apos;m not sure 6 clients can generate enough load on MDS, because SMP feature is supposed to improve aggregation performance for server side, not for each single client&lt;/li&gt;
	&lt;li&gt;1 OSS is probably not very enough, because load on OSS could be high, especially if file stripecount &amp;gt; 1.&lt;/li&gt;
&lt;/ul&gt;


&lt;p&gt;But it&apos;s worth a try to see what you can get.&lt;br/&gt;
Another suggestion is: please use big journal size, for example: 8G external journal for MDS(it could be better to put journal on a fast device like SSD), or at least 2G internal journal for both MDS and OSS.&lt;/p&gt;</comment>
                            <comment id="46823" author="lustre.support" created="Sun, 21 Oct 2012 10:58:49 +0000"  >&lt;p&gt;Hi,&lt;br/&gt;
I have a 100GB SSD on the MDS. Thanks.&lt;/p&gt;</comment>
                            <comment id="46861" author="lustre.support" created="Tue, 23 Oct 2012 11:13:59 +0000"  >&lt;p&gt;Hi,&lt;br/&gt;
I try to test without the smp affinity. To disable this feature is sufficient to put in the modprobe.conf &quot;options libcfs cpu_npartitions=0&quot; ?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;</comment>
                            <comment id="46864" author="liang" created="Tue, 23 Oct 2012 17:09:32 +0000"  >&lt;p&gt;well, we can&apos;t totally disable SMP feature on 2.3, because there are many locking &amp;amp; threading improvements even CPU partition is disabled. So the only way to test w/o SMP feature is running 2.2 or 2.1, but still, there will be some &quot;noise&quot; because there are many other changes in 2.3.&lt;/p&gt;
</comment>
                            <comment id="46877" author="lustre.support" created="Wed, 24 Oct 2012 17:09:25 +0000"  >&lt;p&gt;Thank you. Could you close this issue?&lt;/p&gt;</comment>
                            <comment id="46879" author="pjones" created="Wed, 24 Oct 2012 17:16:30 +0000"  >&lt;p&gt;ok thanks!&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzvalj:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>5252</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>