<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:19:45 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-15604] sanity-lnet test_226: failed to unload modules</title>
                <link>https://jira.whamcloud.com/browse/LU-15604</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;This issue was created by maloo for Chris Horn &amp;lt;chris.horn@hpe.com&amp;gt;&lt;/p&gt;

&lt;p&gt;This issue relates to the following test suite run: &lt;a href=&quot;https://testing.whamcloud.com/test_sets/b3098e8b-70ca-4980-a955-159ee6537597&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/b3098e8b-70ca-4980-a955-159ee6537597&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;test_226 failed with the following error:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;failed to unload modules
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;This is a new test case added by the patch. I don&apos;t see anything obviously wrong, so I&apos;m not sure why it failed in this way. I repeated the test 100 times in my VM environment and I did not see this issue.&lt;/p&gt;





&lt;p&gt;VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV&lt;br/&gt;
sanity-lnet test_226 - failed to unload modules&lt;/p&gt;</description>
                <environment></environment>
        <key id="68896">LU-15604</key>
            <summary>sanity-lnet test_226: failed to unload modules</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="6" iconUrl="https://jira.whamcloud.com/images/icons/statuses/closed.png" description="The issue is considered finished, the resolution is correct. Issues which are closed can be reopened.">Closed</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="2">Won&apos;t Fix</resolution>
                                        <assignee username="hornc">Chris Horn</assignee>
                                    <reporter username="maloo">Maloo</reporter>
                        <labels>
                    </labels>
                <created>Mon, 28 Feb 2022 16:58:20 +0000</created>
                <updated>Thu, 24 Mar 2022 15:49:55 +0000</updated>
                            <resolved>Thu, 24 Mar 2022 15:49:55 +0000</resolved>
                                                                        <due></due>
                            <votes>0</votes>
                                    <watches>2</watches>
                                                                            <comments>
                            <comment id="329914" author="hornc" created="Tue, 22 Mar 2022 21:00:27 +0000"  >&lt;p&gt;I was testing &lt;a href=&quot;https://review.whamcloud.com/#/c/46727/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/#/c/46727/&lt;/a&gt; by starting/stopping LNet in a loop while pinging the test node from 3 other peers:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[root@s-lmo-gaz38a ~]# while true; do for i in {1..20}; do lctl ping 172.18.2.6@o2ib10; done; echo sleep 5; sleep 5; done
...

[root@s-lmo-gaz38b ~]# while true; do for i in {1..2}; do lnetctl discover --force 172.18.2.6@o2ib10; done; lnetctl peer del --prim 172.18.2.6@o2ib; echo sleep 5; sleep 5; done
...

cassini-hosta:~ # while true; do for i in {1..20}; do lnetctl discover --force 172.18.2.6@o2ib; done; lnetctl peer del --prim 172.18.2.6@o2ib; echo sleep 5; sleep 5; done
...

cassini-hostb:~ # while true; do /bin/start.sh2 ; lustre_rmmod ; echo sleep 5; sleep 5; done
sleep 5
sleep 5
sleep 5
sleep 5
sleep 5
sleep 5
...
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;After a while, I opened some additional terminals on the test node and ran some lctl and lnetctl commands in a loop:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;cassini-hostb:~ # while true; do lctl list_nids 2&amp;gt;/dev/null; done
...
cassini-hostb:~ # while true; do lnetctl peer show 2&amp;gt;/dev/null; done
...
cassini-hostb:~ # while true; do lnetctl net show 2&amp;gt;/dev/null; done
...
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;At that point, I started to see some rmmod failures like in this ticket:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;sleep 5
sleep 5
rmmod: ERROR: Module libcfs is in use
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;So I think that this test failure is most likely just from some other lctl or lnetctl process that is running at same time as rmmod. We&apos;re executing unload_modules_local via do_rpc_nodes() which is going to invoke &quot;lctl mark&quot; on all the test nodes. This is most likely what is causing the occasional rmmod failure. So it seems that it is not safe to call unload_modules_local() via do_rpc_nodes().&lt;/p&gt;</comment>
                            <comment id="330124" author="hornc" created="Thu, 24 Mar 2022 15:49:55 +0000"  >&lt;p&gt;The tests were modified to call lustre_rmmod via do_nodes() instead of the using unload_modules_locals() via do_rpc_nodes(). With that change, the test now passes consistently. &lt;a href=&quot;https://testing.whamcloud.com/test_sessions/81932feb-6028-40fb-868d-f6a21485811c&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sessions/81932feb-6028-40fb-868d-f6a21485811c&lt;/a&gt; 100/100&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i02jlr:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>