<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:23:59 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-16098] LustreError: 6866:0:(osp_sync.c:644:osp_sync_send_new_rpc()) ASSERTION( atomic_read(&amp;d-&gt;opd_sync_rpcs_in_flight) &lt;= d-&gt;opd_sync_max_rpcs_in_flight ) failed</title>
                <link>https://jira.whamcloud.com/browse/LU-16098</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;server got crash in the following test case. &lt;br/&gt;
idea is that create, read and delete files by mdtest. After finished mdtest, increase osp.&amp;#42;.max_rpcs_in_flight and osp.&amp;#42;.max_rpcs_in_progress to speedup background object deletion process, then changed them back to default value and repeat mdtest.&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;for i in `seq 10`; do
	salloc -p 40n -N 40 -n 640 --ntasks-per-node=16 /usr/mpi/gcc/openmpi-4.1.4rc1/bin/mpirun /work/tools/bin/mdtest -n 50000 
-t -P -G=-1573035764 -d /exafs/home/sihara/mdt0@/exafs/home/sihara/mdt1@/exafs/home/sihara/mdt2@/exafs/home/sihara/mdt3 -x /exafs
/home/sihara/stonewall -C -Y -E -u -F -i 1 -r -v

	clush -w root@ai400x2-1-vm[1-4] lctl set_param osp.*.max_rpcs_in_flight=128 osp.*.max_rpcs_in_progress=32768 
	sleep 120
	clush -w root@ai400x2-1-vm[1-4] lctl set_param osp.*.max_rpcs_in_flight=8 osp.*.max_rpcs_in_progress=4096 
done
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Here is what server got crash.&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[40284.439365] LustreError: 6866:0:(osp_sync.c:644:osp_sync_send_new_rpc()) ASSERTION( atomic_read(&amp;amp;d-&amp;gt;opd_sync_rpcs_in_flight) &amp;lt;= d-&amp;gt;opd_sync_max_rpcs_in_flight ) failed: 
[40284.444545] LustreError: 6866:0:(osp_sync.c:644:osp_sync_send_new_rpc()) LBUG
[40284.446786] Pid: 6866, comm: osp-syn-4-0 3.10.0-1062.18.1.el7_lustre.ddn12.x86_64 #1 SMP Wed Dec 23 06:55:33 PST 2020
[40284.446787] Call Trace:
[40284.446808] [&amp;lt;0&amp;gt;] libcfs_call_trace+0x90/0xf0 [libcfs]
[40284.446813] [&amp;lt;0&amp;gt;] lbug_with_loc+0x4c/0xa0 [libcfs]
[40284.446821] [&amp;lt;0&amp;gt;] osp_sync_send_new_rpc+0xed/0xf0 [osp]
[40284.446827] [&amp;lt;0&amp;gt;] osp_sync_process_record+0x3e9/0x1040 [osp]
[40284.446833] [&amp;lt;0&amp;gt;] osp_sync_process_queues+0x564/0xde0 [osp]
[40284.446863] [&amp;lt;0&amp;gt;] llog_process_thread+0xa2a/0x1b20 [obdclass]
[40284.446880] [&amp;lt;0&amp;gt;] llog_process_or_fork+0xd9/0x560 [obdclass]
[40284.446908] [&amp;lt;0&amp;gt;] llog_cat_process_cb+0x2c1/0x2d0 [obdclass]
[40284.446925] [&amp;lt;0&amp;gt;] llog_process_thread+0xa2a/0x1b20 [obdclass]
[40284.446941] [&amp;lt;0&amp;gt;] llog_process_or_fork+0xd9/0x560 [obdclass]
[40284.446958] [&amp;lt;0&amp;gt;] llog_cat_process_or_fork+0x201/0x3a0 [obdclass]
[40284.446975] [&amp;lt;0&amp;gt;] llog_cat_process+0x2e/0x30 [obdclass]
[40284.446981] [&amp;lt;0&amp;gt;] osp_sync_thread+0x19e/0xc30 [osp]
[40284.446994] [&amp;lt;0&amp;gt;] kthread+0xd1/0xe0
[40284.446998] [&amp;lt;0&amp;gt;] ret_from_fork_nospec_begin+0x7/0x21
[40284.447016] [&amp;lt;0&amp;gt;] 0xfffffffffffffffe
[40284.447027] Kernel panic - not syncing: LBUG
[40284.448636] CPU: 17 PID: 6866 Comm: osp-syn-4-0 Kdump: loaded Tainted: G           OE  ------------ T 3.10.0-1062.18.1.el7_lustre.ddn12.x86_64 #1
[40284.452338] Hardware name: DDN SFA400NVX2E, BIOS rel-1.14.0-0-g155821a1990b-prebuilt.qemu.org 04/01/2014
[40284.454559] Call Trace:
[40284.455935]  [&amp;lt;ffffffff95f7b416&amp;gt;] dump_stack+0x19/0x1b
[40284.457623]  [&amp;lt;ffffffff95f74a0b&amp;gt;] panic+0xe8/0x21f
[40284.459248]  [&amp;lt;ffffffffc095658b&amp;gt;] lbug_with_loc+0x9b/0xa0 [libcfs]
[40284.461041]  [&amp;lt;ffffffffc17392ed&amp;gt;] osp_sync_send_new_rpc+0xed/0xf0 [osp]
[40284.462872]  [&amp;lt;ffffffffc173e309&amp;gt;] osp_sync_process_record+0x3e9/0x1040 [osp]
[40284.464779]  [&amp;lt;ffffffffc0da0360&amp;gt;] ? lustre_swab_niobuf_remote+0x30/0x30 [ptlrpc]
[40284.466678]  [&amp;lt;ffffffffc173f4c4&amp;gt;] osp_sync_process_queues+0x564/0xde0 [osp]
[40284.468511]  [&amp;lt;ffffffff958c7410&amp;gt;] ? wake_up_atomic_t+0x30/0x30
[40284.470211]  [&amp;lt;ffffffffc0a50efa&amp;gt;] llog_process_thread+0xa2a/0x1b20 [obdclass]
[40284.472066]  [&amp;lt;ffffffffc0a56a58&amp;gt;] ? llog_cat_id2handle+0x3b8/0x670 [obdclass]
[40284.473894]  [&amp;lt;ffffffffc173ef60&amp;gt;] ? osp_sync_process_record+0x1040/0x1040 [osp]
[40284.475757]  [&amp;lt;ffffffffc0a520c9&amp;gt;] llog_process_or_fork+0xd9/0x560 [obdclass]
[40284.477581]  [&amp;lt;ffffffffc0a56e31&amp;gt;] ? llog_cat_process_common+0x121/0x470 [obdclass]
[40284.479457]  [&amp;lt;ffffffffc0a580d1&amp;gt;] llog_cat_process_cb+0x2c1/0x2d0 [obdclass]
[40284.481249]  [&amp;lt;ffffffffc0a50efa&amp;gt;] llog_process_thread+0xa2a/0x1b20 [obdclass]
[40284.483040]  [&amp;lt;ffffffff9596eafd&amp;gt;] ? tracing_record_cmdline+0x1d/0x120
[40284.484755]  [&amp;lt;ffffffffc0a57e10&amp;gt;] ? llog_cat_cancel_records+0x1d0/0x1d0 [obdclass]
[40284.486594]  [&amp;lt;ffffffffc0a520c9&amp;gt;] llog_process_or_fork+0xd9/0x560 [obdclass]
[40284.488336]  [&amp;lt;ffffffff958d7a0f&amp;gt;] ? ttwu_do_activate+0x6f/0x80
[40284.489939]  [&amp;lt;ffffffffc0a57e10&amp;gt;] ? llog_cat_cancel_records+0x1d0/0x1d0 [obdclass]
[40284.491736]  [&amp;lt;ffffffffc0a54341&amp;gt;] llog_cat_process_or_fork+0x201/0x3a0 [obdclass]
[40284.493493]  [&amp;lt;ffffffff958db612&amp;gt;] ? default_wake_function+0x12/0x20
[40284.495084]  [&amp;lt;ffffffff958d38c2&amp;gt;] ? __wake_up_common+0x82/0x120
[40284.496625]  [&amp;lt;ffffffffc173ef60&amp;gt;] ? osp_sync_process_record+0x1040/0x1040 [osp]
[40284.498322]  [&amp;lt;ffffffffc0a5450e&amp;gt;] llog_cat_process+0x2e/0x30 [obdclass]
[40284.499918]  [&amp;lt;ffffffffc173b90e&amp;gt;] osp_sync_thread+0x19e/0xc30 [osp]
[40284.501446]  [&amp;lt;ffffffff95f80e02&amp;gt;] ? __schedule+0x402/0x840
[40284.502884]  [&amp;lt;ffffffffc173b770&amp;gt;] ? osp_sync_process_committed+0xd70/0xd70 [osp]
[40284.504539]  [&amp;lt;ffffffff958c6321&amp;gt;] kthread+0xd1/0xe0
[40284.505870]  [&amp;lt;ffffffff958c6250&amp;gt;] ? insert_kthread_work+0x40/0x40
[40284.507344]  [&amp;lt;ffffffff95f8ed1d&amp;gt;] ret_from_fork_nospec_begin+0x7/0x21
[40284.508850]  [&amp;lt;ffffffff958c6250&amp;gt;] ? insert_kthread_work+0x40/0x40
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</description>
                <environment></environment>
        <key id="71891">LU-16098</key>
            <summary>LustreError: 6866:0:(osp_sync.c:644:osp_sync_send_new_rpc()) ASSERTION( atomic_read(&amp;d-&gt;opd_sync_rpcs_in_flight) &lt;= d-&gt;opd_sync_max_rpcs_in_flight ) failed</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="1" iconUrl="https://jira.whamcloud.com/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="default"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="sihara">Shuichi Ihara</reporter>
                        <labels>
                    </labels>
                <created>Tue, 16 Aug 2022 22:23:01 +0000</created>
                <updated>Thu, 23 Mar 2023 17:24:19 +0000</updated>
                                            <version>Lustre 2.15.2</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                                                <inwardlinks description="is related to">
                                                        </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i02xbb:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>