<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:58:46 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-6270] lustre-rsync-test test 6: &#8220;BUG: soft lockup - CPU#0 stuck for 67s!&#8221;</title>
                <link>https://jira.whamcloud.com/browse/LU-6270</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;lustre-rsync-test test 6 failed with &apos;test failed to respond and timed out&apos;  and the test suite could not progress beyond test 6. Test results are at &lt;a href=&quot;https://testing.hpdd.intel.com/test_sets/4e4a5590-b9b9-11e4-ba8b-5254006e85c2&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.hpdd.intel.com/test_sets/4e4a5590-b9b9-11e4-ba8b-5254006e85c2&lt;/a&gt; . &lt;/p&gt;

&lt;p&gt;The client test_log is empty or just missing. The last entries in the suite_stdout are: &lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;07:21:25:== lustre-rsync-test test 6: lustre_rsync large no of hard links == 23:21:22 (1424503282)
07:21:25:CMD: onyx-34vm7 lctl --device lustre-MDT0000 changelog_register -n
07:21:25:lustre-MDT0000: Registered changelog user cl12
07:21:26:CMD: onyx-34vm7 lctl get_param -n mdd.lustre-MDT0000.changelog_users
07:21:58:Lustre filesystem: lustre
07:21:58:MDT device: lustre-MDT0000
07:21:58:Source: /mnt/lustre
07:21:58:Target: /tmp/target
07:21:58:Target: /tmp/target2
07:21:58:Statuslog: /tmp/lustre_rsync.log
07:21:58:Changelog registration: cl12
07:21:58:Starting changelog record: 0
07:21:58:Clear changelog after use: no
07:21:58:Errors: 0
07:21:58:lustre_rsync took 24 seconds
07:21:58:Changelog records consumed: 128
07:21:58:CMD: onyx-34vm7 lctl --device lustre-MDT0000 changelog_deregister cl12
07:21:58:lustre-MDT0000: Deregistered changelog user &apos;cl12&apos;
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;


&lt;p&gt;From the client console, we see the soft lockup error with call trace:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;07:23:16:Lustre: DEBUG MARKER: == lustre-rsync-test test 6: lustre_rsync large no of hard links == 23:21:22 (1424503282)
07:23:16:BUG: soft lockup - CPU#0 stuck for 67s! [ll_sa_26763:26764]
07:23:16:Modules linked in: lustre(U) obdecho(U) mgc(U) lov(U) osc(U) mdc(U) lmv(U) fid(U) fld(U) ptlrpc(U) obdclass(U) lvfs(U) ksocklnd(U) lnet(U) libcfs(U) ext2 sha512_generic sha256_generic nfs fscache nfsd lockd nfs_acl auth_rpcgss sunrpc exportfs autofs4 ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 ib_sa ib_mad ib_core microcode virtio_balloon 8139too 8139cp mii i2c_piix4 i2c_core ext3 jbd mbcache virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: libcfs]
07:23:16:CPU 0 
&#8230;
07:23:16:irq 11: nobody cared (try booting with the &quot;irqpoll&quot; option)
07:23:16:Pid: 1, comm: swapper Not tainted 2.6.32-431.29.2.el6.x86_64 #1
07:23:16:Call Trace:
07:23:16: &amp;lt;IRQ&amp;gt;  [&amp;lt;ffffffff810e8d8b&amp;gt;] ? __report_bad_irq+0x2b/0xa0
07:23:16: [&amp;lt;ffffffff810e8f8c&amp;gt;] ? note_interrupt+0x18c/0x1d0
07:23:16: [&amp;lt;ffffffff810e972d&amp;gt;] ? handle_fasteoi_irq+0xcd/0xf0
07:23:16: [&amp;lt;ffffffff8100faf9&amp;gt;] ? handle_irq+0x49/0xa0
07:23:16: [&amp;lt;ffffffff8153251c&amp;gt;] ? do_IRQ+0x6c/0xf0
07:23:16: [&amp;lt;ffffffff8100b9d3&amp;gt;] ? ret_from_intr+0x0/0x11
07:23:16: [&amp;lt;ffffffff8107a5a3&amp;gt;] ? __do_softirq+0x73/0x1e0
07:23:16: [&amp;lt;ffffffff8100c30c&amp;gt;] ? call_softirq+0x1c/0x30
07:23:16: [&amp;lt;ffffffff8100fa75&amp;gt;] ? do_softirq+0x65/0xa0
07:23:16: [&amp;lt;ffffffff8107a4a5&amp;gt;] ? irq_exit+0x85/0x90
07:23:16: [&amp;lt;ffffffff81532525&amp;gt;] ? do_IRQ+0x75/0xf0
07:23:16: [&amp;lt;ffffffff8100b9d3&amp;gt;] ? ret_from_intr+0x0/0x11
07:23:16: &amp;lt;EOI&amp;gt;  [&amp;lt;ffffffff8152b897&amp;gt;] ? _spin_unlock_irqrestore+0x17/0x20
07:23:16: [&amp;lt;ffffffff810e7b83&amp;gt;] ? __setup_irq+0x1b3/0x3c0
07:23:16: [&amp;lt;ffffffff813c4bb0&amp;gt;] ? usb_hcd_irq+0x0/0x90
07:23:16: [&amp;lt;ffffffff810e8553&amp;gt;] ? request_threaded_irq+0x133/0x230
07:23:16: [&amp;lt;ffffffff813c6b8e&amp;gt;] ? usb_add_hcd+0x50e/0x890
07:23:16: [&amp;lt;ffffffff813d73da&amp;gt;] ? usb_hcd_pci_probe+0x16a/0x3e0
07:23:16: [&amp;lt;ffffffff812a5877&amp;gt;] ? local_pci_probe+0x17/0x20
07:23:16: [&amp;lt;ffffffff812a6a61&amp;gt;] ? pci_device_probe+0x101/0x120
07:23:16: [&amp;lt;ffffffff8136e232&amp;gt;] ? driver_sysfs_add+0x62/0x90
07:23:16: [&amp;lt;ffffffff8136e3d0&amp;gt;] ? driver_probe_device+0xa0/0x2a0
07:23:16: [&amp;lt;ffffffff8136e67b&amp;gt;] ? __driver_attach+0xab/0xb0
07:23:16: [&amp;lt;ffffffff8136e5d0&amp;gt;] ? __driver_attach+0x0/0xb0
07:23:16: [&amp;lt;ffffffff8136d984&amp;gt;] ? bus_for_each_dev+0x64/0x90
07:23:16: [&amp;lt;ffffffff8136e16e&amp;gt;] ? driver_attach+0x1e/0x20
07:23:16: [&amp;lt;ffffffff8136d1b8&amp;gt;] ? bus_add_driver+0x1e8/0x2b0
07:23:16: [&amp;lt;ffffffff8136e9c6&amp;gt;] ? driver_register+0x76/0x140
07:23:16: [&amp;lt;ffffffff812a6cc6&amp;gt;] ? __pci_register_driver+0x56/0xd0
07:23:16: [&amp;lt;ffffffff81c62471&amp;gt;] ? uhci_hcd_init+0x0/0xca
07:23:16: [&amp;lt;ffffffff81c62471&amp;gt;] ? uhci_hcd_init+0x0/0xca
07:23:16: [&amp;lt;ffffffff81c624fd&amp;gt;] ? uhci_hcd_init+0x8c/0xca
07:23:16: [&amp;lt;ffffffff81c62442&amp;gt;] ? ohci_hcd_mod_init+0x61/0x90
07:23:16: [&amp;lt;ffffffff8100204c&amp;gt;] ? do_one_initcall+0x3c/0x1d0
07:23:16: [&amp;lt;ffffffff81c268e4&amp;gt;] ? kernel_init+0x29b/0x2f7
07:23:16: [&amp;lt;ffffffff8100c20a&amp;gt;] ? child_rip+0xa/0x20
07:23:16: [&amp;lt;ffffffff81c26649&amp;gt;] ? kernel_init+0x0/0x2f7
07:23:16: [&amp;lt;ffffffff8100c200&amp;gt;] ? child_rip+0x0/0x20
07:23:16:handlers:
07:23:16:[&amp;lt;ffffffff813c4bb0&amp;gt;] (usb_hcd_irq+0x0/0x90)
07:23:16:Disabling IRQ #11
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;This is probably caused by the client problem, after about an hour, the OST dmesg log has: &lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Lustre: DEBUG MARKER: == lustre-rsync-test test 6: lustre_rsync large no of hard links == 23:21:22 (1424503282)
Lustre: lustre-OST0000: haven&apos;t heard from client 9b13a821-4bea-aec2-907b-0135ad711c6b (at 10.2.4.130@tcp) in 50 seconds. I think it&apos;s dead, and I am evicting it. exp ffff880073e07000, cur 1424503359 expire 1424503329 last 1424503309
Lustre: Skipped 6 previous similar messages
Lustre: lustre-OST0004: haven&apos;t heard from client 9b13a821-4bea-aec2-907b-0135ad711c6b (at 10.2.4.130@tcp) in 55 seconds. I think it&apos;s dead, and I am evicting it. exp ffff880066c78400, cur 1424503364 expire 1424503334 last 1424503309
Lustre: Skipped 3 previous similar messages
LustreError: 9994:0:(ofd_grant.c:163:ofd_grant_sanity_check()) ofd_statfs: tot_granted 2379776 != fo_tot_granted 100167680
SysRq : Show State
  task                        PC stack   pid father
init          S 0000000000000001     0     1      0 0x00000000
 ffff88007e4c5908 0000000000000086 ffff88007e4c58d0 ffff88007e4c58cc
 0000000000000000 ffff88007f823240 ffff880002216880 0000000000000400
 ffff88007e4c3ab8 ffff88007e4c5fd8 000000000000fbc8 ffff88007e4c3ab8
Call Trace:
 [&amp;lt;ffffffff8100bb8e&amp;gt;] ? apic_timer_interrupt+0xe/0x20
 [&amp;lt;ffffffff8152b21d&amp;gt;] schedule_hrtimeout_range+0x13d/0x160
 [&amp;lt;ffffffff8109b346&amp;gt;] ? add_wait_queue+0x46/0x60
 [&amp;lt;ffffffff811a0ca5&amp;gt;] ? __pollwait+0x75/0xf0
 [&amp;lt;ffffffff811a0ca5&amp;gt;] ? __pollwait+0x75/0xf0
 [&amp;lt;ffffffff811a0b39&amp;gt;] poll_schedule_timeout+0x39/0x60
 [&amp;lt;ffffffff811a1bfc&amp;gt;] do_select+0x57c/0x6c0
 [&amp;lt;ffffffff8100bb8e&amp;gt;] ? apic_timer_interrupt+0xe/0x20
 [&amp;lt;ffffffff811a0c30&amp;gt;] ? __pollwait+0x0/0xf0
 [&amp;lt;ffffffff811a0d20&amp;gt;] ? pollwake+0x0/0x60
 [&amp;lt;ffffffff811a0d20&amp;gt;] ? pollwake+0x0/0x60
 [&amp;lt;ffffffff811a0d20&amp;gt;] ? pollwake+0x0/0x60
 [&amp;lt;ffffffff811a0d20&amp;gt;] ? pollwake+0x0/0x60
 [&amp;lt;ffffffff811a0d20&amp;gt;] ? pollwake+0x0/0x60
 [&amp;lt;ffffffff811a4e25&amp;gt;] ? d_lookup+0x35/0x60
 [&amp;lt;ffffffff8152ad0e&amp;gt;] ? mutex_lock+0x1e/0x50
 [&amp;lt;ffffffff81194787&amp;gt;] ? pipe_read+0x2a7/0x4e0
 [&amp;lt;ffffffff811a1eca&amp;gt;] core_sys_select+0x18a/0x2c0
 [&amp;lt;ffffffff81227806&amp;gt;] ? security_task_wait+0x16/0x20
 [&amp;lt;ffffffff8107557d&amp;gt;] ? wait_consider_task+0x9d/0xb20
 [&amp;lt;ffffffff8109b39c&amp;gt;] ? remove_wait_queue+0x3c/0x50
 [&amp;lt;ffffffff8107617f&amp;gt;] ? do_wait+0x17f/0x240
 [&amp;lt;ffffffff8103f9d8&amp;gt;] ? pvclock_clocksource_read+0x58/0xd0
 [&amp;lt;ffffffff811a2257&amp;gt;] sys_select+0x47/0x110
 [&amp;lt;ffffffff81098091&amp;gt;] ? posix_ktime_get_ts+0x11/0x20
 [&amp;lt;ffffffff8100b072&amp;gt;] system_call_fastpath+0x16/0x1b
&#8230;
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</description>
                <environment>autotest</environment>
        <key id="28803">LU-6270</key>
            <summary>lustre-rsync-test test 6: &#8220;BUG: soft lockup - CPU#0 stuck for 67s!&#8221;</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="6" iconUrl="https://jira.whamcloud.com/images/icons/statuses/closed.png" description="The issue is considered finished, the resolution is correct. Issues which are closed can be reopened.">Closed</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="3">Duplicate</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="jamesanunez">James Nunez</reporter>
                        <labels>
                    </labels>
                <created>Mon, 23 Feb 2015 17:15:30 +0000</created>
                <updated>Tue, 24 Feb 2015 05:13:47 +0000</updated>
                            <resolved>Tue, 24 Feb 2015 05:13:22 +0000</resolved>
                                    <version>Lustre 2.5.3</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>2</watches>
                                                                            <comments>
                            <comment id="107745" author="yujian" created="Tue, 24 Feb 2015 05:13:22 +0000"  >&lt;p&gt;This is a duplicate of &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-4410&quot; title=&quot;sanityn test 40a: BUG: soft lockup - CPU#0 stuck for 67s! [ptlrpcd_0:2892]&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-4410&quot;&gt;&lt;del&gt;LU-4410&lt;/del&gt;&lt;/a&gt; .&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                            <outwardlinks description="duplicates">
                                        <issuelink>
            <issuekey id="22553">LU-4410</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzx6v3:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>17579</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>