<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:11:26 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-893] system hang when running recovery-mds-scale FLAVOR=OSS</title>
                <link>https://jira.whamcloud.com/browse/LU-893</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;When running recovery-mds-scale with FLAVOR=OSS, quota enabled, and HARD failure mode, the console log shows that the network on one of the OSSs comes up, but after a while the node can no longer be accessed. After rebooting the node, it is usable again.&lt;/p&gt;

&lt;p&gt;==== Checking the clients loads AFTER  failover &amp;#8211; failure NOT OK&lt;br/&gt;
ost6 has failed over 1 times, and counting...&lt;br/&gt;
sleeping 421 seconds ... &lt;br/&gt;
==== Checking the clients loads BEFORE failover &amp;#8211; failure NOT OK     ELAPSED=179 DURATION=86400 PERIOD=600&lt;br/&gt;
Wait ost4 recovery complete before doing next failover ....&lt;br/&gt;
affected facets: ost1,ost2,ost3,ost4,ost5,ost6&lt;br/&gt;
client-12: *.lustre-OST0000.recovery_status status: INACTIVE&lt;br/&gt;
client-12: *.lustre-OST0001.recovery_status status: COMPLETE&lt;br/&gt;
client-12: *.lustre-OST0002.recovery_status status: INACTIVE&lt;br/&gt;
client-12: *.lustre-OST0003.recovery_status status: COMPLETE&lt;br/&gt;
client-12: *.lustre-OST0004.recovery_status status: INACTIVE&lt;br/&gt;
client-12: *.lustre-OST0005.recovery_status status: COMPLETE&lt;br/&gt;
Checking clients are in FULL state before doing next failover&lt;br/&gt;
client-13: osc.lustre-OST0000-osc-&lt;span class=&quot;error&quot;&gt;&amp;#91;^M&amp;#93;&lt;/span&gt;*.ost_server_uuid in FULL state after 0 sec&lt;br/&gt;
client-13: cannot run remote command on client-13,client-17,client-18 with &lt;br/&gt;
client-13: osc.lustre-OST0001-osc-&lt;span class=&quot;error&quot;&gt;&amp;#91;^M&amp;#93;&lt;/span&gt;*.ost_server_uuid in FULL state after 0 sec&lt;br/&gt;
client-18: osc.lustre-OST0000-osc-&lt;span class=&quot;error&quot;&gt;&amp;#91;^M&amp;#93;&lt;/span&gt;*.ost_server_uuid in FULL state after 0 sec&lt;br/&gt;
client-13: cannot run remote command on client-13,client-17,client-18 with &lt;br/&gt;
client-18: cannot run remote command on client-13,client-17,client-18 with &lt;br/&gt;
client-13: osc.lustre-OST0002-osc-&lt;span class=&quot;error&quot;&gt;&amp;#91;^M&amp;#93;&lt;/span&gt;*.ost_server_uuid in FULL state after 0 sec&lt;br/&gt;
client-18: osc.lustre-OST0001-osc-&lt;span class=&quot;error&quot;&gt;&amp;#91;^M&amp;#93;&lt;/span&gt;*.ost_server_uuid in FULL state after 0 sec&lt;br/&gt;
client-13: cannot run remote command on client-13,client-17,client-18 with &lt;br/&gt;
client-13: osc.lustre-OST0003-osc-&lt;span class=&quot;error&quot;&gt;&amp;#91;^M&amp;#93;&lt;/span&gt;*.ost_server_uuid in FULL state after 0 sec&lt;br/&gt;
client-18: cannot run remote command on client-13,client-17,client-18 with &lt;br/&gt;
client-13: cannot run remote command on client-13,client-17,client-18 with &lt;br/&gt;
client-17: osc.lustre-OST0000-osc-&lt;span class=&quot;error&quot;&gt;&amp;#91;^M&amp;#93;&lt;/span&gt;*.ost_server_uuid in FULL state after 0 sec&lt;br/&gt;
client-13: osc.lustre-OST0004-osc-&lt;span class=&quot;error&quot;&gt;&amp;#91;^M&amp;#93;&lt;/span&gt;*.ost_server_uuid in FULL state after 0 sec&lt;br/&gt;
client-18: osc.lustre-OST0002-osc-&lt;span class=&quot;error&quot;&gt;&amp;#91;^M&amp;#93;&lt;/span&gt;*.ost_server_uuid in FULL state after 0 sec&lt;br/&gt;
client-13: cannot run remote command on client-13,client-17,client-18 with &lt;br/&gt;
client-17: cannot run remote command on client-13,client-17,client-18 with &lt;br/&gt;
client-18: cannot run remote command on client-13,client-17,client-18 with &lt;br/&gt;
client-13: osc.lustre-OST0005-osc-&lt;span class=&quot;error&quot;&gt;&amp;#91;^M&amp;#93;&lt;/span&gt;*.ost_server_uuid in FULL state after 0 sec&lt;br/&gt;
client-17: osc.lustre-OST0001-osc-&lt;span class=&quot;error&quot;&gt;&amp;#91;^M&amp;#93;&lt;/span&gt;*.ost_server_uuid in FULL state after 0 sec&lt;br/&gt;
client-18: osc.lustre-OST0003-osc-&lt;span class=&quot;error&quot;&gt;&amp;#91;^M&amp;#93;&lt;/span&gt;*.ost_server_uuid in FULL state after 0 sec&lt;br/&gt;
client-13: cannot run remote command on client-13,client-17,client-18 with &lt;br/&gt;
client-17: cannot run remote command on client-13,client-17,client-18 with &lt;br/&gt;
client-18: cannot run remote command on client-13,client-17,client-18 with &lt;br/&gt;
client-17: osc.lustre-OST0002-osc-&lt;span class=&quot;error&quot;&gt;&amp;#91;^M&amp;#93;&lt;/span&gt;*.ost_server_uuid in FULL state after 0 sec&lt;br/&gt;
client-18: osc.lustre-OST0004-osc-&lt;span class=&quot;error&quot;&gt;&amp;#91;^M&amp;#93;&lt;/span&gt;*.ost_server_uuid in FULL state after 0 sec&lt;br/&gt;
client-17: cannot run remote command on client-13,client-17,client-18 with &lt;br/&gt;
client-18: cannot run remote command on client-13,client-17,client-18 with &lt;br/&gt;
client-17: osc.lustre-OST0003-osc-&lt;span class=&quot;error&quot;&gt;&amp;#91;^M&amp;#93;&lt;/span&gt;*.ost_server_uuid in FULL state after 0 sec&lt;br/&gt;
client-18: osc.lustre-OST0005-osc-&lt;span class=&quot;error&quot;&gt;&amp;#91;^M&amp;#93;&lt;/span&gt;*.ost_server_uuid in FULL state after 0 sec&lt;br/&gt;
client-17: cannot run remote command on client-13,client-17,client-18 with &lt;br/&gt;
client-18: cannot run remote command on client-13,client-17,client-18 with &lt;br/&gt;
client-17: osc.lustre-OST0004-osc-&lt;span class=&quot;error&quot;&gt;&amp;#91;^M&amp;#93;&lt;/span&gt;*.ost_server_uuid in FULL state after 0 sec&lt;br/&gt;
client-17: cannot run remote command on client-13,client-17,client-18 with &lt;br/&gt;
client-17: osc.lustre-OST0005-osc-&lt;span class=&quot;error&quot;&gt;&amp;#91;^M&amp;#93;&lt;/span&gt;*.ost_server_uuid in FULL state after 0 sec&lt;br/&gt;
client-17: cannot run remote command on client-13,client-17,client-18 with &lt;br/&gt;
Starting failover on ost4&lt;br/&gt;
Failing ost4 on node client-12&lt;br/&gt;
+ pm -h powerman --off client-12&lt;br/&gt;
Command completed successfully&lt;br/&gt;
affected facets: ost1,ost2,ost3,ost4,ost5,ost6&lt;br/&gt;
+ pm -h powerman --on client-12&lt;br/&gt;
Command completed successfully&lt;br/&gt;
Failover ost1 to fat-amd-2&lt;br/&gt;
Failover ost2 to fat-amd-2&lt;br/&gt;
Failover ost3 to fat-amd-2&lt;br/&gt;
Failover ost4 to fat-amd-2&lt;br/&gt;
Failover ost5 to fat-amd-2&lt;br/&gt;
Failover ost6 to fat-amd-2&lt;br/&gt;
15:04:41 (1322867081) waiting for fat-amd-2 network 900 secs ...&lt;br/&gt;
15:04:41 (1322867081) network interface is UP&lt;br/&gt;
Starting ost1:   /dev/disk/by-id/scsi-1IET_00020001 /mnt/ost1&lt;br/&gt;
fat-amd-2: debug=0xb3f0405&lt;br/&gt;
fat-amd-2: subsystem_debug=0xffb7efff&lt;br/&gt;
fat-amd-2: debug_mb=48&lt;br/&gt;
Started lustre-OST0000&lt;br/&gt;
Starting ost2:   /dev/disk/by-id/scsi-1IET_00030001 /mnt/ost2&lt;br/&gt;
-------------------------------------------------------------------&lt;/p&gt;

&lt;p&gt;PING fat-amd-2.lab.whamcloud.com (10.10.4.133) 56(84) bytes of data.&lt;br/&gt;
From brent.lab.whamcloud.com (10.10.0.1) icmp_seq=1 Destination Host Unreachable&lt;/p&gt;</description>
                <environment>lustre-master build #353 RHEL6-x86_64 for both server and client</environment>
        <key id="12589">LU-893</key>
            <summary>system hang when running recovery-mds-scale FLAVOR=OSS</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="5">Cannot Reproduce</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="sarah">Sarah Liu</reporter>
                        <labels>
                    </labels>
                <created>Fri, 2 Dec 2011 19:50:42 +0000</created>
                <updated>Thu, 3 Oct 2019 17:41:18 +0000</updated>
                            <resolved>Mon, 29 May 2017 02:53:11 +0000</resolved>
                                    <version>Lustre 2.2.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>1</watches>
                                                                            <comments>
                            <comment id="25344" author="green" created="Tue, 3 Jan 2012 08:23:35 +0000"  >&lt;p&gt;Is it possible to check the console of this node to see what happened to the network?&lt;/p&gt;

&lt;p&gt;Without a Maloo report, I guess it is also impossible to see the console logs.&lt;/p&gt;</comment>
                            <comment id="25356" author="sarah" created="Tue, 3 Jan 2012 13:30:36 +0000"  >&lt;p&gt;I will try to reproduce this bug to see if I can get more information, and I will keep you updated.&lt;/p&gt;</comment>
                            <comment id="25842" author="sarah" created="Wed, 4 Jan 2012 22:13:34 +0000"  >&lt;p&gt;I reran this test on &lt;a href=&quot;https://newbuild.whamcloud.com/job/lustre-master/376/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://newbuild.whamcloud.com/job/lustre-master/376/&lt;/a&gt; RHEL6. &lt;br/&gt;
1. When I used IB, I got this error on one of the OSSs:&lt;/p&gt;

&lt;p&gt;Loading ib_core.ko module&lt;br/&gt;
usb 5-3: New USB device found, idVendor=0557, idProduct=2221&lt;br/&gt;
usb 5-3: New USB device strings: Mfr=1, Product=2, SerialNumber=0&lt;br/&gt;
usb 5-3: Product: Hermon USB hidmouse Device&lt;br/&gt;
usb 5-3: Manufacturer: Winbond Electronics Corp&lt;br/&gt;
usb 5-3: configuration #1 chosen from 1 choice&lt;br/&gt;
Loading mlx4_core.ko module&lt;br/&gt;
input: Winbond Electronics Corp Hermon USB hidmouse Device as /devices/pci0000:00/0000:00:13.0/usb5/5-3/5-3:1.0/input/input3&lt;br/&gt;
generic-usb 0003:0557:2221.0001: input,hidraw0: USB HID v1.00 Mouse &lt;span class=&quot;error&quot;&gt;&amp;#91;Winbond Electronics Corp Hermon USB hidmouse Device&amp;#93;&lt;/span&gt; on usb-0000:00:13.0-3/input0&lt;br/&gt;
mlx4_core: Mellanox ConnectX core driver v0.01 (May 1, 2007)&lt;br/&gt;
mlx4_core: Initializing 0000:04:00.0&lt;br/&gt;
mlx4_core 0000:04:00.0: PCI INT A -&amp;gt; GSI 18 (level, low) -&amp;gt; IRQ 18&lt;br/&gt;
input: Winbond Electronics Corp Hermon USB hidmouse Device as /devices/pci0000:00/0000:00:13.0/usb5/5-3/5-3:1.1/input/input4&lt;br/&gt;
generic-usb 0003:0557:2221.0002: input,hidraw1: USB HID v1.00 Keyboard &lt;span class=&quot;error&quot;&gt;&amp;#91;Winbond Electronics Corp Hermon USB hidmouse Device&amp;#93;&lt;/span&gt; on usb-0000:00:13.0-3/input1&lt;br/&gt;
work_for_cpu invoked oom-killer: gfp_mask=0x80d0, order=0, oom_adj=0&lt;br/&gt;
work_for_cpu cpuset=/ mems_allowed=0&lt;br/&gt;
Pid: 116, comm: work_for_cpu Not tainted 2.6.32-131.17.1.el6_lustre.g2e85b73.x86_64 #1&lt;br/&gt;
Call Trace:&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810c00f1&amp;gt;&amp;#93;&lt;/span&gt; ? cpuset_print_task_mems_allowed+0x91/0xb0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff811102bb&amp;gt;&amp;#93;&lt;/span&gt; ? oom_kill_process+0xcb/0x2e0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81110880&amp;gt;&amp;#93;&lt;/span&gt; ? select_bad_process+0xd0/0x110&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81110918&amp;gt;&amp;#93;&lt;/span&gt; ? __out_of_memory+0x58/0xc0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81110b19&amp;gt;&amp;#93;&lt;/span&gt; ? out_of_memory+0x199/0x210&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81120262&amp;gt;&amp;#93;&lt;/span&gt; ? __alloc_pages_nodemask+0x812/0x8b0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81010e86&amp;gt;&amp;#93;&lt;/span&gt; ? dma_generic_alloc_coherent+0xa6/0x160&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00ace99&amp;gt;&amp;#93;&lt;/span&gt; ? mlx4_create_eq+0x139/0x6b0 &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00ad5f9&amp;gt;&amp;#93;&lt;/span&gt; ? mlx4_init_eq_table+0x1e9/0x560 &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00b24d0&amp;gt;&amp;#93;&lt;/span&gt; ? mlx4_setup_hca+0xa0/0x5c0 &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00b3045&amp;gt;&amp;#93;&lt;/span&gt; ? __mlx4_init_one+0x2f5/0x880 &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81088ce0&amp;gt;&amp;#93;&lt;/span&gt; ? do_work_for_cpu+0x0/0x30&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00b815f&amp;gt;&amp;#93;&lt;/span&gt; ? mlx4_init_one+0x42/0x47 &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81281087&amp;gt;&amp;#93;&lt;/span&gt; ? local_pci_probe+0x17/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81088cf8&amp;gt;&amp;#93;&lt;/span&gt; ? do_work_for_cpu+0x18/0x30&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8108de16&amp;gt;&amp;#93;&lt;/span&gt; ? kthread+0x96/0xa0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810886d0&amp;gt;&amp;#93;&lt;/span&gt; ? worker_thread+0x0/0x2a0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100c1ca&amp;gt;&amp;#93;&lt;/span&gt; ? child_rip+0xa/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8108dd80&amp;gt;&amp;#93;&lt;/span&gt; ? kthread+0x0/0xa0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100c1c0&amp;gt;&amp;#93;&lt;/span&gt; ? child_rip+0x0/0x20&lt;br/&gt;
Mem-Info:&lt;br/&gt;
Node 0 DMA per-cpu:&lt;br/&gt;
CPU    0: hi:    0, btch:   1 usd:   0&lt;br/&gt;
Node 0 DMA32 per-cpu:&lt;br/&gt;
CPU    0: hi:   42, btch:   7 usd:  18&lt;br/&gt;
active_anon:24 inactive_anon:32 isolated_anon:0&lt;br/&gt;
 active_file:288 inactive_file:0 isolated_file:0&lt;br/&gt;
 unevictable:2985 dirty:0 writeback:0 unstable:0&lt;br/&gt;
 free:403 slab_reclaimable:991 slab_unreclaimable:4105&lt;br/&gt;
 mapped:50 shmem:0 pagetables:9 bounce:0&lt;br/&gt;
Node 0 DMA free:188kB min:0kB low:0kB high:0kB active_anon:0kB inactive_anon:0kB active_file:0kB inactive_file:0kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:312kB mlocked:0kB dirty:0kB writeback:0kB mapped:0kB shmem:0kB slab_reclaimable:0kB slab_unreclaimable:0kB kernel_stack:0kB pagetables:0kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? yes&lt;br/&gt;
lowmem_reserve[]: 0 126 126 126&lt;br/&gt;
Node 0 DMA32 free:1424kB min:1436kB low:1792kB high:2152kB active_anon:96kB inactive_anon:128kB active_file:1152kB inactive_file:0kB unevictable:11940kB isolated(anon):0kB isolated(file):0kB present:129504kB mlocked:0kB dirty:0kB writeback:0kB mapped:200kB shmem:0kB slab_reclaimable:3964kB slab_unreclaimable:16420kB kernel_stack:304kB pagetables:36kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:3019 all_unreclaimable? no&lt;br/&gt;
lowmem_reserve[]: 0 0 0 0&lt;br/&gt;
Node 0 DMA: 1*4kB 1*8kB 1*16kB 1*32kB 0*64kB 1*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 188kB&lt;br/&gt;
Node 0 DMA32: 0*4kB 0*8kB 1*16kB 0*32kB 0*64kB 1*128kB 1*256kB 0*512kB 1*1024kB 0*2048kB 0*4096kB = 1424kB&lt;br/&gt;
3275 total pagecache pages&lt;br/&gt;
0 pages in swap cache&lt;br/&gt;
Swap cache stats: add 0, delete 0, find 0/0&lt;br/&gt;
Free swap  = 0kB&lt;br/&gt;
Total swap = 0kB&lt;br/&gt;
41195 pages RAM&lt;br/&gt;
12945 pages reserved&lt;br/&gt;
74 pages shared&lt;br/&gt;
20269 pages non-shared&lt;br/&gt;
Out of memory: kill process 107 (insmod) score 20 or a child&lt;br/&gt;
Killed process 107 (insmod) vsz:1296kB, anon-rss:192kB, file-rss:116kB&lt;br/&gt;
INFO: task insmod:107 blocked for more than 120 seconds.&lt;br/&gt;
&quot;echo 0 &amp;gt; /proc/sys/kernel/hung_task_timeout_secs&quot; disables this message.&lt;br/&gt;
insmod        D 0000000000000000     0   107      1 0x00100004&lt;br/&gt;
 ffff880009e7fb78 0000000000000082 ffff880009e7fb08 ffffffff8105055a&lt;br/&gt;
 ffff880009e7fb08 ffff8800095e4ab8 0000000000000000 ffff880002a15f80&lt;br/&gt;
 ffff880009d10638 ffff880009e7ffd8 000000000000f598 ffff880009d10638&lt;br/&gt;
Call Trace:&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8105055a&amp;gt;&amp;#93;&lt;/span&gt; ? enqueue_entity+0x13a/0x340&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81270ccc&amp;gt;&amp;#93;&lt;/span&gt; ? __bitmap_weight+0x8c/0xb0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dc035&amp;gt;&amp;#93;&lt;/span&gt; schedule_timeout+0x215/0x2e0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8105055a&amp;gt;&amp;#93;&lt;/span&gt; ? enqueue_entity+0x13a/0x340&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dbcb3&amp;gt;&amp;#93;&lt;/span&gt; wait_for_common+0x123/0x180&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8105dc20&amp;gt;&amp;#93;&lt;/span&gt; ? default_wake_function+0x0/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dbdcd&amp;gt;&amp;#93;&lt;/span&gt; wait_for_completion+0x1d/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81088a5e&amp;gt;&amp;#93;&lt;/span&gt; work_on_cpu+0xae/0xd0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81281070&amp;gt;&amp;#93;&lt;/span&gt; ? local_pci_probe+0x0/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dc6ae&amp;gt;&amp;#93;&lt;/span&gt; ? mutex_lock+0x1e/0x50&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8128223b&amp;gt;&amp;#93;&lt;/span&gt; pci_device_probe+0xcb/0x120&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133bb12&amp;gt;&amp;#93;&lt;/span&gt; ? driver_sysfs_add+0x62/0x90&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133bcb0&amp;gt;&amp;#93;&lt;/span&gt; driver_probe_device+0xa0/0x2a0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133bf5b&amp;gt;&amp;#93;&lt;/span&gt; __driver_attach+0xab/0xb0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133beb0&amp;gt;&amp;#93;&lt;/span&gt; ? __driver_attach+0x0/0xb0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133af14&amp;gt;&amp;#93;&lt;/span&gt; bus_for_each_dev+0x64/0x90&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133ba4e&amp;gt;&amp;#93;&lt;/span&gt; driver_attach+0x1e/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133b350&amp;gt;&amp;#93;&lt;/span&gt; bus_add_driver+0x200/0x300&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133c286&amp;gt;&amp;#93;&lt;/span&gt; driver_register+0x76/0x140&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810899a8&amp;gt;&amp;#93;&lt;/span&gt; ? __create_workqueue_key+0x1e8/0x280&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff812824d6&amp;gt;&amp;#93;&lt;/span&gt; __pci_register_driver+0x56/0xd0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00c3031&amp;gt;&amp;#93;&lt;/span&gt; ? mlx4_init+0x0/0xbf &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00c3031&amp;gt;&amp;#93;&lt;/span&gt; ? mlx4_init+0x0/0xbf &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00c30b2&amp;gt;&amp;#93;&lt;/span&gt; mlx4_init+0x81/0xbf &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100204c&amp;gt;&amp;#93;&lt;/span&gt; do_one_initcall+0x3c/0x1d0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810aca7f&amp;gt;&amp;#93;&lt;/span&gt; sys_init_module+0xdf/0x250&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100b172&amp;gt;&amp;#93;&lt;/span&gt; system_call_fastpath+0x16/0x1b&lt;br/&gt;
INFO: task insmod:107 blocked for more than 120 seconds.&lt;br/&gt;
&quot;echo 0 &amp;gt; /proc/sys/kernel/hung_task_timeout_secs&quot; disables this message.&lt;br/&gt;
insmod        D 0000000000000000     0   107      1 0x00100004&lt;br/&gt;
 ffff880009e7fb78 0000000000000082 ffff880009e7fb08 ffffffff8105055a&lt;br/&gt;
 ffff880009e7fb08 ffff8800095e4ab8 0000000000000000 ffff880002a15f80&lt;br/&gt;
 ffff880009d10638 ffff880009e7ffd8 000000000000f598 ffff880009d10638&lt;br/&gt;
Call Trace:&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8105055a&amp;gt;&amp;#93;&lt;/span&gt; ? enqueue_entity+0x13a/0x340&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81270ccc&amp;gt;&amp;#93;&lt;/span&gt; ? __bitmap_weight+0x8c/0xb0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dc035&amp;gt;&amp;#93;&lt;/span&gt; schedule_timeout+0x215/0x2e0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8105055a&amp;gt;&amp;#93;&lt;/span&gt; ? enqueue_entity+0x13a/0x340&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dbcb3&amp;gt;&amp;#93;&lt;/span&gt; wait_for_common+0x123/0x180&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8105dc20&amp;gt;&amp;#93;&lt;/span&gt; ? default_wake_function+0x0/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dbdcd&amp;gt;&amp;#93;&lt;/span&gt; wait_for_completion+0x1d/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81088a5e&amp;gt;&amp;#93;&lt;/span&gt; work_on_cpu+0xae/0xd0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81281070&amp;gt;&amp;#93;&lt;/span&gt; ? local_pci_probe+0x0/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dc6ae&amp;gt;&amp;#93;&lt;/span&gt; ? mutex_lock+0x1e/0x50&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8128223b&amp;gt;&amp;#93;&lt;/span&gt; pci_device_probe+0xcb/0x120&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133bb12&amp;gt;&amp;#93;&lt;/span&gt; ? driver_sysfs_add+0x62/0x90&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133bcb0&amp;gt;&amp;#93;&lt;/span&gt; driver_probe_device+0xa0/0x2a0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133bf5b&amp;gt;&amp;#93;&lt;/span&gt; __driver_attach+0xab/0xb0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133beb0&amp;gt;&amp;#93;&lt;/span&gt; ? __driver_attach+0x0/0xb0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133af14&amp;gt;&amp;#93;&lt;/span&gt; bus_for_each_dev+0x64/0x90&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133ba4e&amp;gt;&amp;#93;&lt;/span&gt; driver_attach+0x1e/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133b350&amp;gt;&amp;#93;&lt;/span&gt; bus_add_driver+0x200/0x300&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133c286&amp;gt;&amp;#93;&lt;/span&gt; driver_register+0x76/0x140&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810899a8&amp;gt;&amp;#93;&lt;/span&gt; ? __create_workqueue_key+0x1e8/0x280&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff812824d6&amp;gt;&amp;#93;&lt;/span&gt; __pci_register_driver+0x56/0xd0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00c3031&amp;gt;&amp;#93;&lt;/span&gt; ? mlx4_init+0x0/0xbf &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00c3031&amp;gt;&amp;#93;&lt;/span&gt; ? mlx4_init+0x0/0xbf &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00c30b2&amp;gt;&amp;#93;&lt;/span&gt; mlx4_init+0x81/0xbf &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100204c&amp;gt;&amp;#93;&lt;/span&gt; do_one_initcall+0x3c/0x1d0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810aca7f&amp;gt;&amp;#93;&lt;/span&gt; sys_init_module+0xdf/0x250&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100b172&amp;gt;&amp;#93;&lt;/span&gt; system_call_fastpath+0x16/0x1b&lt;br/&gt;
INFO: task insmod:107 blocked for more than 120 seconds.&lt;br/&gt;
&quot;echo 0 &amp;gt; /proc/sys/kernel/hung_task_timeout_secs&quot; disables this message.&lt;br/&gt;
insmod        D 0000000000000000     0   107      1 0x00100004&lt;br/&gt;
 ffff880009e7fb78 0000000000000082 ffff880009e7fb08 ffffffff8105055a&lt;br/&gt;
 ffff880009e7fb08 ffff8800095e4ab8 0000000000000000 ffff880002a15f80&lt;br/&gt;
 ffff880009d10638 ffff880009e7ffd8 000000000000f598 ffff880009d10638&lt;br/&gt;
Call Trace:&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8105055a&amp;gt;&amp;#93;&lt;/span&gt; ? enqueue_entity+0x13a/0x340&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81270ccc&amp;gt;&amp;#93;&lt;/span&gt; ? __bitmap_weight+0x8c/0xb0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dc035&amp;gt;&amp;#93;&lt;/span&gt; schedule_timeout+0x215/0x2e0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8105055a&amp;gt;&amp;#93;&lt;/span&gt; ? enqueue_entity+0x13a/0x340&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dbcb3&amp;gt;&amp;#93;&lt;/span&gt; wait_for_common+0x123/0x180&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8105dc20&amp;gt;&amp;#93;&lt;/span&gt; ? default_wake_function+0x0/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dbdcd&amp;gt;&amp;#93;&lt;/span&gt; wait_for_completion+0x1d/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81088a5e&amp;gt;&amp;#93;&lt;/span&gt; work_on_cpu+0xae/0xd0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81281070&amp;gt;&amp;#93;&lt;/span&gt; ? local_pci_probe+0x0/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dc6ae&amp;gt;&amp;#93;&lt;/span&gt; ? mutex_lock+0x1e/0x50&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8128223b&amp;gt;&amp;#93;&lt;/span&gt; pci_device_probe+0xcb/0x120&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133bb12&amp;gt;&amp;#93;&lt;/span&gt; ? driver_sysfs_add+0x62/0x90&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133bcb0&amp;gt;&amp;#93;&lt;/span&gt; driver_probe_device+0xa0/0x2a0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133bf5b&amp;gt;&amp;#93;&lt;/span&gt; __driver_attach+0xab/0xb0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133beb0&amp;gt;&amp;#93;&lt;/span&gt; ? __driver_attach+0x0/0xb0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133af14&amp;gt;&amp;#93;&lt;/span&gt; bus_for_each_dev+0x64/0x90&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133ba4e&amp;gt;&amp;#93;&lt;/span&gt; driver_attach+0x1e/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133b350&amp;gt;&amp;#93;&lt;/span&gt; bus_add_driver+0x200/0x300&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133c286&amp;gt;&amp;#93;&lt;/span&gt; driver_register+0x76/0x140&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810899a8&amp;gt;&amp;#93;&lt;/span&gt; ? __create_workqueue_key+0x1e8/0x280&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff812824d6&amp;gt;&amp;#93;&lt;/span&gt; __pci_register_driver+0x56/0xd0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00c3031&amp;gt;&amp;#93;&lt;/span&gt; ? mlx4_init+0x0/0xbf &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00c3031&amp;gt;&amp;#93;&lt;/span&gt; ? mlx4_init+0x0/0xbf &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00c30b2&amp;gt;&amp;#93;&lt;/span&gt; mlx4_init+0x81/0xbf &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100204c&amp;gt;&amp;#93;&lt;/span&gt; do_one_initcall+0x3c/0x1d0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810aca7f&amp;gt;&amp;#93;&lt;/span&gt; sys_init_module+0xdf/0x250&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100b172&amp;gt;&amp;#93;&lt;/span&gt; system_call_fastpath+0x16/0x1b&lt;br/&gt;
INFO: task insmod:107 blocked for more than 120 seconds.&lt;br/&gt;
&quot;echo 0 &amp;gt; /proc/sys/kernel/hung_task_timeout_secs&quot; disables this message.&lt;br/&gt;
insmod        D 0000000000000000     0   107      1 0x00100004&lt;br/&gt;
 ffff880009e7fb78 0000000000000082 ffff880009e7fb08 ffffffff8105055a&lt;br/&gt;
 ffff880009e7fb08 ffff8800095e4ab8 0000000000000000 ffff880002a15f80&lt;br/&gt;
 ffff880009d10638 ffff880009e7ffd8 000000000000f598 ffff880009d10638&lt;br/&gt;
Call Trace:&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8105055a&amp;gt;&amp;#93;&lt;/span&gt; ? enqueue_entity+0x13a/0x340&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81270ccc&amp;gt;&amp;#93;&lt;/span&gt; ? __bitmap_weight+0x8c/0xb0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dc035&amp;gt;&amp;#93;&lt;/span&gt; schedule_timeout+0x215/0x2e0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8105055a&amp;gt;&amp;#93;&lt;/span&gt; ? enqueue_entity+0x13a/0x340&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dbcb3&amp;gt;&amp;#93;&lt;/span&gt; wait_for_common+0x123/0x180&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8105dc20&amp;gt;&amp;#93;&lt;/span&gt; ? default_wake_function+0x0/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dbdcd&amp;gt;&amp;#93;&lt;/span&gt; wait_for_completion+0x1d/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81088a5e&amp;gt;&amp;#93;&lt;/span&gt; work_on_cpu+0xae/0xd0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81281070&amp;gt;&amp;#93;&lt;/span&gt; ? local_pci_probe+0x0/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dc6ae&amp;gt;&amp;#93;&lt;/span&gt; ? mutex_lock+0x1e/0x50&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8128223b&amp;gt;&amp;#93;&lt;/span&gt; pci_device_probe+0xcb/0x120&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133bb12&amp;gt;&amp;#93;&lt;/span&gt; ? driver_sysfs_add+0x62/0x90&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133bcb0&amp;gt;&amp;#93;&lt;/span&gt; driver_probe_device+0xa0/0x2a0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133bf5b&amp;gt;&amp;#93;&lt;/span&gt; __driver_attach+0xab/0xb0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133beb0&amp;gt;&amp;#93;&lt;/span&gt; ? __driver_attach+0x0/0xb0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133af14&amp;gt;&amp;#93;&lt;/span&gt; bus_for_each_dev+0x64/0x90&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133ba4e&amp;gt;&amp;#93;&lt;/span&gt; driver_attach+0x1e/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133b350&amp;gt;&amp;#93;&lt;/span&gt; bus_add_driver+0x200/0x300&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133c286&amp;gt;&amp;#93;&lt;/span&gt; driver_register+0x76/0x140&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810899a8&amp;gt;&amp;#93;&lt;/span&gt; ? __create_workqueue_key+0x1e8/0x280&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff812824d6&amp;gt;&amp;#93;&lt;/span&gt; __pci_register_driver+0x56/0xd0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00c3031&amp;gt;&amp;#93;&lt;/span&gt; ? mlx4_init+0x0/0xbf &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00c3031&amp;gt;&amp;#93;&lt;/span&gt; ? mlx4_init+0x0/0xbf &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00c30b2&amp;gt;&amp;#93;&lt;/span&gt; mlx4_init+0x81/0xbf &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100204c&amp;gt;&amp;#93;&lt;/span&gt; do_one_initcall+0x3c/0x1d0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810aca7f&amp;gt;&amp;#93;&lt;/span&gt; sys_init_module+0xdf/0x250&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100b172&amp;gt;&amp;#93;&lt;/span&gt; system_call_fastpath+0x16/0x1b&lt;br/&gt;
INFO: task insmod:107 blocked for more than 120 seconds.&lt;br/&gt;
&quot;echo 0 &amp;gt; /proc/sys/kernel/hung_task_timeout_secs&quot; disables this message.&lt;br/&gt;
insmod        D 0000000000000000     0   107      1 0x00100004&lt;br/&gt;
 ffff880009e7fb78 0000000000000082 ffff880009e7fb08 ffffffff8105055a&lt;br/&gt;
 ffff880009e7fb08 ffff8800095e4ab8 0000000000000000 ffff880002a15f80&lt;br/&gt;
 ffff880009d10638 ffff880009e7ffd8 000000000000f598 ffff880009d10638&lt;br/&gt;
Call Trace:&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8105055a&amp;gt;&amp;#93;&lt;/span&gt; ? enqueue_entity+0x13a/0x340&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81270ccc&amp;gt;&amp;#93;&lt;/span&gt; ? __bitmap_weight+0x8c/0xb0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dc035&amp;gt;&amp;#93;&lt;/span&gt; schedule_timeout+0x215/0x2e0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8105055a&amp;gt;&amp;#93;&lt;/span&gt; ? enqueue_entity+0x13a/0x340&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dbcb3&amp;gt;&amp;#93;&lt;/span&gt; wait_for_common+0x123/0x180&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8105dc20&amp;gt;&amp;#93;&lt;/span&gt; ? default_wake_function+0x0/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dbdcd&amp;gt;&amp;#93;&lt;/span&gt; wait_for_completion+0x1d/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81088a5e&amp;gt;&amp;#93;&lt;/span&gt; work_on_cpu+0xae/0xd0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81281070&amp;gt;&amp;#93;&lt;/span&gt; ? local_pci_probe+0x0/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dc6ae&amp;gt;&amp;#93;&lt;/span&gt; ? mutex_lock+0x1e/0x50&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8128223b&amp;gt;&amp;#93;&lt;/span&gt; pci_device_probe+0xcb/0x120&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133bb12&amp;gt;&amp;#93;&lt;/span&gt; ? driver_sysfs_add+0x62/0x90&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133bcb0&amp;gt;&amp;#93;&lt;/span&gt; driver_probe_device+0xa0/0x2a0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133bf5b&amp;gt;&amp;#93;&lt;/span&gt; __driver_attach+0xab/0xb0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133beb0&amp;gt;&amp;#93;&lt;/span&gt; ? __driver_attach+0x0/0xb0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133af14&amp;gt;&amp;#93;&lt;/span&gt; bus_for_each_dev+0x64/0x90&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133ba4e&amp;gt;&amp;#93;&lt;/span&gt; driver_attach+0x1e/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133b350&amp;gt;&amp;#93;&lt;/span&gt; bus_add_driver+0x200/0x300&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8133c286&amp;gt;&amp;#93;&lt;/span&gt; driver_register+0x76/0x140&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810899a8&amp;gt;&amp;#93;&lt;/span&gt; ? __create_workqueue_key+0x1e8/0x280&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff812824d6&amp;gt;&amp;#93;&lt;/span&gt; __pci_register_driver+0x56/0xd0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00c3031&amp;gt;&amp;#93;&lt;/span&gt; ? mlx4_init+0x0/0xbf &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00c3031&amp;gt;&amp;#93;&lt;/span&gt; ? mlx4_init+0x0/0xbf &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa00c30b2&amp;gt;&amp;#93;&lt;/span&gt; mlx4_init+0x81/0xbf &lt;span class=&quot;error&quot;&gt;&amp;#91;mlx4_core&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100204c&amp;gt;&amp;#93;&lt;/span&gt; do_one_initcall+0x3c/0x1d0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810aca7f&amp;gt;&amp;#93;&lt;/span&gt; sys_init_module+0xdf/0x250&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100b172&amp;gt;&amp;#93;&lt;/span&gt; system_call_fastpath+0x16/0x1b&lt;/p&gt;


&lt;p&gt;2. Then I switched to TCP, but the system still hangs, with these messages on one of the OSSs. The OSS cannot even be rebooted with pm.&lt;/p&gt;

&lt;p&gt;Lustre: MGC10.10.4.12@tcp: Reactivating import&lt;br/&gt;
Lustre: lustre-OST0000: new disk, initializing&lt;br/&gt;
Lustre: lustre-OST0000: Now serving lustre-OST0000 on /dev/sdm with recovery enabled&lt;br/&gt;
Lustre: 3706:0:(debug.c:326:libcfs_debug_str2mask()) You are trying to use a numerical value for the mask - this will be deprecated in a future release.&lt;br/&gt;
Lustre: 3707:0:(debug.c:326:libcfs_debug_str2mask()) You are trying to use a numerical value for the mask - this will be deprecated in a future release.&lt;br/&gt;
LDISKFS-fs (sdj): mounted filesystem with ordered data mode&lt;br/&gt;
LDISKFS-fs (sdj): mounted filesystem with ordered data mode&lt;br/&gt;
Lustre: lustre-OST0002: new disk, initializing&lt;br/&gt;
Lustre: lustre-OST0002: Now serving lustre-OST0002 on /dev/sdj with recovery enabled&lt;br/&gt;
Lustre: 3928:0:(debug.c:326:libcfs_debug_str2mask()) You are trying to use a numerical value for the mask - this will be deprecated in a future release.&lt;br/&gt;
Lustre: 3457:0:(ldlm_lib.c:909:target_handle_connect()) lustre-OST0000: connection from lustre-MDT0000-mdtlov_UUID@10.10.4.12@tcp t0 exp (null) cur 1325730017 last 0&lt;br/&gt;
Lustre: 3457:0:(filter.c:2695:filter_connect_internal()) lustre-OST0000: Received MDS connection for group 0&lt;br/&gt;
Lustre: 3456:0:(ldlm_lib.c:909:target_handle_connect()) lustre-OST0002: connection from lustre-MDT0000-mdtlov_UUID@10.10.4.12@tcp t0 exp (null) cur 1325730017 last 0&lt;br/&gt;
Lustre: 3456:0:(filter.c:2695:filter_connect_internal()) lustre-OST0002: Received MDS connection for group 0&lt;br/&gt;
Lustre: import lustre-OST0002-&amp;gt;NET_0x200000a0a040c_UUID netid 20000: select flavor null&lt;br/&gt;
Lustre: lustre-OST0002: received MDS connection from 10.10.4.12@tcp&lt;br/&gt;
Lustre: lustre-OST0000: received MDS connection from 10.10.4.12@tcp&lt;br/&gt;
LDISKFS-fs (sdh): mounted filesystem with ordered data mode&lt;br/&gt;
LDISKFS-fs (sdh): mounted filesystem with ordered data mode&lt;br/&gt;
Lustre: lustre-OST0004: new disk, initializing&lt;br/&gt;
Lustre: lustre-OST0004: Now serving lustre-OST0004 on /dev/sdh with recovery enabled&lt;br/&gt;
Lustre: 4149:0:(debug.c:326:libcfs_debug_str2mask()) You are trying to use a numerical value for the mask - this will be deprecated in a future release.&lt;br/&gt;
Lustre: 4149:0:(debug.c:326:libcfs_debug_str2mask()) Skipped 1 previous similar message&lt;br/&gt;
Lustre: 3457:0:(ldlm_lib.c:909:target_handle_connect()) lustre-OST0000: connection from c59ba0b4-190e-55a2-df32-cfae2f798b2c@10.10.4.133@tcp t0 exp (null) cur 1325730020 last 0&lt;br/&gt;
Lustre: import lustre-OST0002-&amp;gt;NET_0x200000a0a0485_UUID netid 20000: select flavor null&lt;br/&gt;
Lustre: Skipped 1 previous similar message&lt;br/&gt;
CLIENT MAC ADDR: 00 25 90 14 4E 48  GUID: 534D4349 0002 1490 2500 14902500484E  &lt;br/&gt;
CLIENT IP: 10.10.4.132  MASK: 255.255.0.0  DHCP IP: 10.10.0.6                   &lt;br/&gt;
GATEWAY IP: 10.10.0.1                                                   &lt;/p&gt;


&lt;p&gt;Since the OSS is completely inaccessible, I cannot get any further useful information. It should be easy to reproduce on TORO if someone wants to investigate. This is probably a duplicate of &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-885&quot; title=&quot;recovery-mds-scale (FLAVOR=mds) fail, network is not avaliable&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-885&quot;&gt;&lt;del&gt;LU-885&lt;/del&gt;&lt;/a&gt;.&lt;/p&gt;



&lt;p&gt;Thanks&lt;/p&gt;

</comment>
                            <comment id="197349" author="adilger" created="Mon, 29 May 2017 02:53:11 +0000"  >&lt;p&gt;Close old ticket.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="12568">LU-885</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzw13r:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>10260</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>