Details
-
Bug
-
Resolution: Duplicate
-
Minor
-
None
-
Lustre 2.12.0
-
None
-
3
-
9223372036854775807
Description
This issue was created by maloo for Minh Diep <mdiep@whamcloud.com>
This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/ecc78788-5523-11e9-92fe-52540065bddc
test_64 failed with the following error:
Timeout occurred after 308 mins, last suite running was conf-sanity, restarting cluster to continue tests
<<Please provide additional information about the failure here>>
[14994.822655] Lustre: setting import lustre-MDT0000_UUID INACTIVE by administrator request
[15004.893254] Lustre: Unmounted lustre-client
[15042.573217] Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null && lctl dl | grep ' ST ' || true
[15042.714100] Key type lgssc unregistered
[15044.164409] LNet: 7665:0:(socklnd.c:2600:ksocknal_shutdown()) waiting for 1 peers to disconnect
[15048.165408] LNet: 7665:0:(socklnd.c:2600:ksocknal_shutdown()) waiting for 1 peers to disconnect
[15056.166467] LNet: 7665:0:(socklnd.c:2600:ksocknal_shutdown()) waiting for 1 peers to disconnect
[15072.167392] LNet: 7665:0:(socklnd.c:2600:ksocknal_shutdown()) waiting for 1 peers to disconnect
[15104.168388] LNet: 7665:0:(socklnd.c:2600:ksocknal_shutdown()) waiting for 1 peers to disconnect
[15168.169393] LNet: 7665:0:(socklnd.c:2600:ksocknal_shutdown()) waiting for 1 peers to disconnect
[15240.322419] INFO: task socknal_sd00_00:842 blocked for more than 120 seconds.
[15240.323677] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[15240.324983] socknal_sd00_00 D ffff940fb6be8000 0 842 2 0x00000080
[15240.326214] Call Trace:
[15240.326714] [] schedule_preempt_disabled+0x29/0x70
[15240.327920] [] __mutex_lock_slowpath+0xc7/0x1d0
[15240.328985] [] mutex_lock+0x1f/0x2f
[15240.329906] [] lnet_nid2peerni_locked+0x71/0x150 [lnet]
[15240.331085] [] lnet_parse+0x791/0x11e0 [lnet]
[15240.332077] [] ksocknal_process_receive+0x46e/0xda0 [ksocklnd]
[15240.333496] [] ksocknal_scheduler+0xee/0x670 [ksocklnd]
[15240.334632] [] ? wake_up_atomic_t+0x30/0x30
[15240.335646] [] ? ksocknal_recv+0x2a0/0x2a0 [ksocklnd]
[15240.336765] [] kthread+0xd1/0xe0
[15240.337599] [] ? insert_kthread_work+0x40/0x40
[15240.338630] [] ret_from_fork_nospec_begin+0x21/0x21
[15240.339725] [] ? insert_kthread_work+0x40/0x40
[15296.170391] LNet: 7665:0:(socklnd.c:2600:ksocknal_shutdown()) waiting for 1 peers to disconnect
[15360.340409] INFO: task socknal_sd00_00:842 blocked for more than 120 seconds.
[15360.341721] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[15360.342979] socknal_sd00_00 D ffff940fb6be8000 0 842 2 0x00000080
VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
conf-sanity test_64 - Timeout occurred after 308 mins, last suite running was conf-sanity, restarting cluster to continue tests