Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-12148

conf-sanity test_64: timed out

    XMLWordPrintable

Details

    • Bug
    • Resolution: Duplicate
    • Minor
    • None
    • Lustre 2.12.0
    • None
    • 3
    • 9223372036854775807

    Description

      This issue was created by maloo for Minh Diep <mdiep@whamcloud.com>

      This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/ecc78788-5523-11e9-92fe-52540065bddc

      test_64 failed with the following error:

      Timeout occurred after 308 mins, last suite running was conf-sanity, restarting cluster to continue tests
      

      <<Please provide additional information about the failure here>>
      [14994.822655] Lustre: setting import lustre-MDT0000_UUID INACTIVE by administrator request
      [15004.893254] Lustre: Unmounted lustre-client
      [15042.573217] Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null && lctl dl | grep ' ST ' || true
      [15042.714100] Key type lgssc unregistered
      [15044.164409] LNet: 7665:0:(socklnd.c:2600:ksocknal_shutdown()) waiting for 1 peers to disconnect
      [15048.165408] LNet: 7665:0:(socklnd.c:2600:ksocknal_shutdown()) waiting for 1 peers to disconnect
      [15056.166467] LNet: 7665:0:(socklnd.c:2600:ksocknal_shutdown()) waiting for 1 peers to disconnect
      [15072.167392] LNet: 7665:0:(socklnd.c:2600:ksocknal_shutdown()) waiting for 1 peers to disconnect
      [15104.168388] LNet: 7665:0:(socklnd.c:2600:ksocknal_shutdown()) waiting for 1 peers to disconnect
      [15168.169393] LNet: 7665:0:(socklnd.c:2600:ksocknal_shutdown()) waiting for 1 peers to disconnect
      [15240.322419] INFO: task socknal_sd00_00:842 blocked for more than 120 seconds.
      [15240.323677] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
      [15240.324983] socknal_sd00_00 D ffff940fb6be8000 0 842 2 0x00000080
      [15240.326214] Call Trace:
      [15240.326714] [] schedule_preempt_disabled+0x29/0x70
      [15240.327920] [] __mutex_lock_slowpath+0xc7/0x1d0
      [15240.328985] [] mutex_lock+0x1f/0x2f
      [15240.329906] [] lnet_nid2peerni_locked+0x71/0x150 [lnet]
      [15240.331085] [] lnet_parse+0x791/0x11e0 [lnet]
      [15240.332077] [] ksocknal_process_receive+0x46e/0xda0 [ksocklnd]
      [15240.333496] [] ksocknal_scheduler+0xee/0x670 [ksocklnd]
      [15240.334632] [] ? wake_up_atomic_t+0x30/0x30
      [15240.335646] [] ? ksocknal_recv+0x2a0/0x2a0 [ksocklnd]
      [15240.336765] [] kthread+0xd1/0xe0
      [15240.337599] [] ? insert_kthread_work+0x40/0x40
      [15240.338630] [] ret_from_fork_nospec_begin+0x21/0x21
      [15240.339725] [] ? insert_kthread_work+0x40/0x40
      [15296.170391] LNet: 7665:0:(socklnd.c:2600:ksocknal_shutdown()) waiting for 1 peers to disconnect
      [15360.340409] INFO: task socknal_sd00_00:842 blocked for more than 120 seconds.
      [15360.341721] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
      [15360.342979] socknal_sd00_00 D ffff940fb6be8000 0 842 2 0x00000080

      VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
      conf-sanity test_64 - Timeout occurred after 308 mins, last suite running was conf-sanity, restarting cluster to continue tests

      Attachments

        Issue Links

          Activity

            People

              wc-triage WC Triage
              maloo Maloo
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: