Lustre / LU-3475

Failure on test suite lustre-rsync-test test_1: task mdt00_002:9743 blocked for more than 120 seconds

Details

    • Type: Bug
    • Resolution: Fixed
    • Priority: Minor
    • Fix Version/s: None
    • Affects Version/s: Lustre 2.5.0
    • Labels: None
    • Environment: server and client: lustre-master build #1525
      client: SLES11 SP2
    • Severity: 3
    • Rank: 8715

    Description

      This issue was created by maloo for sarah <sarah@whamcloud.com>

      This issue relates to the following test suite run: http://maloo.whamcloud.com/test_sets/d8d45b58-d512-11e2-b13b-52540035b04c.

      The sub-test test_1 failed with the following error:

      test failed to respond and timed out

      MDS console shows:

      19:00:05:Lustre: DEBUG MARKER: == lustre-rsync-test test 1: Simple Replication ====================================================== 19:00:00 (1371175200)
      19:00:05:Lustre: DEBUG MARKER: lctl --device lustre-MDT0000 changelog_register -n
      19:00:05:Lustre: lustre-MDD0000: changelog on
      19:00:05:Lustre: DEBUG MARKER: lctl get_param -n mdd.lustre-MDT0000.changelog_users
      19:00:05:Lustre: DEBUG MARKER: dumpe2fs -h /dev/lvm-MDS/P1 2>&1 | grep -q large_xattr
      19:00:05:Lustre: DEBUG MARKER: dumpe2fs -h /dev/lvm-MDS/P1 2>&1
      19:00:05:Lustre: DEBUG MARKER: dumpe2fs -h /dev/lvm-MDS/P1 2>&1 | grep -q large_xattr
      19:02:07:INFO: task mdt00_002:9743 blocked for more than 120 seconds.
      19:02:07:"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
      19:02:07:mdt00_002     D 0000000000000000     0  9743      2 0x00000080
      19:02:07: ffff88007d0eda50 0000000000000046 000200000a0a04dc ffff880037d444c0
      19:02:07: ffff88007d0eda00 ffffc90000938030 0000000000000246 0000000000000246
      19:02:07: ffff88007d0ebaf8 ffff88007d0edfd8 000000000000fb88 ffff88007d0ebaf8
      19:02:07:Call Trace:
      19:02:07: [<ffffffffa05eee06>] ? htable_lookup+0x1a6/0x1c0 [obdclass]
      19:02:07: [<ffffffffa04796fe>] cfs_waitq_wait+0xe/0x10 [libcfs]
      19:02:07: [<ffffffffa05ef413>] lu_object_find_at+0xb3/0x360 [obdclass]
      19:02:07: [<ffffffff8127f6de>] ? number+0x2ee/0x320
      19:02:07: [<ffffffff81063310>] ? default_wake_function+0x0/0x20
      19:02:07: [<ffffffffa05f1aea>] dt_locate_at+0x3a/0x140 [obdclass]
      19:02:07: [<ffffffffa05c846d>] llog_osd_dir_get+0xdd/0x1e0 [obdclass]
      19:02:07: [<ffffffffa05cf6c7>] llog_osd_open+0x427/0xc00 [obdclass]
      19:02:07: [<ffffffffa059a36a>] llog_open+0xba/0x2c0 [obdclass]
      19:02:07: [<ffffffffa0790997>] llog_origin_handle_open+0x1f7/0x6f0 [ptlrpc]
      19:02:07: [<ffffffffa0dcc592>] mdt_llog_create+0x32/0x50 [mdt]
      19:02:07: [<ffffffffa0dd2b78>] mdt_handle_common+0x648/0x1660 [mdt]
      19:02:07: [<ffffffffa0e0c205>] mds_regular_handle+0x15/0x20 [mdt]
      19:02:07: [<ffffffffa07896a8>] ptlrpc_server_handle_request+0x398/0xc60 [ptlrpc]
      19:02:07: [<ffffffffa04795de>] ? cfs_timer_arm+0xe/0x10 [libcfs]
      19:02:07: [<ffffffffa048adaf>] ? lc_watchdog_touch+0x6f/0x170 [libcfs]
      19:02:07: [<ffffffffa0780a09>] ? ptlrpc_wait_event+0xa9/0x290 [ptlrpc]
      19:02:07: [<ffffffff81055ab3>] ? __wake_up+0x53/0x70
      19:02:07: [<ffffffffa078aa3e>] ptlrpc_main+0xace/0x1700 [ptlrpc]
      19:02:07: [<ffffffffa0789f70>] ? ptlrpc_main+0x0/0x1700 [ptlrpc]
      19:02:07: [<ffffffff8100c0ca>] child_rip+0xa/0x20
      19:02:07: [<ffffffffa0789f70>] ? ptlrpc_main+0x0/0x1700 [ptlrpc]
      19:02:07: [<ffffffffa0789f70>] ? ptlrpc_main+0x0/0x1700 [ptlrpc]
      19:02:07: [<ffffffff8100c0c0>] ? child_rip+0x0/0x20
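
A trace like the one above is easier to read once the frames marked with "?" are filtered out, since those are addresses the kernel found on the stack but could not confirm as part of the call chain. The following is a minimal sketch of such a filter, not part of the original report; the names `FRAME_RE`, `reliable_frames` and `SAMPLE` are hypothetical, and the sample is a short excerpt of the console output quoted above.

```python
import re

# Matches a kernel call-trace frame such as:
#   19:02:07: [<ffffffffa05ef413>] lu_object_find_at+0xb3/0x360 [obdclass]
# The optional "?" marks a frame the unwinder could not verify.
FRAME_RE = re.compile(
    r"\[<(?P<addr>[0-9a-f]+)>\]\s+(?P<unsure>\?\s+)?"
    r"(?P<func>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?:\s+\[(?P<module>\w+)\])?"
)


def reliable_frames(console_text):
    """Return (function, module) pairs for frames not marked with '?'."""
    frames = []
    for line in console_text.splitlines():
        m = FRAME_RE.search(line)
        if m and not m.group("unsure"):
            # Frames without a [module] suffix belong to the core kernel.
            frames.append((m.group("func"), m.group("module") or "kernel"))
    return frames


# Excerpt of the MDS console output from this report.
SAMPLE = """\
19:02:07: [<ffffffffa05eee06>] ? htable_lookup+0x1a6/0x1c0 [obdclass]
19:02:07: [<ffffffffa04796fe>] cfs_waitq_wait+0xe/0x10 [libcfs]
19:02:07: [<ffffffffa05ef413>] lu_object_find_at+0xb3/0x360 [obdclass]
19:02:07: [<ffffffff8127f6de>] ? number+0x2ee/0x320
19:02:07: [<ffffffffa05f1aea>] dt_locate_at+0x3a/0x140 [obdclass]
"""

for func, module in reliable_frames(SAMPLE):
    print(f"{func} [{module}]")
```

Applied to the full trace, this leaves the path from `ptlrpc_main` through `mdt_llog_create`, `llog_open` and `dt_locate_at` down to the wait in `lu_object_find_at`, which is where the `mdt00_002` thread is blocked.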
      

          People

            Assignee: Emoly Liu (emoly.liu)
            Reporter: Maloo (maloo)