Lustre / LU-4500

Failure on test suite sanity-hsm test_300

Details

    • Bug
    • Resolution: Duplicate
    • Minor
    • None
    • Lustre 2.6.0
    • server: lustre-master build # 1837 RHEL6 zfs
      client: lustre-master build # 1837 RHEL6
    • 3
    • 12313

    Description

      This issue was created by maloo for sarah <sarah@whamcloud.com>

      This issue relates to the following test suite run: http://maloo.whamcloud.com/test_sets/e37aae8e-7e51-11e3-bfda-52540035b04c.

      The sub-test test_300 failed with the following error:

      test failed to respond and timed out

      MDS console

      16:11:15:Lustre: MGS is waiting for obd_unlinked_exports more than 32 seconds. The obd refcount = 5. Is it stuck?
      16:11:15:LustreError: 137-5: lustre-MDT0000_UUID: not available for connect from 10.10.4.199@tcp (no target). If you are running an HA pair check that the target is mounted on the other server.
      16:11:16:LustreError: Skipped 160 previous similar messages
      16:11:17:Lustre: MGS is waiting for obd_unlinked_exports more than 64 seconds. The obd refcount = 5. Is it stuck?
      16:11:17:Lustre: 3512:0:(client.c:1903:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1389830831/real 1389830831]  req@ffff88006d2c6800 x1457304639026692/t0(0) o250->MGC10.10.4.198@tcp@0@lo:26/25 lens 400/544 e 0 to 1 dl 1389830856 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
      16:11:17:Lustre: 3512:0:(client.c:1903:ptlrpc_expire_one_request()) Skipped 6 previous similar messages
      16:11:17:Lustre: MGS is waiting for obd_unlinked_exports more than 128 seconds. The obd refcount = 5. Is it stuck?
      16:11:17:LustreError: 137-5: lustre-MDT0000_UUID: not available for connect from 10.10.4.199@tcp (no target). If you are running an HA pair check that the target is mounted on the other server.
      16:11:18:LustreError: Skipped 319 previous similar messages
      16:11:19:INFO: task umount:6969 blocked for more than 120 seconds.
      16:11:19:"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
      16:11:19:umount        D 0000000000000000     0  6969   6968 0x00000080
      16:11:19: ffff880072217aa8 0000000000000082 ffff880072217a08 ffff88004fd88800
      16:11:20: ffffffffa078355a 0000000000000000 ffff88006ef9c344 ffffffffa078355a
      16:11:20: ffff880057051058 ffff880072217fd8 000000000000fb88 ffff880057051058
      16:11:21:Call Trace:
      16:11:22: [<ffffffff8150f3f2>] schedule_timeout+0x192/0x2e0
      16:11:22: [<ffffffff810811e0>] ? process_timeout+0x0/0x10
      16:11:23: [<ffffffffa07096ab>] obd_exports_barrier+0xab/0x180 [obdclass]
      16:11:23: [<ffffffffa0f1952e>] mgs_device_fini+0xfe/0x580 [mgs]
      16:11:23: [<ffffffffa0732063>] class_cleanup+0x573/0xd30 [obdclass]
      16:11:23: [<ffffffffa070b846>] ? class_name2dev+0x56/0xe0 [obdclass]
      16:11:23: [<ffffffffa0733d8a>] class_process_config+0x156a/0x1ad0 [obdclass]
      16:11:24: [<ffffffffa072c073>] ? lustre_cfg_new+0x2d3/0x6e0 [obdclass]
      16:11:25: [<ffffffffa0734469>] class_manual_cleanup+0x179/0x6f0 [obdclass]
      16:11:25: [<ffffffffa070b846>] ? class_name2dev+0x56/0xe0 [obdclass]
      16:11:25: [<ffffffffa076e06d>] server_put_super+0x45d/0xf60 [obdclass]
      16:11:25: [<ffffffff8118366b>] generic_shutdown_super+0x5b/0xe0
      16:11:26: [<ffffffff81183756>] kill_anon_super+0x16/0x60
      16:19:10: [<ffffffffa0736316>] lustre_kill_super+0x36/0x60 [obdclass]
      16:19:11: [<ffffffff81183ef7>] deactivate_super+0x57/0x80
      16:19:11: [<ffffffff811a21ef>] mntput_no_expire+0xbf/0x110
      16:19:12: [<ffffffff811a2c5b>] sys_umount+0x7b/0x3a0
      16:19:13: [<ffffffff8100b072>] system_call_fastpath+0x16/0x1b
      16:19:14:Lustre: MGS is waiting for obd_unlinked_exports more than 256 seconds. The obd refcount = 5. Is it stuck?
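
A minimal triage sketch, not part of the original report: it pulls the repeated "MGS is waiting for obd_unlinked_exports" warnings out of a console log and checks whether the reported refcount ever changes. The helper names `barrier_waits` and `refcount_stuck` are hypothetical, and the sample text is copied from the log above.

```python
import re

# Matches the repeated MGS barrier warning seen in the console log above.
WAIT_RE = re.compile(
    r"MGS is waiting for obd_unlinked_exports more than (\d+) seconds\. "
    r"The obd refcount = (\d+)\."
)


def barrier_waits(console_text):
    """Return (seconds, refcount) pairs in the order they appear (hypothetical helper)."""
    return [(int(secs), int(ref)) for secs, ref in WAIT_RE.findall(console_text)]


def refcount_stuck(waits):
    """True if there are at least two warnings and the refcount never changed."""
    return len(waits) >= 2 and len({ref for _, ref in waits}) == 1


sample = """\
16:11:15:Lustre: MGS is waiting for obd_unlinked_exports more than 32 seconds. The obd refcount = 5. Is it stuck?
16:11:17:Lustre: MGS is waiting for obd_unlinked_exports more than 64 seconds. The obd refcount = 5. Is it stuck?
16:11:17:Lustre: MGS is waiting for obd_unlinked_exports more than 128 seconds. The obd refcount = 5. Is it stuck?
16:19:14:Lustre: MGS is waiting for obd_unlinked_exports more than 256 seconds. The obd refcount = 5. Is it stuck?
"""

waits = barrier_waits(sample)
for secs, ref in waits:
    print(f"waited more than {secs}s, refcount={ref}")
print("refcount unchanged across warnings:", refcount_stuck(waits))
```

In this log the refcount stays at 5 while the reported wait doubles from 32 to 256 seconds, which is consistent with the umount task blocked in obd_exports_barrier in the call trace.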
      

          People

            Assignee: WC Triage (wc-triage)
            Reporter: Maloo (maloo)