[LU-4500] Failure on test suite sanity-hsm test_300
Created: 17/Jan/14
Updated: 22/Jan/14
Resolved: 22/Jan/14

Status: Closed
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.6.0
Fix Version/s: None

Type: Bug
Priority: Minor
Reporter: Maloo
Assignee: WC Triage
Resolution: Duplicate
Votes: 0
Labels: zfs
Environment:

server: lustre-master build # 1837 RHEL6 zfs
client: lustre-master build # 1837 RHEL6


Severity: 3
Rank (Obsolete): 12313

Description

This issue was created by maloo for sarah <sarah@whamcloud.com>

This issue relates to the following test suite run: http://maloo.whamcloud.com/test_sets/e37aae8e-7e51-11e3-bfda-52540035b04c.

The sub-test test_300 failed with the following error:

test failed to respond and timed out

MDS console

16:11:15:Lustre: MGS is waiting for obd_unlinked_exports more than 32 seconds. The obd refcount = 5. Is it stuck?
16:11:15:LustreError: 137-5: lustre-MDT0000_UUID: not available for connect from 10.10.4.199@tcp (no target). If you are running an HA pair check that the target is mounted on the other server.
16:11:16:LustreError: Skipped 160 previous similar messages
16:11:17:Lustre: MGS is waiting for obd_unlinked_exports more than 64 seconds. The obd refcount = 5. Is it stuck?
16:11:17:Lustre: 3512:0:(client.c:1903:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1389830831/real 1389830831]  req@ffff88006d2c6800 x1457304639026692/t0(0) o250->MGC10.10.4.198@tcp@0@lo:26/25 lens 400/544 e 0 to 1 dl 1389830856 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
16:11:17:Lustre: 3512:0:(client.c:1903:ptlrpc_expire_one_request()) Skipped 6 previous similar messages
16:11:17:Lustre: MGS is waiting for obd_unlinked_exports more than 128 seconds. The obd refcount = 5. Is it stuck?
16:11:17:LustreError: 137-5: lustre-MDT0000_UUID: not available for connect from 10.10.4.199@tcp (no target). If you are running an HA pair check that the target is mounted on the other server.
16:11:18:LustreError: Skipped 319 previous similar messages
16:11:19:INFO: task umount:6969 blocked for more than 120 seconds.
16:11:19:"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
16:11:19:umount        D 0000000000000000     0  6969   6968 0x00000080
16:11:19: ffff880072217aa8 0000000000000082 ffff880072217a08 ffff88004fd88800
16:11:20: ffffffffa078355a 0000000000000000 ffff88006ef9c344 ffffffffa078355a
16:11:20: ffff880057051058 ffff880072217fd8 000000000000fb88 ffff880057051058
16:11:21:Call Trace:
16:11:22: [<ffffffff8150f3f2>] schedule_timeout+0x192/0x2e0
16:11:22: [<ffffffff810811e0>] ? process_timeout+0x0/0x10
16:11:23: [<ffffffffa07096ab>] obd_exports_barrier+0xab/0x180 [obdclass]
16:11:23: [<ffffffffa0f1952e>] mgs_device_fini+0xfe/0x580 [mgs]
16:11:23: [<ffffffffa0732063>] class_cleanup+0x573/0xd30 [obdclass]
16:11:23: [<ffffffffa070b846>] ? class_name2dev+0x56/0xe0 [obdclass]
16:11:23: [<ffffffffa0733d8a>] class_process_config+0x156a/0x1ad0 [obdclass]
16:11:24: [<ffffffffa072c073>] ? lustre_cfg_new+0x2d3/0x6e0 [obdclass]
16:11:25: [<ffffffffa0734469>] class_manual_cleanup+0x179/0x6f0 [obdclass]
16:11:25: [<ffffffffa070b846>] ? class_name2dev+0x56/0xe0 [obdclass]
16:11:25: [<ffffffffa076e06d>] server_put_super+0x45d/0xf60 [obdclass]
16:11:25: [<ffffffff8118366b>] generic_shutdown_super+0x5b/0xe0
16:11:26: [<ffffffff81183756>] kill_anon_super+0x16/0x60
16:19:10: [<ffffffffa0736316>] lustre_kill_super+0x36/0x60 [obdclass]
16:19:11: [<ffffffff81183ef7>] deactivate_super+0x57/0x80
16:19:11: [<ffffffff811a21ef>] mntput_no_expire+0xbf/0x110
16:19:12: [<ffffffff811a2c5b>] sys_umount+0x7b/0x3a0
16:19:13: [<ffffffff8100b072>] system_call_fastpath+0x16/0x1b
16:19:14:Lustre: MGS is waiting for obd_unlinked_exports more than 256 seconds. The obd refcount = 5. Is it stuck?
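The "waiting for obd_unlinked_exports" warnings in the console output above report a threshold that doubles each time (32, 64, 128, 256 seconds) while the obd refcount stays at 5. A minimal sketch for pulling those lines out of a saved console log follows. The helper name `export_waits` and the regular expression are illustrative assumptions, not part of Lustre or Maloo, and the pattern is matched only against the line format shown in this report.

```python
import re

# Hypothetical helper, not a Lustre or Maloo tool: matches the
# "waiting for obd_unlinked_exports" warnings as they appear in this log.
WAIT_RE = re.compile(
    r"^(?P<ts>\d{2}:\d{2}:\d{2}):Lustre: (?P<obd>\S+) is waiting for "
    r"obd_unlinked_exports more than (?P<secs>\d+) seconds\. "
    r"The obd refcount = (?P<ref>\d+)\."
)


def export_waits(lines):
    """Return (timestamp, obd, seconds, refcount) for each barrier warning."""
    found = []
    for line in lines:
        m = WAIT_RE.match(line)
        if m:
            found.append((m["ts"], m["obd"], int(m["secs"]), int(m["ref"])))
    return found


# The four warning lines copied from the MDS console output in this report.
SAMPLE = [
    "16:11:15:Lustre: MGS is waiting for obd_unlinked_exports more than 32 seconds. The obd refcount = 5. Is it stuck?",
    "16:11:17:Lustre: MGS is waiting for obd_unlinked_exports more than 64 seconds. The obd refcount = 5. Is it stuck?",
    "16:11:17:Lustre: MGS is waiting for obd_unlinked_exports more than 128 seconds. The obd refcount = 5. Is it stuck?",
    "16:19:14:Lustre: MGS is waiting for obd_unlinked_exports more than 256 seconds. The obd refcount = 5. Is it stuck?",
]

waits = export_waits(SAMPLE)
for ts, obd, secs, ref in waits:
    print(ts, obd, secs, ref)
```

A refcount that never changes across successive warnings, as here, is consistent with the umount task being blocked in obd_exports_barrier in the call trace.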


Comments
Comment by Doug Oucharek (Inactive) [ 22/Jan/14 ]

Duplicate of LU-3230
