[LU-5500] sanity test_17n: test failed to respond and timed out Created: 18/Aug/14  Updated: 17/Apr/17  Resolved: 17/Apr/17

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.7.0
Fix Version/s: None

Type: Bug Priority: Major
Reporter: Maloo Assignee: WC Triage
Resolution: Cannot Reproduce Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 15346

 Description   

This issue was created by maloo for Bob Glossman <bob.glossman@intel.com>

This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/13dc24b2-25a1-11e4-a51c-5254006e85c2.

The sub-test test_17n failed with the following error:

test failed to respond and timed out

Info required for matching: sanity 17n



 Comments   
Comment by Oleg Drokin [ 19/Aug/14 ]

So mds1 fails to stop in a reasonable time:

LustreError: 12125:0:(import.c:323:ptlrpc_invalidate_import()) lustre-MDT0000_UUID: rc = -110 waiting for callback (1 != 0)
LustreError: 12125:0:(import.c:349:ptlrpc_invalidate_import()) @@@ still on sending list  req@ffff880059520400 x1476614117401812/t0(0) o400->lustre-MDT0000-osp-MDT0001@10.2.4.185@tcp:24/4 lens 224/224 e 236031 to 0 dl 1408209204 ref 1 fl Unregistering:E/c0/ffffffff rc -5/-1
LustreError: 12125:0:(import.c:364:ptlrpc_invalidate_import()) lustre-MDT0000_UUID: RPCs in "Unregistering" phase found (1). Network is sluggish? Waiting them to error out.
Lustre: lustre-MDT0001: Not available for connect from 10.2.4.186@tcp (stopping)
Lustre: Skipped 39 previous similar messages
LustreError: 12125:0:(import.c:323:ptlrpc_invalidate_import()) lustre-MDT0000_UUID: rc = -110 waiting for callback (1 != 0)
LustreError: 12125:0:(import.c:349:ptlrpc_invalidate_import()) @@@ still on sending list  req@ffff880059520400 x1476614117401812/t0(0) o400->lustre-MDT0000-osp-MDT0001@10.2.4.185@tcp:24/4 lens 224/224 e 236031 to 0 dl 1408209204 ref 1 fl Unregistering:E/c0/ffffffff rc -5/-1
LustreError: 12125:0:(import.c:364:ptlrpc_invalidate_import()) lustre-MDT0000_UUID: RPCs in "Unregistering" phase found (1). Network is sluggish? Waiting them to error out.
LustreError: 12125:0:(import.c:323:ptlrpc_invalidate_import()) lustre-MDT0000_UUID: rc = -110 waiting for callback (1 != 0)
LustreError: 12125:0:(import.c:349:ptlrpc_invalidate_import()) @@@ still on sending list  req@ffff880059520400 x1476614117401812/t0(0) o400->lustre-MDT0000-osp-MDT0001@10.2.4.185@tcp:24/4 lens 224/224 e 236031 to 0 dl 1408209204 ref 1 fl Unregistering:E/c0/ffffffff rc -5/-1
LustreError: 12125:0:(import.c:364:ptlrpc_invalidate_import()) lustre-MDT0000_UUID: RPCs in "Unregistering" phase found (1). Network is sluggish? Waiting them to error out.
Lustre: lustre-MDT0001: Not available for connect from 10.2.4.187@tcp (stopping)
Lustre: Skipped 80 previous similar messages

Wasn't there a recent ticket from Xyratex that discussed something like this, caused by one of the recent patches in this area?
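
For readers triaging similar logs, the rc values in the lines above can be read as negated Linux errno values, which is the usual convention in kernel code. The following is a minimal sketch for decoding them, not part of the original report. The names decode_rc, LINUX_ERRNO and RC_PATTERN are hypothetical, and the table covers only the two codes seen in this log. The second field of "rc -5/-1" is deliberately left undecoded.

```python
import re

# Linux errno numbers, hard-coded so the result does not depend on the
# platform running this script. Only the codes seen in this log are listed.
LINUX_ERRNO = {5: "EIO", 110: "ETIMEDOUT"}

# Matches "rc = -110" and the first field of "rc -5/-1".
RC_PATTERN = re.compile(r"\brc(?: =)? (-\d+)")


def decode_rc(line):
    """Return symbolic names for the negative return codes in one log line."""
    names = []
    for match in RC_PATTERN.finditer(line):
        code = -int(match.group(1))
        # Fall back to the raw number for codes missing from the table.
        names.append(LINUX_ERRNO.get(code, f"errno {code}"))
    return names


if __name__ == "__main__":
    sample = "lustre-MDT0000_UUID: rc = -110 waiting for callback (1 != 0)"
    print(decode_rc(sample))
```

Read this way, the "waiting for callback" message reports a timeout (ETIMEDOUT) and the stuck request carries an I/O error status (EIO).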

Comment by Andreas Dilger [ 17/Apr/17 ]

Close old issue.
