[LU-15254] conf-sanity: test_28A TIMEOUT (mds_connect -11 Created: 19/Nov/21  Updated: 19/Nov/21

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for Sergey Cheremencev <c17829@cray.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/38207303-50da-46ed-9211-1c19aa357fe8

client 1 test output:

mount lustre  on /mnt/lustre.....
Starting client: onyx-60vm1.onyx.whamcloud.com:  -o user_xattr,flock onyx-71vm4@tcp:/lustre /mnt/lustre
CMD: onyx-60vm1.onyx.whamcloud.com mkdir -p /mnt/lustre
CMD: onyx-60vm1.onyx.whamcloud.com mount -t lustre -o user_xattr,flock onyx-71vm4@tcp:/lustre /mnt/lustre

Client1 is hanging on mds_connect with EAGAIN:

[Mon Nov 15 23:20:11 2021] Lustre: DEBUG MARKER: mount -t lustre -o user_xattr,flock onyx-71vm4@tcp:/lustre /mnt/lustre
[Mon Nov 15 23:21:43 2021] LustreError: 11-0: lustre-MDT0001-mdc-ffff8a5a9e7bd800: operation mds_connect to node 10.240.26.9@tcp failed: rc = -11
[Mon Nov 15 23:21:48 2021] LustreError: 11-0: lustre-MDT0000-mdc-ffff8a5a9e7bd800: operation mds_connect to node 10.240.25.255@tcp failed: rc = -11
[Mon Nov 15 23:21:48 2021] LustreError: Skipped 2 previous similar messages
[Mon Nov 15 23:24:17 2021] LustreError: 11-0: lustre-MDT0003-mdc-ffff8a5a9e7bd800: operation mds_connect to node 10.240.26.9@tcp failed: rc = -11

Generated at Sat Feb 10 03:16:45 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.