[LU-15161] recovery-double-scale test_pairwise_fail: skipped on setup with 5 clients Created: 25/Oct/21  Updated: 16/Nov/21

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Issue Links:
Related
is related to LU-11073 enable DNE in recovery-mds-scale, rec... Open
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for Elena <elena.gryaznova@hpe.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/73b3aeac-7245-466a-af7f-aa54fa567fe9

test_pairwise_fail failed with the following error:

has less than 5 Clients, test 9 skipped

setup has 5 clients:
Nodes:

trevis-64vm1 - OST 1, OST 2, OST 3, OST 4, OST 5, OST 6, OST 7 (2.14.55.29, x86_64)
trevis-64vm2 - OST 1, OST 2, OST 3, OST 4, OST 5, OST 6, OST 7 (2.14.55.29, x86_64)
trevis-64vm3 - MDS 1, MDS 2 (2.14.55.29, x86_64)
trevis-64vm4 - MDS 1 (2.14.55.29, x86_64)
trevis-212vm6 - Client 2 (2.14.55.29, x86_64)
trevis-212vm7 - Client (2.14.55_29_g70b46f7, x86_64)
trevis-212vm8 - Client 3 (2.14.55.29, x86_64)
trevis-212vm9 - Client 4 (2.14.55.29, x86_64)
trevis-212vm10 - Client 1 (2.14.55.29, x86_64)

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
recovery-double-scale test_pairwise_fail - has less than 5 Clients, test 9 skipped



 Comments   
Comment by Elena Gryaznova [ 26/Oct/21 ]

https://testing.whamcloud.com/test_logs/a30e197a-729e-4d0d-8293-6c016e155abc/show_text

Started client load: dd on trevis-212vm10
Started client load: tar on trevis-212vm8
Started client load: dbench on trevis-212vm9

client load was not started on trevis-212vm6
It looks like that $CLIENTS contains not all of the clients.

Comment by Elena Gryaznova [ 26/Oct/21 ]

the added debug
https://review.whamcloud.com/#/c/32626/28/lustre/tests/recovery-double-scale.sh

log "Using NODES_TO_USE: $NODES_TO_USE and CLIENTS: $CLIENTS"
NODES_TO_USE=$(exclude_items_from_list $NODES_TO_USE $HOSTNAME)
log "Using remote NODES_TO_USE: $NODES_TO_USE HOSTNAME=$HOSTNAME"

shows that for session
https://testing.whamcloud.com/test_sets/6e63e4d6-ea5d-4ce3-93c9-7176761f6fd8
with Nodes:
trevis-205vm10 - Client 1 (2.14.55.29, x86_64)
trevis-205vm11 - Client (2.14.55_29_g74c51a8, x86_64)
trevis-205vm12 - Client 2 (2.14.55.29, x86_64)
trevis-205vm13 - Client 3 (2.14.55.29, x86_64)
trevis-205vm14 - Client 4 (2.14.55.29, x86_64)

the CLIENTS actually set to:
trevis-205vm10.trevis.whamcloud.com,trevis-205vm12,trevis-205vm13,trevis-205vm14
https://testing.whamcloud.com/test_logs/b5cabfa4-e914-4809-831f-5ed50dd7df65/show_text

i.e. trevis-205vm11 is missing in CLIENTS list.

adilger,
can you please advice whom we should address this issue?
Thank you.

Comment by Cory Spitz [ 29/Oct/21 ]

jamesanunez, this looks like a test automation problem of some kind. Can you please take a look and say if it is something you can handle? Thanks!

Generated at Sat Feb 10 03:15:58 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.