[LU-4191] lustre/tests/lnet-selftest.sh: possible misuse of lst --concurrency Created: 30/Oct/13  Updated: 30/Oct/13

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Isaac Huang (Inactive) Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 11340

 Description   

I looked at a lst test script and saw:
lst add_test --batch b --loop 1000 --concurrency 1 --distribute 2:2 --from c --to s brw read check=full size=4k
lst add_test --batch b --loop 1000 --concurrency 8 --distribute 2:2 --from c --to s brw read check=full size=4k

Unless I've missed something, the two commands are equivalent to a single command:
lst add_test --batch b --loop 1000 --concurrency 9 --distribute 2:2 --from c --to s brw read check=full size=4k

While it's not an error, it doesn't make sense to me to issue two commands while one should suffice.

In lustre/tests/lnet-selftest.sh:
lst_CONCR=${lst_CONCR:-"1 2 4 8"}
if [ "$SLOW" = no ]; then
lst_CONCR="1 8"
fi

I think it'd suffice to:
lst_CONCR=${lst_CONCR:-"8"}
if [ "$SLOW" = no ]; then
lst_CONCR="4"
fi

Running multiple tests at concurrency 8 should suffice to saturate the network, and concurrency 4 would make it run twice faster.



 Comments   
Comment by Isaac Huang (Inactive) [ 30/Oct/13 ]

Also, the script always uses #clients:#servers or #servers:#clients as the distribution. Liang can you please comment if this is right? What do we get with --distribute 3:2 for example?

Generated at Sat Feb 10 01:40:29 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.