[LU-1828] Test failure on test suite lnet-selftest, subtest test_smoke Created: 04/Sep/12  Updated: 13/Sep/12  Resolved: 12/Sep/12

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.3.0
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: Isaac Huang (Inactive)
Resolution: Duplicate Votes: 0
Labels: None
Environment:

server: lustreb2_3-tag2.2.94 RHEL6
client: lustreb2_3-tag2.2.94 SLES11


Issue Links:
Duplicate
Severity: 3
Rank (Obsolete): 5789

 Description   

This issue was created by maloo for sarah <sarah@whamcloud.com>

This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/95843852-f253-11e1-9def-52540035b04c.

The sub-test test_smoke failed with the following error:

/tmp/smoke.sh failed: 254

Not sure if the same issue as ORI-130

client-22vm3: client-22vm4:    
#!/bin/bash
set -e
cleanup () { trap 0; echo killing $1 ... ; kill -9 $1 || true; }
/usr/sbin/lst new_session --timeo 100000 hh
/usr/sbin/lst add_group c 10.10.4.108@tcp 10.10.4.109@tcp
/usr/sbin/lst add_group s 10.10.4.110@tcp 10.10.4.111@tcp
/usr/sbin/lst add_batch b
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 1 --distribute 2:2 --from c --to s brw read check=full size=4k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 1 --distribute 2:2 --from s --to c brw read check=full size=4k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 2 --distribute 2:2 --from c --to s brw read check=full size=4k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 2 --distribute 2:2 --from s --to c brw read check=full size=4k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 4 --distribute 2:2 --from c --to s brw read check=full size=4k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 4 --distribute 2:2 --from s --to c brw read check=full size=4k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 8 --distribute 2:2 --from c --to s brw read check=full size=4k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 8 --distribute 2:2 --from s --to c brw read check=full size=4k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 1 --distribute 2:2 --from c --to s brw read check=full size=8k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 1 --distribute 2:2 --from s --to c brw read check=full size=8k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 2 --distribute 2:2 --from c --to s brw read check=full size=8k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 2 --distribute 2:2 --from s --to c brw read check=full size=8k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 4 --distribute 2:2 --from c --to s brw read check=full size=8k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 4 --distribute 2:2 --from s --to c brw read check=full size=8k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 8 --distribute 2:2 --from c --to s brw read check=full size=8k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 8 --distribute 2:2 --from s --to c brw read check=full size=8k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 1 --distribute 2:2 --from c --to s brw read check=full size=256k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 1 --distribute 2:2 --from s --to c brw read check=full size=256k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 2 --distribute 2:2 --from c --to s brw read check=full size=256k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 2 --distribute 2:2 --from s --to c brw read check=full size=256k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 4 --distribute 2:2 --from c --to s brw read check=full size=256k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 4 --distribute 2:2 --from s --to c brw read check=full size=256k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 8 --distribute 2:2 --from c --to s brw read check=full size=256k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 8 --distribute 2:2 --from s --to c brw read check=full size=256k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 1 --distribute 2:2 --from c --to s brw read check=full size=1M
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 1 --distribute 2:2 --from s --to c brw read check=full size=1M
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 2 --distribute 2:2 --from c --to s brw read check=full size=1M
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 2 --distribute 2:2 --from s --to c brw read check=full size=1M
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 4 --distribute 2:2 --from c --to s brw read check=full size=1M
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 4 --distribute 2:2 --from s --to c brw read check=full size=1M
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 8 --distribute 2:2 --from c --to s brw read check=full size=1M
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 8 --distribute 2:2 --from s --to c brw read check=full size=1M
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 1 --distribute 2:2 --from c --to s brw write check=full size=4k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 1 --distribute 2:2 --from s --to c brw write check=full size=4k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 2 --distribute 2:2 --from c --to s brw write check=full size=4k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 2 --distribute 2:2 --from s --to c brw write check=full size=4k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 4 --distribute 2:2 --from c --to s brw write check=full size=4k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 4 --distribute 2:2 --from s --to c brw write check=full size=4k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 8 --distribute 2:2 --from c --to s brw write check=full size=4k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 8 --distribute 2:2 --from s --to c brw write check=full size=4k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 1 --distribute 2:2 --from c --to s brw write check=full size=8k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 1 --distribute 2:2 --from s --to c brw write check=full size=8k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 2 --distribute 2:2 --from c --to s brw write check=full size=8k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 2 --distribute 2:2 --from s --to c brw write check=full size=8k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 4 --distribute 2:2 --from c --to s brw write check=full size=8k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 4 --distribute 2:2 --from s --to c brw write check=full size=8k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 8 --distribute 2:2 --from c --to s brw write check=full size=8k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 8 --distribute 2:2 --from s --to c brw write check=full size=8k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 1 --distribute 2:2 --from c --to s brw write check=full size=256k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 1 --distribute 2:2 --from s --to c brw write check=full size=256k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 2 --distribute 2:2 --from c --to s brw write check=full size=256k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 2 --distribute 2:2 --from s --to c brw write check=full size=256k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 4 --distribute 2:2 --from c --to s brw write check=full size=256k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 4 --distribute 2:2 --from s --to c brw write check=full size=256k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 8 --distribute 2:2 --from c --to s brw write check=full size=256k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 8 --distribute 2:2 --from s --to c brw write check=full size=256k
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 1 --distribute 2:2 --from c --to s brw write check=full size=1M
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 1 --distribute 2:2 --from s --to c brw write check=full size=1M
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 2 --distribute 2:2 --from c --to s brw write check=full size=1M
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 2 --distribute 2:2 --from s --to c brw write check=full size=1M
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 4 --distribute 2:2 --from c --to s brw write check=full size=1M
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 4 --distribute 2:2 --from s --to c brw write check=full size=1M
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 8 --distribute 2:2 --from c --to s brw write check=full size=1M
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 8 --distribute 2:2 --from s --to c brw write check=full size=1M
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 1 --distribute 2:2 --from c --to s ping 
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 1 --distribute 2:2 --from s --to c ping 
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 2 --distribute 2:2 --from c --to s ping 
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 2 --distribute 2:2 --from s --to c ping 
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 4 --distribute 2:2 --from c --to s ping 
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 4 --distribute 2:2 --from s --to c ping 
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 8 --distribute 2:2 --from c --to s ping 
/usr/sbin/lst add_test --batch b --loop 100000  --concurrency 8 --distribute 2:2 --from s --to c ping 
/usr/sbin/lst run b
sleep 1
/usr/sbin/lst stat --delay 10 --timeout 10 c s &
pid=$!
trap "cleanup $pid" INT TERM
sleep 1800
cleanup $pid
SESSION: hh FEATURES: 0 TIMEOUT: 100000 FORCE: No
10.10.4.108@tcp are added to session
10.10.4.109@tcp are added to session
10.10.4.110@tcp are added to session
create session RPC failed on 12345-10.10.4.111@tcp: Unknown error 18446744073709551506
 lnet-selftest test_smoke: @@@@@@ FAIL: /tmp/smoke.sh failed: 254 


 Comments   
Comment by Peter Jones [ 06/Sep/12 ]

isaac is this a duplicate of LU-1728?

Comment by Jian Yu [ 11/Sep/12 ]

Another instance: https://maloo.whamcloud.com/test_sets/7180318a-fa51-11e1-887d-52540035b04c

I think this is a test environment issue. From the test outputs, we can see:

rpc.sh: line 13: /usr/lib64/lustre/tests/cfg/autotest_config.sh: No such file or directory

This will prevent do_rpc_nodes() from working properly in lst_* functions.

Comment by Sarah Liu [ 12/Sep/12 ]

I think this is a dup of TT-858

Comment by Isaac Huang (Inactive) [ 12/Sep/12 ]

dup of TT-858

Generated at Sat Feb 10 01:20:02 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.