[LU-10145] replay-single test_74: Timeout occurred after 151 mins, last suite running was replay-single, restarting cluster to continue tests
Created: 20/Oct/17  Updated: 20/Oct/17

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.11.0
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: James Casper Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None
Environment:

trevis, full
servers: CentOS7.4, ldiskfs, branch master, v2.10.54, b3652
clients: SLES12sp2, branch master, v2.10.54, b3652


Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

https://testing.hpdd.intel.com/test_sessions/7e242182-7767-4925-9974-77f1a02ec4f8

No process hang, LBUG, panic, or "BUG:" message was found in the logs. The timeout occurs while the clients are being stopped, or shortly afterward.
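The check described above can be sketched as a small log scan. This is only an illustration, not the tooling used for this ticket: the function name `scan_console_log` and the regular expressions are assumptions, and the "blocked for more than N seconds" pattern is a guess at how a process hang would appear in a Linux console log.

```python
import re

# Markers named in the description. The exact patterns are assumptions;
# the "hang" pattern in particular is a guess at the kernel hung-task message.
FAILURE_PATTERNS = {
    "LBUG": re.compile(r"\bLBUG\b"),
    "panic": re.compile(r"Kernel panic", re.IGNORECASE),
    "BUG:": re.compile(r"\bBUG:"),
    "hang": re.compile(r"blocked for more than \d+ seconds"),
}


def scan_console_log(lines):
    """Return (line_number, marker_name, text) for every line matching a marker."""
    hits = []
    for number, line in enumerate(lines, start=1):
        for name, pattern in FAILURE_PATTERNS.items():
            if pattern.search(line):
                hits.append((number, name, line.rstrip()))
    return hits


# The last console line quoted in this ticket contains none of the markers.
sample = ["[ 5519.407774] Lustre: Unmounted lustre-client"]
print(scan_console_log(sample))  # prints []
```

An empty result only means that none of these patterns matched; it does not rule out a failure that is logged in some other form.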

From test_log:

Stopping clients: trevis-4vm10,trevis-4vm9 /mnt/lustre (opts:)

From client console:

[ 5518.605357] Lustre: DEBUG MARKER: == replay-single test 74: Ensure applications don't fail waiting for OST recovery ==================== 01:04:51 (1507881891)
[ 5518.764057] Lustre: DEBUG MARKER: running=$(grep -c /mnt/lustre' ' /proc/mounts);
[ 5518.764057] if [ $running -ne 0 ] ; then
[ 5518.764057] echo Stopping client $(hostname) /mnt/lustre opts:;
[ 5518.764057] lsof /mnt/lustre || need_kill=no;
[ 5518.764057] if [ x != x -a x$need_kill != xno ]; then
[ 5518.764057]     pids=$(lsof -t /mnt/lustre | sort -u);
[ 5518.764057]     if 
[ 5519.407774] Lustre: Unmounted lustre-client


 Comments   
Comment by James Casper [ 20/Oct/17 ]

Note: This may be the same issue as LU-10102.
