[LU-10145] replay-single test_74: Timeout occurred after 151 mins, last suite running was replay-single, restarting cluster to continue tests
Created: 20/Oct/17 Updated: 20/Oct/17 |
|
| Status: | Open |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.11.0 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Minor |
| Reporter: | James Casper | Assignee: | WC Triage |
| Resolution: | Unresolved | Votes: | 0 |
| Labels: | None |
| Environment: | trevis, full |
| Severity: | 3 |
| Rank (Obsolete): | 9223372036854775807 |
| Description |
|
https://testing.hpdd.intel.com/test_sessions/7e242182-7767-4925-9974-77f1a02ec4f8

Unable to find a process hang, LBUG, panic, or "BUG:". The timeout happens during or after clients are being stopped.

From test_log:

Stopping clients: trevis-4vm10,trevis-4vm9 /mnt/lustre (opts:)

From client console:

[ 5518.605357] Lustre: DEBUG MARKER: == replay-single test 74: Ensure applications don't fail waiting for OST recovery ==================== 01:04:51 (1507881891)
[ 5518.764057] Lustre: DEBUG MARKER: running=$(grep -c /mnt/lustre' ' /proc/mounts);
[ 5518.764057] if [ $running -ne 0 ] ; then
[ 5518.764057] echo Stopping client $(hostname) /mnt/lustre opts:;
[ 5518.764057] lsof /mnt/lustre || need_kill=no;
[ 5518.764057] if [ x != x -a x$need_kill != xno ]; then
[ 5518.764057] pids=$(lsof -t /mnt/lustre | sort -u);
[ 5518.764057] if
[ 5519.407774] Lustre: Unmounted lustre-client |
| Comments |
| Comment by James Casper [ 20/Oct/17 ] |
|
Note: May be the same as LU-10102. |