[LU-10139] recovery-small, test_108: reconnect failed Created: 18/Oct/17 Updated: 25/Mar/22 |
|
| Status: | Open |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.11.0, Lustre 2.12.0, Lustre 2.13.0, Lustre 2.10.7 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Minor |
| Reporter: | James Casper | Assignee: | WC Triage |
| Resolution: | Unresolved | Votes: | 0 |
| Labels: | None | ||
| Environment: |
trevis, failover |
||
| Severity: | 3 |
| Rank (Obsolete): | 9223372036854775807 |
| Description |
|
https://testing.hpdd.intel.com/test_sessions/e49d3517-4ad7-443a-9e1e-e6abf69de85d From test_log: dd: failed to open '/mnt/lustre/d108.recovery-small/f108.recovery-small': No space left on device CMD: trevis-22vm11 /usr/sbin/lctl set_param -n obdfilter.lustre-OST0000.evict_client 6458d6f0-9d70-1e40-ad98-52325e4c764e |
| Comments |
| Comment by Sarah Liu [ 27/Sep/18 ] |
|
also seen on tag-2.11.55 |
| Comment by James Nunez (Inactive) [ 25/Apr/19 ] |
|
There are several recovery-small test 108 failures with the error "reconnect failed" that do not have the dd failure in the test_log. For example, we see == recovery-small test 108: client eviction don't crash ============================================== 00:46:16 (1556066776) CMD: trevis-54vm5 /usr/sbin/lctl set_param -n obdfilter.lustre-OST0000.evict_client a24fbb1b-60aa-e424-e523-ac286d09c125 256+0 records in 256+0 records out 268435456 bytes (268 MB) copied, 1.35001 s, 199 MB/s recovery-small test_108: @@@@@@ FAIL: reconnect failed Full test session: Failover test session: |
| Comment by Sarah Liu [ 25/Mar/22 ] |
|
similar https://testing.whamcloud.com/test_sets/53efa484-2f57-40d9-acec-0f2ec4bff54e |