[LU-10139] recovery-small, test_108: reconnect failed Created: 18/Oct/17  Updated: 25/Mar/22

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.11.0, Lustre 2.12.0, Lustre 2.13.0, Lustre 2.10.7
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: James Casper Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None
Environment:

trevis, failover
servers: CentOS7.4, ldiskfs, branch master, v2.10.54, b3652
clients: CentOS7.4, branch master, v2.10.54, b3652


Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

https://testing.hpdd.intel.com/test_sessions/e49d3517-4ad7-443a-9e1e-e6abf69de85d

From test_log:

dd: failed to open '/mnt/lustre/d108.recovery-small/f108.recovery-small': No space left on device
CMD: trevis-22vm11 /usr/sbin/lctl set_param -n obdfilter.lustre-OST0000.evict_client 6458d6f0-9d70-1e40-ad98-52325e4c764e


 Comments   
Comment by Sarah Liu [ 27/Sep/18 ]

also seen on tag-2.11.55
server: RHEL7
client: SLES12sp3
https://testing.whamcloud.com/test_sets/14de6544-b8a7-11e8-9df3-52540065bddc

Comment by James Nunez (Inactive) [ 25/Apr/19 ]

There are several recovery-small test 108 failures with the error "reconnect failed" that do not have the dd failure in the test_log. For example, we see

== recovery-small test 108: client eviction don't crash ============================================== 00:46:16 (1556066776)
CMD: trevis-54vm5 /usr/sbin/lctl set_param -n obdfilter.lustre-OST0000.evict_client a24fbb1b-60aa-e424-e523-ac286d09c125
256+0 records in
256+0 records out
268435456 bytes (268 MB) copied, 1.35001 s, 199 MB/s
 recovery-small test_108: @@@@@@ FAIL: reconnect failed 

Full test session:
https://testing.whamcloud.com/test_sets/474c4d6a-66c7-11e9-bd0e-52540065bddc

Failover test session:
https://testing.whamcloud.com/test_sets/c84c736c-19bb-11e9-8388-52540065bddc

Comment by Sarah Liu [ 25/Mar/22 ]

similar https://testing.whamcloud.com/test_sets/53efa484-2f57-40d9-acec-0f2ec4bff54e

Generated at Sat Feb 10 02:32:24 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.