[LU-10725] conf-sanity test_84: FAIL: 3/3 != 2/3 Created: 26/Feb/18  Updated: 01/Dec/23

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.11.0
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Elena Gryaznova Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Attachments: Zip Archive 5a944d9af72e6228f70618c8.zip    
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

100% reproducible failure on the following config:

MGSNID=fre909@tcp:fre910@tcp NETTYPE=tcp mds1_HOST=fre909 MDSDEV1=/dev/vdb mds_HOST=fre909 MDSDEV=/dev/vdb mds1failover_HOST=fre910 mdsfailover_HOST=fre910 mds2_HOST=fre910 MDSDEV2=/dev/vdc mds2failover_HOST=fre909 MDSCOUNT=2 ost1_HOST=fre911 OSTDEV1=/dev/vdd ost1failover_HOST=fre912 ost2_HOST=fre911 OSTDEV2=/dev/vde ost2failover_HOST=fre912 ost3_HOST=fre912 OSTDEV3=/dev/vdb ost3failover_HOST=fre911 ost4_HOST=fre912 OSTDEV4=/dev/vdc ost4failover_HOST=fre911 OSTCOUNT=4
== conf-sanity test 84: check recovery_hard_time ===================================================== 17:21:25 (1519665685)
start mds service on fre909
start mds service on fre909

...
last_transno: 8589936596
VBR: DISABLED
IR: DISABLED
 conf-sanity test_84: @@@@@@ FAIL: 3/3 != 2/3 


 Comments   
Comment by James Nunez (Inactive) [ 04/Apr/18 ]

We’ve seen the same or similar issue in 'full' test sessions, but we haven’t seen this test fail with this error since early February 2018.

 

Here are just a few instances of this failure:

2.10.56 - https://testing.hpdd.intel.com/test_sets/8c11a12e-f18f-11e7-8c23-52540065bddc

2.10.56 - https://testing.hpdd.intel.com/test_sets/ec88a794-f187-11e7-8c23-52540065bddc

2.10.3 - https://testing.hpdd.intel.com/test_sets/b978ec64-08f8-11e8-a7cd-52540065bddc

2.10.3 - https://testing.hpdd.intel.com/test_sets/d2d1095c-0921-11e8-bd00-52540065bddc

 

Generated at Sat Feb 10 02:37:38 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.