[LU-15239] replay-single test_35 : Timeout occurred after 248 mins Created: 16/Nov/21  Updated: 16/Nov/21

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Sergey Cheremencev Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

https://testing.whamcloud.com/test_sets/68256292-28e9-4664-a117-da82cba40996
== replay-single test 35: test recovery from llog for unlink op ========================================================== 22:10:18 (1637014218)
CMD: onyx-124vm17 lctl set_param fail_loc=0x80000119
fail_loc=0x80000119
...
CMD: onyx-124vm17 zfs get -H -o value lustre:svname lustre-mdt1/mdt1 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}'
pdsh@onyx-64vm10: onyx-124vm17: ssh exited with exit code 1
CMD: onyx-124vm17 zfs get -H -o value lustre:svname lustre-mdt1/mdt1 2>/dev/null
Started lustre-MDT0000
rm: cannot remove '/mnt/lustre/f35.replay-single': Input/output error
MDS:

[Mon Nov 15 22:10:17 2021] Lustre: DEBUG MARKER: == replay-single test 35: test recovery from llog for unlink op ========================================================== 22:10:18 (1637014218)
[Mon Nov 15 22:10:39 2021] LustreError: 167-0: lustre-MDT0000-mdc-ffff9c97fbee0800: This client was evicted by lustre-MDT0000; in progress operations using this service will fail.
[Mon Nov 15 22:10:48 2021] Lustre: lustre-MDT0000-mdc-ffff9c97fbee0800: Connection to lustre-MDT0000 (at 10.240.30.48@tcp) was lost; in progress operations using this service will wait for recovery to complete
[Mon Nov 15 22:10:48 2021] Lustre: Skipped 14 previous similar messages 

Generated at Sat Feb 10 03:16:38 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.