[LU-15892] replay-dual test_26: timeout mounting MDT llog_verify_record() Created: 26/May/22  Updated: 04/Jul/23

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for Andreas Dilger <adilger@whamcloud.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/07235ef8-47e4-42ea-a13c-de5ef6c7aa40

test_26 failed with the following error:

CMD: onyx-71vm15 zfs get -H -o value lustre:svname lustre-mdt3/mdt3 2>/dev/null
Started lustre-MDT0002

Timeout occurred after 120 mins, last suite running was replay-dual

The console log contains a large number of llog errors, most of them suppressed by message rate limiting:

[ 3763.601623] LustreError: 104341:0:(llog.c:656:llog_process_thread()) lustre-MDT0000-osp-MDT0001: invalid record in llog [0x2:0x11d41:0x2] record for index 0/2: rc = -22
[ 3763.603953] LustreError: 104341:0:(llog.c:656:llog_process_thread()) Skipped 1173 previous similar messages
[ 3763.611056] LustreError: 104341:0:(llog.c:482:llog_verify_record()) lustre-MDT0000-osp-MDT0001: magic 0 is bad
[ 3763.612678] LustreError: 104341:0:(llog.c:482:llog_verify_record()) Skipped 586 previous similar messages
[ 3763.614206] LustreError: 104341:0:(llog.c:773:llog_process_thread()) lustre-MDT0000-osp-MDT0001 retry remote llog process
[ 3763.615924] LustreError: 104341:0:(llog.c:773:llog_process_thread()) Skipped 586 previous similar messages
[ 3771.598075] LustreError: 104341:0:(llog.c:472:llog_verify_record()) lustre-MDT0000-osp-MDT0001: record is too large: 0 > 32768
[ 3771.604754] LustreError: 104341:0:(llog.c:472:llog_verify_record()) Skipped 1653 previous similar messages
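To gauge how many errors each call site actually produced, the "Skipped N previous similar messages" lines can be added to the visible ones. The following is a minimal sketch, assuming the console format shown above; the function name tally_lustre_errors and the regular expressions are hypothetical helpers written for this ticket, not part of Lustre or Maloo.

```python
import re
from collections import Counter

# Assumed console format, taken from the excerpt above:
# [ <time>] LustreError: <pid>:<n>:(<file>:<line>:<func>()) <message>
LINE_RE = re.compile(
    r"\[\s*\d+\.\d+\]\s+LustreError:\s+\d+:\d+:"
    r"\((?P<file>[\w.]+):(?P<line>\d+):(?P<func>\w+)\(\)\)\s+(?P<msg>.*)"
)
SKIP_RE = re.compile(r"Skipped (\d+) previous similar messages")


def tally_lustre_errors(console_text):
    """Count LustreError messages per call site, including skipped ones."""
    counts = Counter()
    for raw in console_text.splitlines():
        m = LINE_RE.search(raw)
        if not m:
            continue  # not a LustreError line in the assumed format
        site = f"{m['file']}:{m['line']}:{m['func']}()"
        skipped = SKIP_RE.search(m["msg"])
        # A "Skipped N" line stands for N suppressed messages;
        # any other line is one message that was actually printed.
        counts[site] += int(skipped.group(1)) if skipped else 1
    return counts


# Four lines copied from the console excerpt in this ticket.
SAMPLE = """\
[ 3763.601623] LustreError: 104341:0:(llog.c:656:llog_process_thread()) lustre-MDT0000-osp-MDT0001: invalid record in llog [0x2:0x11d41:0x2] record for index 0/2: rc = -22
[ 3763.603953] LustreError: 104341:0:(llog.c:656:llog_process_thread()) Skipped 1173 previous similar messages
[ 3763.611056] LustreError: 104341:0:(llog.c:482:llog_verify_record()) lustre-MDT0000-osp-MDT0001: magic 0 is bad
[ 3763.612678] LustreError: 104341:0:(llog.c:482:llog_verify_record()) Skipped 586 previous similar messages
"""

if __name__ == "__main__":
    for site, n in tally_lustre_errors(SAMPLE).most_common():
        print(f"{n:6d}  {site}")
```

Run against a full console log, this gives a per-call-site total that can be compared between the failing run and a passing one.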

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
replay-dual test_26 - Timeout occurred after 120 mins, last suite running was replay-dual



 Comments   
Comment by Etienne Aujames [ 04/Jul/23 ]

+1 on b2_15: https://testing.whamcloud.com/test_sets/cb0e6468-58d2-426c-9403-86004e5b69ae
