[LU-7291] recovery-small timed out at 'lctl set_param osd-ldiskfs.track_declares_assert=1' Created: 13/Oct/15  Updated: 13/Oct/21  Resolved: 13/Oct/21

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.8.0
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Cannot Reproduce Votes: 0
Labels: None
Environment:

Server/ Client - 2.7.61, build# 3205, RHEL 7.1


Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for Saurabh Tandan <saurabh.tandan@intel.com>

This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/ed368e46-6d35-11e5-a8d6-5254006e85c2.

The sub-test recovery-small failed with the following error:

test failed to respond and timed out

Couldn't find much useful information. No logs present. got following information from lustre-initialization_1 mds 1 console:

21:01:17:[41261.364028] Lustre: DEBUG MARKER: /usr/sbin/lctl set_param 				 osd-ldiskfs.track_declares_assert=1 || true
21:01:17:[41261.695493] Lustre: DEBUG MARKER: lctl set_param -n mdt.lustre*.enable_remote_dir=1
21:01:17:[41261.841562] Lustre: lustre-MDT0000-o: trigger OI scrub by RPC for [0x200019a43:0xb0d:0x0], rc = 0 [1]
21:01:17:[41262.844721] Lustre: lustre-MDT0000-o: trigger OI scrub by RPC for [0x200019a43:0xb0d:0x0], rc = 0 [1]
21:01:17:[41264.849450] Lustre: lustre-MDT0000-o: trigger OI scrub by RPC for [0x200019a43:0xb0d:0x0], rc = 0 [1]
21:01:17:[41267.855382] Lustre: lustre-MDT0000-o: trigger OI scrub by RPC for [0x200019a43:0xb0d:0x0], rc = 0 [1]
21:01:17:[41271.864129] Lustre: lustre-MDT0000-o: trigger OI scrub by RPC for [0x200019a43:0xb0d:0x0], rc = 0 [1]
21:01:17:[41282.875454] Lustre: lustre-MDT0000-o: trigger OI scrub by RPC for [0x200019a43:0xb0d:0x0], rc = 0 [1]
21:01:17:[41282.877456] Lustre: Skipped 1 previous similar message
21:01:17:[41306.889950] Lustre: lustre-MDT0000-o: trigger OI scrub by RPC for [0x200019a43:0xb0d:0x0], rc = 0 [1]
21:01:17:[41306.892203] Lustre: Skipped 2 previous similar messages
21:01:17:[41339.902443] Lustre: lustre-MDT0000-o: trigger OI scrub by RPC for [0x200019a43:0xb0d:0x0], rc = 0 [1]
21:01:17:[41339.904486] Lustre: Skipped 2 previous similar messages
21:58:55:********** Timeout by autotest system **********


 Comments   
Comment by James Nunez (Inactive) [ 06/Jun/16 ]

In the suite_stdout log, the last thing we see is

20:58:36:Quota settings for sanityusr1 : 
20:58:36:Disk quotas for user sanityusr1 (uid 501):
20:58:36:     Filesystem  kbytes   quota   limit   grace   files   quota   limit   grace
20:58:36:    /mnt/lustre       0  13869872 14563365       -       0  916000  961800       -
20:58:36:lustre-MDT0000_UUID
20:58:36:                      0       -       0       -       0       -       0       -
20:58:36:lustre-OST0000_UUID
20:58:36:                      0       -       0       -       -       -       -       -
20:58:36:lustre-OST0001_UUID
20:58:36:                      0       -       0       -       -       -       -       -
20:58:36:lustre-OST0002_UUID
20:58:36:                      0       -       0       -       -       -       -       -
20:58:36:lustre-OST0003_UUID
20:58:36:                      0       -       0       -       -       -       -       -
20:58:36:lustre-OST0004_UUID
20:58:36:                      0       -       0       -       -       -       -       -
20:58:36:lustre-OST0005_UUID
20:58:36:                      0       -       0       -       -       -       -       -
20:58:36:lustre-OST0006_UUID
20:58:36:                      0       -       0       -       -       -       -       -
20:58:36:Total allocated inode limit: 0, total allocated block limit: 0
20:58:48:CMD: shadow-44vm3,shadow-44vm4 /usr/sbin/lctl set_param 				 osd-ldiskfs.track_declares_assert=1 || true
21:58:56:********** Timeout by autotest system **********
Generated at Sat Feb 10 02:07:37 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.