[LU-8760] sanity-lfsck test 31g hung - Whamcloud Community JIRA

Details

Type: Bug
Resolution: Fixed
Priority: Major
Fix Version/s: Lustre 2.10.1, Lustre 2.11.0, Lustre 2.13.0
Affects Version/s: Lustre 2.9.0
Labels:
None

Severity:
3
Rank (Obsolete):
9223372036854775807

Description

With patch http://review.whamcloud.com/7200 on master branch, sanity-lfsck test 31g hung as follows:

== sanity-lfsck test 31g: Repair the corrupted slave LMV EA ========================================== 06:38:18 (1476970698)
#####
For some reason, the stripe index in the slave LMV EA is
corrupted. The LFSCK should repair the slave LMV EA.
#####
Inject failure stub on MDT0 to simulate the case that the
slave LMV EA on the first shard of the striped directory
claims the same index as the second shard claims
CMD: onyx-35vm7 /usr/sbin/lctl set_param fail_loc=0x162b fail_val=0
fail_loc=0x162b
fail_val=0
CMD: onyx-35vm7 /usr/sbin/lctl set_param fail_loc=0x0 fail_val=0
fail_loc=0x0
fail_val=0
Trigger namespace LFSCK to repair the slave LMV EA
CMD: onyx-35vm7 /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A
Started LFSCK on the device lustre-MDT0000: scrub namespace
CMD: onyx-35vm7 /usr/sbin/lctl lfsck_query -t namespace -M lustre-MDT0000 -w |
		      awk '/^namespace_mdts_completed/ { print \$2 }'

Maloo report: https://testing.hpdd.intel.com/test_sets/7e295098-96fa-11e6-a763-5254006e85c2

Attachments

Activity

People

Assignee:: nasf (Inactive)

Reporter:: Jian Yu

Votes:: 0 Vote for this issue

Watchers:: 8 Start watching this issue

Dates

Created:: 26/Oct/16 2:20 PM

Updated:: 24/Aug/22 8:21 AM

Resolved:: 08/Apr/19 2:20 PM