[LU-16181] validate_lod_and_idx() message is missing information Created: 22/Sep/22  Updated: 22/Sep/22

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Andreas Dilger Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Issue Links:
Related
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

We've seen errors like "validate_lod_and_idx() fs-MDT0001: bad idx 103 of 32" in cases where there are sparse OST index numbers (e.g. OST index 0-15, 100-115 in use).

Firstly, the message doesn't print anything about the actual file FID that is having this problem, which should be done in lod_initialize_objects() if validate_lod_and_idx() returns an error.

Secondly, the problem itself shouldn't happen. I suspect it relates to the sparse vs. dense OST index being checked. ltd_tgts_size is the size of the ltd_tgt_idx[] array, but not necessarily the maximum index in the array. lod_initialize_objects() is calling this function with l_ost_idx, which appears to be correct ("/* OST index in LOV (lov_tgt_desc->tgts) */" but is clearly not in some cases.


Generated at Sat Feb 10 03:24:44 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.