[LU-587] mgs have corrupted config llog but forbid to re-register a target Created: 10/Aug/11  Updated: 28/Oct/11  Resolved: 28/Oct/11

Status: Closed
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Alexey Lyashkov Assignee: Andreas Dilger
Resolution: Incomplete Votes: 0
Labels: None
Environment:

lustre 2.1/RHEL6


Severity: 2
Rank (Obsolete): 6570

 Description   

after hard reset mgs configuration llog can truncated. in case OST config llog also corrupted it's prevent ost to be started. Same situation can be occurred with MDT also.
it's easy replicated by truncate mgs config llog to zero.
In that case mgs have bit set - 'that target registered' and forbid to re-register target, but none configuration send to OST.



 Comments   
Comment by Peter Jones [ 24/Aug/11 ]

Shadow, can you please resubmit your patch - http://review.whamcloud.com/#change,1209 - again (after merging with current master so that the hash changes). Unfortunately it was submitted at unfortunate time where our testing infrastructure was having problems and os it was not test-built and tested. Thanks

Comment by Alexey Lyashkov [ 24/Aug/11 ]

Peter,

Done

Comment by Peter Jones [ 24/Aug/11 ]

Thx

Comment by Andreas Dilger [ 20/Sep/11 ]

I'm trying to understand this bug and the associated patch, but there is not nearly enough explanation in either to understand what is broken, how it is breaking, or why the patch would fix it. Error messages from the console or kernel debug log would definitely help, as to an explanation of why opening a file read-write would allow it to be corrupted on a hard reset, and why opening it read-only prevents that corruption.

Is there a specific test case that causes this corruption? I can't imagine that the local config llog gets corrupted after a hard reset, regardless of the open mode, unless the reset happens immediately after the llog is written and has not yet made it to disk.

Comment by Andreas Dilger [ 28/Oct/11 ]

There isn't any useful information in this bug, or the patch. Closing until some information about how/when this problem is being seen is provided.

Generated at Sat Feb 10 01:08:30 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.