[LU-587] MGS has corrupted config llog but forbids re-registering a target Created: 10/Aug/11 Updated: 28/Oct/11 Resolved: 28/Oct/11 |
|
| Status: | Closed |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Minor |
| Reporter: | Alexey Lyashkov | Assignee: | Andreas Dilger |
| Resolution: | Incomplete | Votes: | 0 |
| Labels: | None | ||
| Environment: |
lustre 2.1/RHEL6 |
||
| Severity: | 2 |
| Rank (Obsolete): | 6570 |
| Description |
|
After a hard reset, the MGS configuration llog can be truncated. If the OST config llog is also corrupted, this prevents the OST from starting. The same situation can also occur with an MDT. |
| Comments |
| Comment by Peter Jones [ 24/Aug/11 ] |
|
Shadow, can you please resubmit your patch - http://review.whamcloud.com/#change,1209 - (after merging with current master so that the hash changes)? Unfortunately, it was submitted at a time when our testing infrastructure was having problems, so it was not test-built and tested. Thanks |
| Comment by Alexey Lyashkov [ 24/Aug/11 ] |
|
Peter, Done |
| Comment by Peter Jones [ 24/Aug/11 ] |
|
Thx |
| Comment by Andreas Dilger [ 20/Sep/11 ] |
|
I'm trying to understand this bug and the associated patch, but there is not nearly enough explanation in either to understand what is broken, how it is breaking, or why the patch would fix it. Error messages from the console or kernel debug log would definitely help, as would an explanation of why opening a file read-write would allow it to be corrupted on a hard reset, and why opening it read-only prevents that corruption. Is there a specific test case that causes this corruption? I can't imagine that the local config llog gets corrupted after a hard reset, regardless of the open mode, unless the reset happens immediately after the llog is written and before it has made it to disk. |
| Comment by Andreas Dilger [ 28/Oct/11 ] |
|
There isn't any useful information in this bug, or the patch. Closing until some information about how/when this problem is being seen is provided. |