LFSCK II: MDT-OST consistency check/repair
(LU-1267)
|
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.6.0 |
| Fix Version/s: | Lustre 2.6.0 |
| Type: | Technical task | Priority: | Critical |
| Reporter: | Jodi Levi (Inactive) | Assignee: | nasf (Inactive) |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Attachments: |
|
| Rank (Obsolete): | 8490 |
| Description |
|
Create the test plan for all of LFSCK Phase II and attach to the Jira ticket so that the test engineers know how to test the feature for the release. |
| Comments |
| Comment by nasf (Inactive) [ 18/Nov/13 ] |
|
FLSCK 2 test plan (ldiskfs only) 1. Correctness 2. Performance 2.1) LFSCK against healthy 2.x system for consistency routine check. 2.2) LFSCK against the lustre-2.x system with inconsistent layout OST-objects. 3. Small files create performance impact by LFSCK 4. Scale test 5. Resource requirement. |
| Comment by Jodi Levi (Inactive) [ 17/Dec/13 ] |
|
Link to Lustre Feature Test Plan Template: https://wiki.hpdd.intel.com/display/PMP/Lustre+Feature+Test+Plan+Template |
| Comment by Jodi Levi (Inactive) [ 03/Mar/14 ] |
|
Test plan is attached to ticket. Will update test plan as needed but will close this ticket as complete. |
| Comment by Cory Spitz [ 28/Mar/14 ] |
|
Thanks, Jodi. The attached plan has embedded links to things like https://wiki.hpdd.intel.com/display/ENG/Regression+Test+Suites+and+Failover+ |
| Comment by Jodi Levi (Inactive) [ 31/Mar/14 ] |
|
Sorry if the mention of the internal links in the Test Plan Template caused confusion. We will avoid using those links in any actual test plans. Please let me know if you see any of these links in actual test plans, and I will get it corrected. |
| Comment by Jodi Levi (Inactive) [ 30/Apr/14 ] |
|
The test plan needs to be updated based on changes made to the functionality since the plan was written. James Nunez has the details. I will ask him to provide in this ticket. |
| Comment by James Nunez (Inactive) [ 30/Apr/14 ] |
|
With recent patch landings to LFSCK, there are a few more options to choose from. Should we incorporate some of these into the existing test plan? All of the new options need to be tested, but for for any of the existing tests, that revolve around performance, do we want to : Since XATTR_NAME_FID does not exist, in test 2.2, is setting fail_loc to OBD_LFSCK_UNMATCHED_PAIR* or OBD_LFSCK_INVALID_PFID just as good or will repairing different failures cause dramatically different performance results? Is LFSCK_DANGLING preferred? |
| Comment by nasf (Inactive) [ 30/Apr/14 ] |
|
The new options "-c" has been tested in the sanity-lfsck.sh test_14/test_18; the new options "-o" has been tested by the sanity-lfsck.sh test_18. They are part of the test plan 1.1) LFSCK II test plan only needs to cover layout LFSCK, not necessary for namespace LFSCK which has already been done in LFSCK 1.5 Currently, the LFSCK use single async OUT RPC to repair the inconsistency except the orphan handling. Means in spite of dangling reference or unmatched MDT-OST pairs or multiple references or inconsistent owner, they should not affect the repairing performance much. So I select the dangling reference case in the lfsck-performance.sh, which is easy to be simulated. I do not think we need to test all kinds of different inconsistency performance unless we have strong requirement and enough time to do that. |
| Comment by Jodi Levi (Inactive) [ 01/May/14 ] |
|
James' questions have been answered and the test plan does not need to be updated at this point. |
| Comment by James Nunez (Inactive) [ 29/Jul/14 ] |
|
Here are the results for the LFSCK Phase II test plan. I plan to add past results to and clean up the presentation of results in this document. |