[LU-3420] OI scrubbing could not automatically engage after restoring a secondary MDT from a (file-level) backup Created: 30/May/13 Updated: 13/Sep/13 Resolved: 10/Jul/13 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.4.0 |
| Fix Version/s: | Lustre 2.4.1, Lustre 2.5.0 |
| Type: | Bug | Priority: | Critical |
| Reporter: | Li Wei (Inactive) | Assignee: | nasf (Inactive) |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | mq313 | ||
| Severity: | 3 | ||||||||
| Rank (Obsolete): | 8481 | ||||||||
| Description |
|
When adapting sanity-scrub 4 to exercise not only MDT 0 but also the secondary MDTs, I found that, after restoring a secondary MDT from its file-level backup, looking up the corresponding "remote" directory returned ENOENT on clients:
"mdt1" was created with "lfs mkdir -i 1". OI scrubbing also did not engage automatically:
The debug log shows that MDT 0 sent an UPDATE_OBJ OBJ_ATTR_GET RPC to MDT 1. The FID was found in the OI but the ino was (naturally) stale:
According to osd_fid_lookup(), OI scrubbing is not triggered in this case. |
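The failure mode described above can be illustrated with a toy model. This is only a sketch under stated assumptions, not Lustre's actual code: `ToyOSD`, its methods, the `trigger_scrub_on_stale` flag, and the inode numbers are all hypothetical. It models the OI as a plain FID-to-inode-number map that a file-level restore leaves pointing at old inode numbers, and shows how a direct FID lookup that hits a stale entry returns "not found" without ever requesting a scrub unless the lookup path explicitly does so.

```python
from dataclasses import dataclass, field


@dataclass
class Inode:
    ino: int
    fid: str  # FID recorded with the inode itself (hypothetical representation)


@dataclass
class ToyOSD:
    """Hypothetical stand-in for an OSD with an Object Index (OI)."""
    inodes: dict = field(default_factory=dict)  # ino -> Inode
    oi: dict = field(default_factory=dict)      # FID -> ino
    scrub_requested: bool = False

    def create(self, ino, fid):
        self.inodes[ino] = Inode(ino, fid)
        self.oi[fid] = ino

    def restore_file_level(self, ino_shift):
        # A file-level restore re-creates every file, so inode numbers
        # change, while the OI contents are restored verbatim and now
        # point at the old inode numbers.
        self.inodes = {
            ino + ino_shift: Inode(ino + ino_shift, node.fid)
            for ino, node in self.inodes.items()
        }

    def fid_lookup(self, fid, trigger_scrub_on_stale):
        ino = self.oi.get(fid)
        node = self.inodes.get(ino) if ino is not None else None
        if node is not None and node.fid == fid:
            return node
        # The OI entry is missing or stale. Without an explicit trigger
        # here, the caller just sees "not found" (ENOENT on the client).
        if trigger_scrub_on_stale:
            self.scrub_requested = True
        return None

    def scrub(self):
        # Rebuild the OI from the FIDs recorded with the inodes.
        self.oi = {node.fid: ino for ino, node in self.inodes.items()}
        self.scrub_requested = False


osd = ToyOSD()
osd.create(100, "fid-A")
osd.restore_file_level(ino_shift=1000)

# Direct FID lookup with no trigger: fails silently, no scrub requested.
missing = osd.fid_lookup("fid-A", trigger_scrub_on_stale=False)
print(missing, osd.scrub_requested)

# With the trigger, the same failed lookup requests a scrub; after the
# scrub rebuilds the OI, the lookup succeeds again.
osd.fid_lookup("fid-A", trigger_scrub_on_stale=True)
osd.scrub()
print(osd.fid_lookup("fid-A", trigger_scrub_on_stale=True).ino)
```

In this model the first lookup corresponds to the behavior reported here, and the second corresponds to what the reporter expected: a stale OI mapping detected during a direct FID lookup should be enough to start scrubbing.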
| Comments |
| Comment by Li Wei (Inactive) [ 30/May/13 ] |
|
Attached the debug log. Note that this was a single-node setup. |
| Comment by Li Wei (Inactive) [ 30/May/13 ] |
|
CC'ed Wang Di and Fan Yong. |
| Comment by Li Wei (Inactive) [ 30/May/13 ] |
|
This and |
| Comment by Andreas Dilger [ 30/May/13 ] |
|
Fan Yong, I understand that remote directory checking for DNE MDTs is part of LFSCK Phase III, but could you please investigate what work would be needed to fix the file-level backup/restore? Li Wei, do you know if this is a problem on mdt0 or mdt1? Were both of them backed up and restored, or just mdt1? |
| Comment by Li Wei (Inactive) [ 31/May/13 ] |
|
Andreas, all MDTs (MDSCOUNT=2, so both MDT 0 and 1) were backed up and restored during the test. The problem, as I understood from discussing it with Fan Yong yesterday, was on MDT 1: a direct FID lookup (without a prior name lookup) does not trigger OI scrubbing. |
| Comment by nasf (Inactive) [ 01/Jun/13 ] |
|
I have made a patch to fix it: The reason is described in the patch commit message. |
| Comment by nasf (Inactive) [ 10/Jul/13 ] |
|
The patch has landed in Lustre 2.5.