[LU-10922] sanity-lfsck test_23b: (9) Fail to repair dangling name entry: 0
| Created: | 18/Apr/18 | Updated: | 15/Feb/21 |
|
| Status: | Reopened |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.13.0, Lustre 2.14.0 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Minor |
| Reporter: | Maloo | Assignee: | WC Triage |
| Resolution: | Unresolved | Votes: | 0 |
| Labels: | zfs |
| Issue Links: |
|
| Severity: | 3 |
| Rank (Obsolete): | 9223372036854775807 |
| Description |
|
This issue was created by maloo for nasf <fan.yong@intel.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/d32d2bc0-428c-11e8-b45c-52540065bddc

Inject failure stub on MDT0 to simulate dangling name entry
fail_val=130
fail_loc=0x1621
fail_val=0
fail_loc=0
- unlinked 0 (time 1573179943 ; total 0 ; last 0)
total: 10 unlinks in 0 seconds: inf unlinks/second
'ls' should fail because of dangling name entry
Trigger namespace LFSCK to find out dangling name entry
Started LFSCK on the device lustre-MDT0000: scrub namespace
sanity-lfsck test_23b: @@@@@@ FAIL: (9) Fail to repair dangling name entry: 0 |
| Comments |
| Comment by nasf (Inactive) [ 18/Apr/18 ] |
|
The MDS logs show that:

00100000:10000000:1.0:1523995128.651061:0:23636:0:(lfsck_namespace.c:5706:lfsck_namespace_assistant_handler_p1()) lustre-MDT0000-osd: namespace LFSCK assistant fail to handle the entry: [0x200003ab2:0x89:0x0], parent [0x200003ab2:0x7d:0x0], name foo: rc = -61

The object [0x200003ab2:0x89:0x0] had been removed just before the LFSCK ran, but the LFSCK logic was not aware of that. |
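For anyone triaging similar failures, the log line above can be picked apart mechanically. The following is a minimal sketch, not part of Lustre or its test framework: the regular expression, the `parse_entry_failure` helper, and the small errno table are hypothetical and were written against this one message only. On Linux, errno 61 is ENODATA, so `rc = -61` corresponds to `-ENODATA`.

```python
import re

# Log line copied verbatim from the MDS debug log quoted above.
LOG_LINE = (
    "00100000:10000000:1.0:1523995128.651061:0:23636:0:"
    "(lfsck_namespace.c:5706:lfsck_namespace_assistant_handler_p1()) "
    "lustre-MDT0000-osd: namespace LFSCK assistant fail to handle the entry: "
    "[0x200003ab2:0x89:0x0], parent [0x200003ab2:0x7d:0x0], name foo: rc = -61"
)

# Hypothetical pattern for this specific message; not a general Lustre log parser.
ENTRY_RE = re.compile(
    r"fail to handle the entry: "
    r"\[(?P<fid>[^\]]+)\], parent \[(?P<parent>[^\]]+)\], "
    r"name (?P<name>\S+): rc = (?P<rc>-?\d+)"
)

# Linux errno names, hardcoded because Python's errno module is platform dependent.
LINUX_ERRNO = {2: "ENOENT", 61: "ENODATA"}


def parse_entry_failure(line):
    """Return the FID, parent FID, entry name and return code, or None if no match."""
    match = ENTRY_RE.search(line)
    if match is None:
        return None
    rc = int(match.group("rc"))
    return {
        "fid": match.group("fid"),
        "parent": match.group("parent"),
        "name": match.group("name"),
        "rc": rc,
        "errno": LINUX_ERRNO.get(-rc, "unknown"),
    }


info = parse_entry_failure(LOG_LINE)
print(info)
```

The sketch assumes entry names without whitespace; names containing spaces would need a different pattern.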
| Comment by Gerrit Updater [ 18/Apr/18 ] |
|
Fan Yong (fan.yong@intel.com) uploaded a new patch: https://review.whamcloud.com/32042 |
| Comment by Gerrit Updater [ 06/May/18 ] |
|
Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/32042/ |
| Comment by Peter Jones [ 06/May/18 ] |
|
Landed for 2.12 |
| Comment by Nathaniel Clark [ 21/Nov/18 ] |
|
This is still happening on master: |
| Comment by Minh Diep [ 22/Mar/19 ] |
|
+1 on b2_12: https://testing.whamcloud.com/test_sets/5405e0be-4c6b-11e9-9646-52540065bddc |
| Comment by Jian Yu [ 19/Jun/19 ] |
|
+1 on master: https://testing.whamcloud.com/test_sets/c8b4d914-8d58-11e9-abe3-52540065bddc |
| Comment by Chris Horn [ 14/Jul/19 ] |
|
+1 on master: https://testing.whamcloud.com/test_sets/5506db70-a455-11e9-8fc1-52540065bddc |
| Comment by Jian Yu [ 15/Aug/19 ] |
|
+1 on master branch: https://testing.whamcloud.com/test_sets/90443a02-bf1c-11e9-98c8-52540065bddc |
| Comment by Andreas Dilger [ 08/Nov/19 ] |
|
During the past 4 weeks 8 of 391 runs failed (~2% failure rate), and all of the failures were on ZFS filesystems. |
| Comment by Emoly Liu [ 03/Feb/20 ] |
| Comment by Etienne Aujames [ 15/Feb/21 ] |
|
+1 on b2_12: https://testing.whamcloud.com/sub_tests/f12f033c-06a6-451d-b846-70b97b2c0f76 |