[LU-15432] sanity-lfsck test_18d: Expect 2 orphans have been fixed, but got: 0 (b2_12) Created: 11/Jan/22  Updated: 11/Jan/22

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for eaujames <eaujames@ddn.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/4a1f9e10-3c71-4b74-80e4-155ce972dbed

test_18d failed with the following error:

(5) Expect 2 orphans have been fixed, but got: 0

This issue has been seen on the b2_12 branch (2.12.8.x).
In the debug log on the MDT side we have only the dangling entries:

00100000:10000000:0.0:1641596880.896251:0:2858:0:(lfsck_layout.c:1576:lfsck_layout_ins_dangling_rec()) lustre-MDT0000-osd: insert the paris [0x200003ab1:0x26:0x0] => [0x100000000:0x8b3:0x0], comp_id = 0, ea_off = 0, ost_idx = 0, into the trace file for further dangling check: rc = 0
00100000:10000000:0.0:1641596880.896688:0:2858:0:(lfsck_layout.c:1576:lfsck_layout_ins_dangling_rec()) lustre-MDT0000-osd: insert the paris [0x200003ab1:0x28:0x0] => [0x100000000:0x8b5:0x0], comp_id = 1, ea_off = 0, ost_idx = 0, into the trace file for further dangling check: rc = 0
00100000:10000000:0.0:1641596881.005649:0:2858:0:(lfsck_layout.c:3644:lfsck_layout_repair_dangling()) lustre-MDT0000-osd: layout LFSCK assistant found dangling reference for: parent [0x200003ab1:0x26:0x0], child [0x100000000:0x8b3:0x0], comp_id 0, ea_off 0, ost_idx 0, Create the lost OST-object as required: rc = 1
00100000:10000000:0.0:1641596881.005764:0:2858:0:(lfsck_layout.c:3644:lfsck_layout_repair_dangling()) lustre-MDT0000-osd: layout LFSCK assistant found dangling reference for: parent [0x200003ab1:0x28:0x0], child [0x100000000:0x8b5:0x0], comp_id 1, ea_off 0, ost_idx 0, Create the lost OST-object as required: rc = 1

On the OST side we have some orphan but not matching f1...f4:

00100000:10000000:1.0:1641596719.453528:0:9351:0:(lfsck_layout.c:7373:lfsck_orphan_it_next()) lustre-OST0001-osd: return orphan [0x340000402:0xa2:0x0], PFID [0x240002340:0x2:0x0], owner 1:1, stripe size 1048576, stripe count 2, COMP id 0, COMP start 0, COMP end 0, layout version 0, range 0
00100000:10000000:1.0:1641596719.456024:0:9351:0:(lfsck_layout.c:7373:lfsck_orphan_it_next()) lustre-OST0002-osd: return orphan [0x380000401:0xa2:0x0], PFID [0x240002340:0x2:0x1], owner 1:1, stripe size 1048576, stripe count 2, COMP id 0, COMP start 0, COMP end 0, layout version 0, range 0
00100000:10000000:0.0:1641596719.459582:0:18277:0:(lfsck_layout.c:7373:lfsck_orphan_it_next()) lustre-OST0000-osd: return orphan [0x100000000:0x8ad:0x0], PFID [0x200003ab1:0x16:0x0], owner 1:1, stripe size 1048576, stripe count 1, COMP id 0, COMP start 0, COMP end 0, layout version 0, range 0
00100000:10000000:0.0:1641596719.459594:0:18277:0:(lfsck_layout.c:7373:lfsck_orphan_it_next()) lustre-OST0000-osd: return orphan [0x100000000:0x8ae:0x0], PFID [0x200003ab1:0x17:0x0], owner 1:1, stripe size 1048576, stripe count 1, COMP id 1, COMP start 0, COMP end 1048576, layout version 0, range 0
00100000:10000000:0.0:1641596719.466695:0:18277:0:(lfsck_layout.c:7373:lfsck_orphan_it_next()) lustre-OST0001-osd: return orphan [0x100010000:0x764:0x0], PFID [0x200003ab1:0x17:0x0], owner 1:1, stripe size 1048576, stripe count 1, COMP id 2, COMP start 1048576, COMP end 18446744073709551615, layout version 0, range 0
00100000:10000000:1.0:1641596732.678386:0:9351:0:(lfsck_layout.c:7373:lfsck_orphan_it_next()) lustre-OST0001-osd: return orphan [0x340000402:0xa3:0x0], PFID [0x240002340:0x4:0x0], owner 0:0, stripe size 1048576, stripe count 2, COMP id 0, COMP start 0, COMP end 0, layout version 0, range 0
00100000:10000000:1.0:1641596732.680987:0:9351:0:(lfsck_layout.c:7373:lfsck_orphan_it_next()) lustre-OST0002-osd: return orphan [0x380000401:0xa3:0x0], PFID [0x240002340:0x4:0x1], owner 0:0, stripe size 1048576, stripe count 2, COMP id 0, COMP start 0, COMP end 0, layout version 0, range 0
00100000:10000000:1.0:1641596733.011126:0:18277:0:(lfsck_layout.c:7373:lfsck_orphan_it_next()) lustre-OST0002-osd: return orphan [0x100020000:0x724:0x0], PFID [0x200003ab1:0x1c:0x0], owner 0:0, stripe size 1048576, stripe count 1, COMP id 2, COMP start 1048576, COMP end 18446744073709551615, layout version 0, range 0
00100000:10000000:1.0:1641596733.017586:0:18277:0:(lfsck_layout.c:7373:lfsck_orphan_it_next()) lustre-OST0000-osd: return orphan [0x100000000:0x8af:0x0], PFID [0x200003ab1:0x1b:0x0], owner 0:0, stripe size 1048576, stripe count 1, COMP id 0, COMP start 0, COMP end 0, layout version 0, range 0
00100000:10000000:1.0:1641596733.017591:0:18277:0:(lfsck_layout.c:7373:lfsck_orphan_it_next()) lustre-OST0000-osd: return orphan [0x100000000:0x8b0:0x0], PFID [0x200003ab1:0x1c:0x0], owner 0:0, stripe size 1048576, stripe count 1, COMP id 1, COMP start 0, COMP end 1048576, layout version 0, range 0

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
sanity-lfsck test_18d - (5) Expect 2 orphans have been fixed, but got: 0


Generated at Sat Feb 10 03:18:16 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.