[LU-2493] osp_precreate.c:781:osp_precreate_get_id()) ASSERTION( d->opd_pre_used_id < d->opd_pre_last_created ) Created: 13/Dec/12  Updated: 20/Aug/15  Resolved: 20/Aug/15

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Critical
Reporter: Oleg Drokin Assignee: WC Triage
Resolution: Duplicate Votes: 0
Labels: None

Issue Links:
Duplicate
duplicates LU-2285 Test failure on replay-ost-single tes... Resolved
Severity: 3
Rank (Obsolete): 5848

 Description   

Got this assertion today while running replay-single test 0b:

[387592.038869] Lustre: lustre-OST0000: Recovery over after 0:01, of 2 clients 2
 recovered and 0 were evicted.
[387592.040120] Lustre: 5095:0:(ofd_obd.c:1069:ofd_orphans_destroy()) lustre-OST
0000: deleting orphan objects from 33 to 64
[387592.059354] LustreError: 5724:0:(osp_precreate.c:781:osp_precreate_get_id())
 ASSERTION( d->opd_pre_used_id < d->opd_pre_last_created ) failed: 
[387592.060459] LustreError: 5724:0:(osp_precreate.c:781:osp_precreate_get_id())
 LBUG
[387592.061232] Pid: 5724, comm: mdt01_002
[387592.061646] 
[387592.061648] Call Trace:
[387592.062737]  [<ffffffffa0afd915>] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
[387592.063817]  [<ffffffffa0afdf27>] lbug_with_loc+0x47/0xb0 [libcfs]
[387592.064372]  [<ffffffffa0da2038>] osp_precreate_get_id+0xb8/0xd0 [osp]
[387592.064836]  [<ffffffffa0d9e580>] osp_object_assign_id+0x80/0x2c0 [osp]
[387592.065311]  [<ffffffffa0d9f2a6>] osp_object_create+0x346/0x570 [osp]
[387592.065822]  [<ffffffffa0d66bb3>] lod_striping_create+0x193/0x320 [lod]
[387592.066286]  [<ffffffffa0d66ebb>] lod_object_create+0x17b/0x280 [lod]
[387592.066729]  [<ffffffffa0be82df>] mdd_object_create_internal+0xcf/0x210 [mdd
]
[387592.067462]  [<ffffffffa0bf9eb7>] mdd_create+0xb97/0x1550 [mdd]
[387592.067904]  [<ffffffffa0d00479>] mdt_reint_open+0x1079/0x1860 [mdt]
[387592.068369]  [<ffffffffa0bff71e>] ? md_ucred+0x1e/0x60 [mdd]
[387592.068790]  [<ffffffffa0ccb635>] ? mdt_ucred+0x15/0x20 [mdt]
[387592.069225]  [<ffffffffa0ceb371>] mdt_reint_rec+0x41/0xe0 [mdt]
[387592.069709]  [<ffffffffa0ce4af3>] mdt_reint_internal+0x4e3/0x7e0 [mdt]
[387592.070215]  [<ffffffffa0ce50bd>] mdt_intent_reint+0x1ed/0x500 [mdt]
[387592.070671]  [<ffffffffa0ce0c85>] mdt_intent_policy+0x3c5/0x800 [mdt]
[387592.071162]  [<ffffffffa067043a>] ldlm_lock_enqueue+0x2ea/0x890 [ptlrpc]
[387592.071651]  [<ffffffffa06993b7>] ldlm_handle_enqueue0+0x4f7/0x1090 [ptlrpc]
[387592.072167]  [<ffffffffa0ce07d6>] mdt_enqueue+0x46/0x130 [mdt]
[387592.072630]  [<ffffffffa0cd4802>] mdt_handle_common+0x932/0x1750 [mdt]
[387592.073114]  [<ffffffffa0cd56f5>] mdt_regular_handle+0x15/0x20 [mdt]
[387592.073644]  [<ffffffffa06c7953>] ptlrpc_server_handle_request+0x463/0xe70 [ptlrpc]
[387592.074405]  [<ffffffffa0afe66e>] ? cfs_timer_arm+0xe/0x10 [libcfs]
[387592.074862]  [<ffffffffa06c0621>] ? ptlrpc_wait_event+0xb1/0x2a0 [ptlrpc]
[387592.075335]  [<ffffffff81051f73>] ? __wake_up+0x53/0x70
[387592.075786]  [<ffffffffa06ca48d>] ptlrpc_main+0xb3d/0x18e0 [ptlrpc]
[387592.076278]  [<ffffffffa06c9950>] ? ptlrpc_main+0x0/0x18e0 [ptlrpc]
[387592.076757]  [<ffffffff8100c14a>] child_rip+0xa/0x20
[387592.077213]  [<ffffffffa06c9950>] ? ptlrpc_main+0x0/0x18e0 [ptlrpc]
[387592.077735]  [<ffffffffa06c9950>] ? ptlrpc_main+0x0/0x18e0 [ptlrpc]
[387592.078192]  [<ffffffff8100c140>] ? child_rip+0x0/0x20
[387592.078596] 
[387592.079217] Kernel panic - not syncing: LBUG


 Comments   
Comment by Li Wei (Inactive) [ 13/Dec/12 ]

This might be another race condition being fixed under LU-2285:

~ osp_sync_thread()           ~ mdd_create()
-----------------------------------
(Recovery completed.)
Sent orphan cleanup request.
                              osp_precreate_reserve()
Got orphan cleanup reply.
Emptied pre-creation window.
                              osp_precreate_get_id()
Comment by Andreas Dilger [ 18/Dec/12 ]

Tentatively marking a duplicate of LU-2285.

Generated at Sat Feb 10 01:25:40 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.