[LU-15354] recovery-double-scale: osp_trans_commit_cb()) ASSERTION( atomic_read(&oth->ot_refcount) > 0 ) failed Created: 09/Dec/21  Updated: 09/Dec/21

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug
Priority: Minor
Reporter: Maloo
Assignee: WC Triage
Resolution: Unresolved
Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 9223372036854775807

Description

This issue was created by maloo for Elena <elena.gryaznova@hpe.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/0c01c94d-cd5f-4460-95c7-01d5a412ab15

[  225.680289] LustreError: 6477:0:(tgt_lastrcvd.c:1495:tgt_last_rcvd_update()) lustre-MDT0000: trying to overwrite bigger transno:on-disk: 12884948250, new: 12884948247 replay: 1. See LU-617.
[  225.684012] LustreError: 6477:0:(osd_handler.c:2079:osd_trans_stop()) lustre-MDT0000: failed in transaction hook: rc = -75
[  225.685844] LustreError: 3492:0:(osp_trans.c:1009:osp_trans_commit_cb()) ASSERTION( atomic_read(&oth->ot_refcount) > 0 ) failed: 
[  225.687666] LustreError: 3492:0:(osp_trans.c:1009:osp_trans_commit_cb()) LBUG
[  225.688800] Pid: 3492, comm: ptlrpcd_rcv 4.18.0-240.22.1.el8_lustre.x86_64 #1 SMP Mon Nov 8 22:25:55 UTC 2021
[  225.690348] Call Trace TBD:
[  225.690978] [<0>] libcfs_call_trace+0x6f/0x90 [libcfs]
[  225.691797] [<0>] lbug_with_loc+0x43/0x80 [libcfs]
[  225.692634] [<0>] osp_trans_commit_cb+0x108/0x110 [osp]
[  225.693475] [<0>] osp_request_commit_cb+0xa7/0x270 [osp]
[  225.694714] [<0>] after_reply+0xd88/0xe10 [ptlrpc]
[  225.695526] [<0>] ptlrpc_check_set+0x9b3/0x2140 [ptlrpc]
[  225.696422] [<0>] ptlrpcd+0x6c1/0xa50 [ptlrpc]
[  225.697159] [<0>] kthread+0x112/0x130
[  225.697770] [<0>] ret_from_fork+0x35/0x40
[  225.698420] Kernel panic - not syncing: LBUG
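
For triage, the transno values in the first log line can be pulled out and compared mechanically. The following is a minimal sketch, not part of the original report: the helper name `parse_transno` and the regular expression are hypothetical, and the split of a transno into an upper 32-bit epoch and a lower 32-bit counter is an assumption about the encoding that should be checked against the Lustre source before relying on it.

```python
import re

# First log line, copied verbatim from the report above.
LINE = (
    "[  225.680289] LustreError: 6477:0:(tgt_lastrcvd.c:1495:tgt_last_rcvd_update()) "
    "lustre-MDT0000: trying to overwrite bigger transno:on-disk: 12884948250, "
    "new: 12884948247 replay: 1. See LU-617."
)

# Hypothetical pattern matching the "on-disk / new / replay" fields of that message.
TRANSNO_RE = re.compile(r"on-disk: (\d+), new: (\d+) replay: (\d+)")


def parse_transno(line):
    """Return the transno fields from a tgt_last_rcvd_update() message, or None."""
    m = TRANSNO_RE.search(line)
    if m is None:
        return None
    on_disk, new, replay = (int(g) for g in m.groups())
    return {
        "on_disk": on_disk,
        "new": new,
        "replay": replay,
        # How far behind the on-disk value the replayed transno is.
        "gap": on_disk - new,
        # ASSUMPTION: upper 32 bits are an epoch, lower 32 bits a counter.
        "on_disk_epoch": on_disk >> 32,
        "on_disk_counter": on_disk & 0xFFFFFFFF,
    }


info = parse_transno(LINE)
print(info)
```

In this log the replayed transno is behind the on-disk value, which is the condition the message itself reports. The `rc = -75` on the following line corresponds to `EOVERFLOW` on Linux.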
