[LU-7934] overflow for tr->otr_next_id Created: 29/Mar/16  Updated: 14/Jun/18  Resolved: 26/Mar/18

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.8.0
Fix Version/s: Lustre 2.11.0

Type: Bug Priority: Major
Reporter: Alexander Boyko Assignee: WC Triage
Resolution: Fixed Votes: 0
Labels: bgti, patch

Issue Links:
Duplicate
is duplicated by LU-9626 LustreError: 16331:0:(osp_sync.c:1677... Resolved
Related
Severity: 3
Rank (Obsolete): 9223372036854775807

Description
crash> bt
PID: 231396  TASK: ffff880b73c5d540  CPU: 4   COMMAND: "mdt01_088"
 #0 [ffff880d7e5057e8] machine_kexec at ffffffff81038f3b
 #1 [ffff880d7e505848] crash_kexec at ffffffff810c5b02
 #2 [ffff880d7e505918] panic at ffffffff81524e75
 #3 [ffff880d7e505998] lbug_with_loc at ffffffffa04feeeb [libcfs]
 #4 [ffff880d7e5059b8] osp_sync_add_rec at ffffffffa10708eb [osp]
 #5 [ffff880d7e505a28] osp_sync_add at ffffffffa1070b07 [osp]
 #6 [ffff880d7e505a48] osp_object_destroy at ffffffffa1062346 [osp]
 #7 [ffff880d7e505a88] lod_object_destroy at ffffffffa1020b07 [lod]
 #8 [ffff880d7e505ac8] mdd_finish_unlink at ffffffffa0dc7538 [mdd]
 #9 [ffff880d7e505b38] mdd_unlink at ffffffffa0dc814a [mdd]
#10 [ffff880d7e505bf8] mdo_unlink at ffffffffa0e36458 [mdt]
#11 [ffff880d7e505c08] mdt_reint_unlink at ffffffffa0e3a24b [mdt]
#12 [ffff880d7e505c98] mdt_reint_rec at ffffffffa0e364ed [mdt]
#13 [ffff880d7e505cc8] mdt_reint_internal at ffffffffa0e1ca0b [mdt]
#14 [ffff880d7e505d08] mdt_reint at ffffffffa0e1d13b [mdt]
#15 [ffff880d7e505d48] tgt_request_handle at ffffffffa092287e [ptlrpc]
#16 [ffff880d7e505da8] ptlrpc_main at ffffffffa08d24d1 [ptlrpc]
#17 [ffff880d7e505ee8] kthread at ffffffff8109ac66
#18 [ffff880d7e505f48] kernel_thread at ffffffff8100c20a

The crash was triggered by the following LBUG:

<3>[11441257.937330] LustreError: 185667:0:(osp_sync.c:1544:osp_sync_id_get()) snx11126-OST0045-osc-MDT0000: next 0, last synced 4294967205
<0>[11441257.940592] LustreError: 231396:0:(osp_sync.c:1545:osp_sync_id_get()) LBUG

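The root cause is a u32 overflow of tr->otr_next_id: the counter wrapped around to 0 while the last synced id was still 4294967205 (2^32 - 91), as shown in the log above.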
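
A minimal sketch of the failure mode, using the two values from the log line above. This is not the Lustre code or the merged patch: the function names (advance_u32, id_is_newer_naive, id_is_newer_wrapsafe, advance_u64, replay_logged_values) are hypothetical, and the two remedies shown (a wrap-tolerant comparison and a 64-bit counter) are generic options, not a statement of what change 19190 does.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for a 32-bit id counter: unsigned arithmetic
 * wraps, so advancing UINT32_MAX yields 0. */
uint32_t advance_u32(uint32_t id)
{
	return id + 1;
}

/* Naive ordering check: breaks as soon as the counter wraps. */
bool id_is_newer_naive(uint32_t id, uint32_t last_synced)
{
	return id > last_synced;
}

/* Wrap-tolerant check (serial-number style): valid while the two ids
 * are less than 2^31 apart. Assumes the usual two's-complement
 * conversion to int32_t. */
bool id_is_newer_wrapsafe(uint32_t id, uint32_t last_synced)
{
	return (int32_t)(id - last_synced) > 0;
}

/* Alternative: widen the counter so it does not wrap at 2^32. */
uint64_t advance_u64(uint64_t id)
{
	return id + 1;
}

/* Replays the values from the log: next 0, last synced 4294967205. */
void replay_logged_values(void)
{
	uint32_t last_synced = 4294967205u;
	uint32_t next = 0;

	/* The wrapped id looks older than the last synced one... */
	assert(!id_is_newer_naive(next, last_synced));
	/* ...although it is 91 ids ahead modulo 2^32. */
	assert(id_is_newer_wrapsafe(next, last_synced));
}
```

Calling replay_logged_values() from a test harness exercises both comparisons; a consistency check built on the naive form would treat the wrapped id as invalid, which matches the symptom reported here.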



Comments
Comment by Gerrit Updater [ 29/Mar/16 ]

Alexander Boyko (alexander.boyko@seagate.com) uploaded a new patch: http://review.whamcloud.com/19190
Subject: LU-7934 osp: fix tr->otr_next_id overflow
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: bc8de37aa21a8e0d777ef08479285e73df19bbb3

Comment by Gerrit Updater [ 16/Jun/16 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/19190/
Subject: LU-7934 osp: fix tr->otr_next_id overflow
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: b9e1bb635039c6d2d985754a9a029c9d5c20b569

Comment by Gerrit Updater [ 17/Jun/16 ]

Alexander Boyko (alexander.boyko@seagate.com) uploaded a new patch: http://review.whamcloud.com/20847
Subject: LU-7934 tests: compatibility check
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 3c6047b3e718c2c76d20df5b7e1bf943f6fa8d92

Comment by Gerrit Updater [ 06/Feb/18 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/20847/
Subject: LU-7934 tests: compatibility check
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 49306327b929e5ab3c62c2ed27199e7ccb69139f
