Loading...

XML

Word

Printable

Details

Type: Bug
Resolution: Duplicate
Priority: Minor
Fix Version/s: None
Affects Version/s: Lustre 2.16.0
Labels:
None

Severity:
3
Rank (Obsolete):
9223372036854775807

Description

Starting from March 30, right after landings on that day, a new assertion crash appeared in large-scale test 3a (only gets run in full testing I guess, so flew under radar)

LustreError: 676976:0:(osp_precreate.c:488:osp_precreate_rollover_new_seq()) ASSERTION( fid_seq(fid) != fid_seq(last_fid) ) failed: fid [0x240000bd0:0x1:0x0], last_fid [0x240000bd0:0x3fff:0x0]
LustreError: 676976:0:(osp_precreate.c:488:osp_precreate_rollover_new_seq()) LBUG
Pid: 676976, comm: osp-pre-0-0 4.18.0-425.10.1.el8_lustre.x86_64 #1 SMP Thu Mar 2 00:54:22 UTC 2023
Call Trace TBD:
[<0>] libcfs_call_trace+0x6f/0xa0 [libcfs]
[<0>] lbug_with_loc+0x3f/0x70 [libcfs]
[<0>] osp_precreate_thread+0x121d/0x1230 [osp]
[<0>] kthread+0x10b/0x130
[<0>] ret_from_fork+0x35/0x40

Example crashes:

https://testing.whamcloud.com/test_sets/5173c0c5-ff80-4f5b-aec2-d6e1419cbd85

https://testing.whamcloud.com/test_sets/68c90481-1450-4526-a659-b6d5d6b97f0a

https://testing.whamcloud.com/test_sets/20a4a76a-e1bf-4f46-985c-b8cbed94e51b

I suspect this is due to ~~LU-11912~~ patch landing, the timing checks out.

Attachments

Issue Links

is related to

LU-17538 lov_objseq file contains 0x0BD0 contstant in low bytes

Open

LU-17658 sanity check when ofd assign a new sequence to osp

Resolved

is related to

LU-16692 replay-single: test_70c osp_fid_diff()) ASSERTION( fid_seq(fid1) == fid_seq(fid2) )

Resolved

LU-11912 reduce number of OST objects created per MDS Sequence

Resolved

Activity

People

Assignee:: Dongyang Li

Reporter:: Oleg Drokin

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 07/Apr/23 12:48 AM

Updated:: 09/Apr/24 7:27 PM

Resolved:: 09/Apr/24 7:27 PM