Details
- Bug
- Resolution: Duplicate
- Minor
- None
- Lustre 2.16.0
- None
- 3
- 9223372036854775807
Description
Starting March 30, right after that day's landings, a new assertion crash appeared in large-scale test 3a (I guess it only gets run in full testing, so it flew under the radar):
LustreError: 676976:0:(osp_precreate.c:488:osp_precreate_rollover_new_seq()) ASSERTION( fid_seq(fid) != fid_seq(last_fid) ) failed: fid [0x240000bd0:0x1:0x0], last_fid [0x240000bd0:0x3fff:0x0]
LustreError: 676976:0:(osp_precreate.c:488:osp_precreate_rollover_new_seq()) LBUG
Pid: 676976, comm: osp-pre-0-0 4.18.0-425.10.1.el8_lustre.x86_64 #1 SMP Thu Mar 2 00:54:22 UTC 2023
Call Trace TBD:
[<0>] libcfs_call_trace+0x6f/0xa0 [libcfs]
[<0>] lbug_with_loc+0x3f/0x70 [libcfs]
[<0>] osp_precreate_thread+0x121d/0x1230 [osp]
[<0>] kthread+0x10b/0x130
[<0>] ret_from_fork+0x35/0x40
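For anyone reading the assertion message: the two FIDs are printed as [sequence:object id:version], and the check requires the sequence of fid to differ from the sequence of last_fid. A minimal sketch of that comparison follows, using the values from the log above. The names Fid, parse_fid and same_sequence are hypothetical helpers for illustration only, not Lustre code.

```python
import re
from typing import NamedTuple


class Fid(NamedTuple):
    """Hypothetical stand-in for a Lustre FID: [seq:oid:ver]."""
    seq: int
    oid: int
    ver: int


# Matches the bracketed hex triple as printed in the log, e.g. [0x240000bd0:0x1:0x0]
FID_RE = re.compile(r"\[(0x[0-9a-fA-F]+):(0x[0-9a-fA-F]+):(0x[0-9a-fA-F]+)\]")


def parse_fid(text: str) -> Fid:
    """Parse a FID string copied from a log line into its three fields."""
    match = FID_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"not a FID: {text!r}")
    return Fid(*(int(group, 16) for group in match.groups()))


def same_sequence(fid: Fid, last_fid: Fid) -> bool:
    """True when both FIDs share a sequence, the case the assertion rejects."""
    return fid.seq == last_fid.seq


# Values taken verbatim from the assertion message.
fid = parse_fid("[0x240000bd0:0x1:0x0]")
last_fid = parse_fid("[0x240000bd0:0x3fff:0x0]")

if same_sequence(fid, last_fid):
    print(f"assertion would fail: both FIDs are in sequence {fid.seq:#x}")
```

Both FIDs carry sequence 0x240000bd0 and differ only in object id (0x1 versus 0x3fff), which is exactly the condition the ASSERTION line reports as failed.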
Example crashes:
https://testing.whamcloud.com/test_sets/5173c0c5-ff80-4f5b-aec2-d6e1419cbd85
https://testing.whamcloud.com/test_sets/68c90481-1450-4526-a659-b6d5d6b97f0a
https://testing.whamcloud.com/test_sets/20a4a76a-e1bf-4f46-985c-b8cbed94e51b
I suspect this is due to the LU-11912 patch landing; the timing checks out.