Details
-
Bug
-
Resolution: Unresolved
-
Minor
Description
There are two interesting stack traces:
PID: 6269 TASK: ffff9c4a9c2243c0 CPU: 1 COMMAND: "ll_ost_io00_001"
#0 [ffff9c4a9c2eba50] __schedule at ffffffff9b6f6bd6 /tmp/kernel/kernel/sched/core.c: 3755
#1 [ffff9c4a9c2ebaa8] schedule at ffffffff9b6f7170 /tmp/kernel/kernel/sched/core.c: 4602
#2 [ffff9c4a9c2ebac0] schedule_preempt_disabled at ffffffff9b6f75bf /tmp/kernel/kernel/sched/core.c: 4661
#3 [ffff9c4a9c2ebac8] rwsem_down_read_slowpath at ffffffff9b6facc0 /tmp/kernel/kernel/locking/rwsem.c: 1088
#4 [ffff9c4a9c2ebb50] down_read_nested at ffffffff9b12bb71 /tmp/kernel/./include/linux/err.h: 36
#5 [ffff9c4a9c2ebb68] osd_read_lock at ffffffffc0b3b643 [osd_ldiskfs] /home/lustre/master-mine/lustre/osd-ldiskfs/osd_handler.c: 2809
#6 [ffff9c4a9c2ebb90] ofd_preprw at ffffffffc0ec7093 [ofd] /home/lustre/master-mine/lustre/ofd/ofd_internal.h: 207
#7 [ffff9c4a9c2ebc38] tgt_brw_read at ffffffffc05af2c9 [ptlrpc] /home/lustre/master-mine/libcfs/include/libcfs/libcfs_debug.h: 126
#8 [ffff9c4a9c2ebda8] tgt_request_handle at ffffffffc05ad03c [ptlrpc] /home/lustre/master-mine/lustre/include/lu_target.h: 638
#9 [ffff9c4a9c2ebe20] ptlrpc_main at ffffffffc04fbb53 [ptlrpc] /home/lustre/master-mine/lustre/include/lustre_net.h: 2413
#10 [ffff9c4a9c2ebf10] kthread at ffffffff9b10383e /tmp/kernel/kernel/kthread.c: 354
#11 [ffff9c4a9c2ebf50] ret_from_fork at ffffffff9b8001c4 /tmp/kernel/arch/x86/entry/entry_64.S: 328

PID: 11853 TASK: ffff9c4af1f94c40 CPU: 0 COMMAND: "ll_ost_io00_011"
#0 [ffff9c4af13bfa58] __schedule at ffffffff9b6f6bd6 /tmp/kernel/kernel/sched/core.c: 3755
#1 [ffff9c4af13bfab0] schedule at ffffffff9b6f7170 /tmp/kernel/kernel/sched/core.c: 4602
#2 [ffff9c4af13bfac8] io_schedule at ffffffff9b6f769d /tmp/kernel/./arch/x86/include/asm/current.h: 15
#3 [ffff9c4af13bfad8] __lock_page at ffffffff9b1d405d /tmp/kernel/./arch/x86/include/asm/current.h: 15
#4 [ffff9c4af13bfb68] pagecache_get_page at ffffffff9b1d5207 /tmp/kernel/./include/linux/pagemap.h: 480
#5 [ffff9c4af13bfba8] ldiskfs_block_zero_page_range at ffffffffc0abeedc [ldiskfs] /home/lustre/master-mine/ldiskfs/inode.c: 4043
#6 [ffff9c4af13bfc00] ldiskfs_truncate at ffffffffc0ac4b6c [ldiskfs] /home/lustre/master-mine/ldiskfs/inode.c: 4169
#7 [ffff9c4af13bfc40] osd_execute_truncate at ffffffffc0b63f78 [osd_ldiskfs] /home/lustre/linux-4.18.0-477.15.1.el8_8/include/linux/fs.h: 792
#8 [ffff9c4af13bfc78] osd_punch at ffffffffc0b64249 [osd_ldiskfs] /home/lustre/master-mine/libcfs/include/libcfs/libcfs_debug.h: 126
#9 [ffff9c4af13bfcb8] ofd_object_punch at ffffffffc0ec2421 [ofd] /home/lustre/master-mine/lustre/ofd/ofd_objects.c: 986
#10 [ffff9c4af13bfd30] ofd_punch_hdl at ffffffffc0eaa09c [ofd] /home/lustre/master-mine/lustre/ofd/ofd_dev.c: 2131
#11 [ffff9c4af13bfda8] tgt_request_handle at ffffffffc05ad03c [ptlrpc] /home/lustre/master-mine/lustre/include/lu_target.h: 638
#12 [ffff9c4af13bfe20] ptlrpc_main at ffffffffc04fbb53 [ptlrpc] /home/lustre/master-mine/lustre/include/lustre_net.h: 2413
#13 [ffff9c4af13bff10] kthread at ffffffff9b10383e /tmp/kernel/kernel/kthread.c: 354
#14 [ffff9c4af13bff50] ret_from_fork at ffffffff9b8001c4 /tmp/kernel/arch/x86/entry/entry_64.S: 328
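For comparing the two tasks, a small script can split the backtrace text into frames and report the first non-scheduler frame of each task, which is where the thread is actually waiting. This is only a rough sketch: `parse_tasks`, `first_wait`, and the `SCHED_FUNCS` set are hypothetical helpers written for this ticket, not part of crash or Lustre, and `SAMPLE` is an excerpt of the traces above with the file:line suffixes dropped.

```python
import re

# Task header as printed by the backtrace, e.g.
#   PID: 6269 TASK: ffff9c4a9c2243c0 CPU: 1 COMMAND: "ll_ost_io00_001"
TASK_RE = re.compile(
    r'PID:\s*(?P<pid>\d+)\s+TASK:\s*\S+\s+CPU:\s*\d+\s+COMMAND:\s*"(?P<comm>[^"]+)"'
)

# One frame, e.g.
#   #5 [ffff9c4a9c2ebb68] osd_read_lock at ffffffffc0b3b643 [osd_ldiskfs]
# The trailing [module] is optional (core kernel frames have none).
FRAME_RE = re.compile(
    r"#(?P<idx>\d+)\s+\[(?P<sp>[0-9a-f]+)\]\s+(?P<func>\S+)\s+at\s+(?P<addr>[0-9a-f]+)"
    r"(?:\s+\[(?P<module>[^\]]+)\])?"
)

# Assumption: these frames only mean "the task is sleeping" and say nothing
# about what it is waiting for.
SCHED_FUNCS = {"__schedule", "schedule", "schedule_preempt_disabled", "io_schedule"}


def parse_tasks(text):
    """Return [(pid, command, [(func, module), ...]), ...] for each task."""
    # Collapse all whitespace so flattened or mid-frame-wrapped text still parses.
    text = " ".join(text.split())
    headers = list(TASK_RE.finditer(text))
    tasks = []
    for i, hdr in enumerate(headers):
        end = headers[i + 1].start() if i + 1 < len(headers) else len(text)
        body = text[hdr.end():end]
        frames = [(m.group("func"), m.group("module")) for m in FRAME_RE.finditer(body)]
        tasks.append((int(hdr.group("pid")), hdr.group("comm"), frames))
    return tasks


def first_wait(frames):
    """Return the innermost frame that is not a scheduler entry point."""
    for func, module in frames:
        if func not in SCHED_FUNCS:
            return func, module
    return None


# Excerpt of the two traces (file:line suffixes dropped, one frame wrapped on purpose).
SAMPLE = """
PID: 6269 TASK: ffff9c4a9c2243c0 CPU: 1 COMMAND: "ll_ost_io00_001" #0 [ffff9c4a9c2eba50] __schedule at ffffffff9b6f6bd6 #1 [ffff9c4a9c2ebaa8] schedule at ffffffff9b6f7170 #2 [ffff9c4a9c2ebac0] schedule_preempt_disabled at ffffffff9b6f75bf #3 [ffff9c4a9c2ebac8] rwsem_down_read_slowpath at ffffffff9b6facc0 #4 [ffff9c4a9c2ebb50] down_read_nested at ffffffff9b12bb71 #5 [ffff9c4a9c2ebb68] osd_read_lock at ffffffffc0b3b643 [osd_ldiskfs]
PID: 11853 TASK: ffff9c4af1f94c40 CPU: 0 COMMAND: "ll_ost_io00_011" #0 [ffff9c4af13bfa58] __schedule at ffffffff9b6f6bd6 #1 [ffff9c4af13bfab0] schedule at ffffffff9b6f7170 #2 [ffff9c4af13bfac8] io_schedule at ffffffff9b6f769d #3 [ffff9c4af13bfad8] __lock_page at ffffffff9b1d405d #4 [ffff9c4af13bfb68] pagecache_get_page at ffffffff9b1d5207 #5 [ffff9c4af13bfba8] ldiskfs_block_zero_page_range at
ffffffffc0abeedc [ldiskfs] #6 [ffff9c4af13bfc00] ldiskfs_truncate at ffffffffc0ac4b6c [ldiskfs]
"""

if __name__ == "__main__":
    for pid, comm, frames in parse_tasks(SAMPLE):
        print(pid, comm, "waits in", first_wait(frames))
```

On the excerpt, this separates the read thread (sleeping in the rwsem read slow path under `osd_read_lock`) from the punch thread (sleeping in `__lock_page` under `ldiskfs_truncate`). It does not show who holds either lock; that still has to be found in the other tasks of the dump.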
Not sure how it is possible to have an overlapping truncate and read.
Issue Links
- is related to LU-17364 osc_page_delete LBUG - trying to delete a page under write (Resolved)
Andreas, my test was added to exercise a client-side problem, while the backtraces point to the OFD side.
Most likely this is a regression from the external truncate on the OFD. A similar bug has been hit on Cray with a group lock.