Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-12462

osc_cache_writeback_range(): ASSERTION( ext->oe_start >= start && ext->oe_end <= end ) failed

Details

    • Bug
    • Resolution: Fixed
    • Critical
    • Lustre 2.13.0, Lustre 2.12.4
    • Lustre 2.13.0, Lustre 2.12.4
    • None
    • 3
    • 9223372036854775807

    Description

      This is a new failure in racer happening only in maloo full testning. First crash recorded was on June 1st right after landing this below bunch of patches. On both patched and patchless master builds:

      LU-12034 obdclass: put all service's env on the list (detail / gitweb)
      LU-11359 mdt: fix mdt_dom_discard_data() timeouts (detail / gitweb)
      LU-9846 lod: Add overstriping support (detail / gitweb)
      LU-11893 o2iblnd: add secondary IP address handling (detail / gitweb)
      LU-11526 rpc: support maximum 64MB I/O RPC (detail / gitweb)
      LU-11872 utils: don't follow link files in default (detail / gitweb)
      LU-10894 dom: per-resource ELC for WRITE lock enqueue (detail / gitweb)
      LU-10894 dom: mdc_lock_flush() improvement (detail / gitweb)
      LU-11946 build: no zlib check during configure --enable-dist (detail / gitweb)
      LU-11946 build: no yaml check during configure --enable-dist (detail / gitweb)
      LU-12302 lnet: Fix NI status in proc for loopback ni (detail / gitweb)
      LU-12333 ptlrpc: Add more flags to DEBUG_REQ_FLAGS macro (detail / gitweb)
      LU-12342 spec: mark lsvcgss as a config file in the rpm (detail / gitweb)
      LU-12350 tests: Do not use background failover (detail / gitweb)
      LU-11089 obd: use wait_event_var() in lu_context_key_degister() (detail / gitweb)
      LU-11089 obd: remove lock from key register/degister (detail / gitweb)
      LU-6142 ldlm: Fix style issues for ldlm_plain.c (detail / gitweb)
      LU-12323 libcfs: check if save_stack_trace_tsk is exported (detail / gitweb)
      LU-10467 ptlrpc: discard SVC_SIGNAL and related functions (detail / gitweb)
      LU-12345 ldiskfs: optimize nodelalloc mode (detail / gitweb)
      LU-11041 kernel: Enable tons of kernel debug options (detail / gitweb)
      LU-10948 llite: Revalidate dentries in ll_intent_file_open (detail / gitweb)

      This does not appear to happen in any of my testing, with major difference being in my testing I have DOM disabled, but then again the backtrace is not in DOM:

      Lustre: DEBUG MARKER: DURATION=900 MDSCOUNT=4 OSTCOUNT=8 RACER_ENABLE_REMOTE_DIRS=true RACER_ENABLE_STRIPED_DIRS=true RACER_ENABLE_MIGRATION=false RACER_ENABLE_PFL=true RACER_ENABLE_DOM=true RACER_ENABLE_FLR=true LFS=/usr/bin/lfs LCTL=/usr/sbin/lctl
      Lustre: DEBUG MARKER: DURATION=900 MDSCOUNT=4 OSTCOUNT=8 RACER_ENABLE_REMOTE_DIRS=true RACER_ENABLE_STRIPED_DIRS=true RACER_ENABLE_MIGRATION=false RACER_ENABLE_PFL=true RACER_ENABLE_DOM=true RACER_ENABLE_FLR=true LFS=/usr/bin/lfs LCTL=/usr/sbin/lctl
      Lustre: DEBUG MARKER: DURATION=900 MDSCOUNT=4 OSTCOUNT=8 RACER_ENABLE_REMOTE_DIRS=true RACER_ENABLE_STRIPED_DIRS=true RACER_ENABLE_MIGRATION=false RACER_ENABLE_PFL=true RACER_ENABLE_DOM=true RACER_ENABLE_FLR=true LFS=/usr/bin/lfs LCTL=/usr/sbin/lctl
      Lustre: DEBUG MARKER: DURATION=900 MDSCOUNT=4 OSTCOUNT=8 RACER_ENABLE_REMOTE_DIRS=true RACER_ENABLE_STRIPED_DIRS=true RACER_ENABLE_MIGRATION=false RACER_ENABLE_PFL=true RACER_ENABLE_DOM=true RACER_ENABLE_FLR=true LFS=/usr/bin/lfs LCTL=/usr/sbin/lctl
      Lustre: DEBUG MARKER: DURATION=900 MDSCOUNT=4 OSTCOUNT=8 RACER_ENABLE_REMOTE_DIRS=true RACER_ENABLE_STRIPED_DIRS=true RACER_ENABLE_MIGRATION=false RACER_ENABLE_PFL=true RACER_ENABLE_DOM=true RACER_ENABLE_FLR=true LFS=/usr/bin/lfs LCTL=/usr/sbin/lctl
      Lustre: DEBUG MARKER: DURATION=900 MDSCOUNT=4 OSTCOUNT=8 RACER_ENABLE_REMOTE_DIRS=true RACER_ENABLE_STRIPED_DIRS=true RACER_ENABLE_MIGRATION=false RACER_ENABLE_PFL=true RACER_ENABLE_DOM=true RACER_ENABLE_FLR=true LFS=/usr/bin/lfs LCTL=/usr/sbin/lctl
      Lustre: DEBUG MARKER: DURATION=900 MDSCOUNT=4 OSTCOUNT=8 RACER_ENABLE_REMOTE_DIRS=true RACER_ENABLE_STRIPED_DIRS=true RACER_ENABLE_MIGRATION=false RACER_ENABLE_PFL=true RACER_ENABLE_DOM=true RACER_ENABLE_FLR=true LFS=/usr/bin/lfs LCTL=/usr/sbin/lctl
      Lustre: DEBUG MARKER: DURATION=900 MDSCOUNT=4 OSTCOUNT=8 RACER_ENABLE_REMOTE_DIRS=true RACER_ENABLE_STRIPED_DIRS=true RACER_ENABLE_MIGRATION=false RACER_ENABLE_PFL=true RACER_ENABLE_DOM=true RACER_ENABLE_FLR=true LFS=/usr/bin/lfs LCTL=/usr/sbin/lctl
      15[30843]: segfault at 0 ip (null) sp 00007fff2f60f558 error 14 in 15[400000+6000]
      19[11605]: segfault at 8 ip 00007fb7cd1f2958 sp 00007fffbcc378f0 error 4 in ld-2.17.so[7fb7cd1e7000+22000]
      5[24620]: segfault at 8 ip 00007f432a0fe958 sp 00007fff4e77a210 error 4 in ld-2.17.so[7f432a0f3000+22000]
      16[23010]: segfault at 8 ip 00007f4ab1b40958 sp 00007ffebe439030 error 4 in ld-2.17.so[7f4ab1b35000+22000]
      LustreError: 3977:0:(osc_cache.c:3035:osc_cache_writeback_range()) extent ffff8eb4a553f4d0@{[0 -> 255/255], [1|0|-|cache|wiY|ffff8eb48a5f3000], [1703936|128|+|-|ffff8eb498770d80|256| (null)]}
      LustreError: 3977:0:(osc_cache.c:3035:osc_cache_writeback_range()) ### extent: ffff8eb4a553f4d0 ns: lustre-OST0003-osc-ffff8eb4b8348800 lock: ffff8eb498770d80/0x127dda6c42633b5a lrc: 2/0,0 mode: PW/PW res: [0xb1c:0x0:0x0].0x0 rrc: 2 type: EXT [0->18446744073709551615] (req 524288->18446744073709551615) flags: 0x20080020000 nid: local remote: 0xf6ccd1090fb739f3 expref: -99 pid: 3963 timeout: 0 lvb_type: 1
      LustreError: 3977:0:(osc_cache.c:1246:osc_extent_tree_dump0()) Dump object ffff8eb48a5f3000 extents at osc_cache_writeback_range:3035, mppr: 256.
      LustreError: 3977:0:(osc_cache.c:1251:osc_extent_tree_dump0()) extent ffff8eb4a553f4d0@{[0 -> 255/255], [1|0|-|cache|wiY|ffff8eb48a5f3000], [1703936|128|+|-|ffff8eb498770d80|256| (null)]} in tree 1.
      LustreError: 3977:0:(osc_cache.c:1251:osc_extent_tree_dump0()) ### extent: ffff8eb4a553f4d0 ns: lustre-OST0003-osc-ffff8eb4b8348800 lock: ffff8eb498770d80/0x127dda6c42633b5a lrc: 2/0,0 mode: PW/PW res: [0xb1c:0x0:0x0].0x0 rrc: 2 type: EXT [0->18446744073709551615] (req 524288->18446744073709551615) flags: 0x20080020000 nid: local remote: 0xf6ccd1090fb739f3 expref: -99 pid: 3963 timeout: 0 lvb_type: 1
      LustreError: 3977:0:(osc_cache.c:3035:osc_cache_writeback_range()) ASSERTION( ext->oe_start >= start && ext->oe_end <= end ) failed:
      LustreError: 3977:0:(osc_cache.c:3035:osc_cache_writeback_range()) LBUG
      Pid: 3977, comm: cat 3.10.0-862.14.4.el7.x86_64 #1 SMP Wed Sep 26 15:12:11 UTC 2018
      Call Trace:
      [<ffffffffc06e47cc>] libcfs_call_trace+0x8c/0xc0 [libcfs]
      [<ffffffffc06e487c>] lbug_with_loc+0x4c/0xa0 [libcfs]
      [<ffffffffc0cd0cad>] osc_cache_writeback_range+0xacd/0x1260 [osc]
      [<ffffffffc0cbf765>] osc_io_fsync_start+0x85/0x1a0 [osc]
      [<ffffffffc087b4c8>] cl_io_start+0x68/0x130 [obdclass]
      [<ffffffffc0d10cb7>] lov_io_call.isra.7+0x87/0x140 [lov]
      [<ffffffffc0d10e76>] lov_io_start+0x56/0x150 [lov]
      [<ffffffffc087b4c8>] cl_io_start+0x68/0x130 [obdclass]
      [<ffffffffc087d69c>] cl_io_loop+0xcc/0x1c0 [obdclass]
      [<ffffffffc0d77c2b>] cl_sync_file_range+0x2db/0x380 [lustre]
      [<ffffffffc0d8d090>] ll_delete_inode+0x150/0x220 [lustre]
      [<ffffffff9423c504>] evict+0xb4/0x180
      [<ffffffff9423ce0c>] iput+0xfc/0x190
      [<ffffffff942376f0>] __dentry_kill+0x120/0x180
      [<ffffffff942377f9>] dput+0xa9/0x160
      [<ffffffff9422158e>] __fput+0x17e/0x260
      [<ffffffff9422175e>] ____fput+0xe/0x10
      [<ffffffff940bab8b>] task_work_run+0xbb/0xe0
      [<ffffffff9402bc55>] do_notify_resume+0xa5/0xc0
      [<ffffffff94725ae4>] int_signal+0x12/0x17
      [<ffffffffffffffff>] 0xffffffffffffffff
      

      Very first crash:
      https://testing.whamcloud.com/test_sessions/2d05c3e5-2252-4056-8082-3c9183b4aef4

      last crash as of the time of this writing:
      https://testing.whamcloud.com/test_sets/35838dd8-93da-11e9-a1a3-52540065bddc

      as of now 13 crashes like this were recorded, all in delete_inode path.

      Attachments

        Issue Links

          Activity

            [LU-12462] osc_cache_writeback_range(): ASSERTION( ext->oe_start >= start && ext->oe_end <= end ) failed
            pjones Peter Jones added a comment -

            Seems like everything has landed for 2.13

            pjones Peter Jones added a comment - Seems like everything has landed for 2.13

            Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/35733/
            Subject: LU-12462 osc: layout and chunkbits alignment mismatch
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 7a9f7dec700c5c553396475daad272475f1b20be

            gerrit Gerrit Updater added a comment - Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/35733/ Subject: LU-12462 osc: layout and chunkbits alignment mismatch Project: fs/lustre-release Branch: master Current Patch Set: Commit: 7a9f7dec700c5c553396475daad272475f1b20be

            Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/35525/
            Subject: LU-12462 osc: Do not assert for first extent
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 092ecd66127eade284550b83192fa004ff55501b

            gerrit Gerrit Updater added a comment - Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/35525/ Subject: LU-12462 osc: Do not assert for first extent Project: fs/lustre-release Branch: master Current Patch Set: Commit: 092ecd66127eade284550b83192fa004ff55501b

            Patrick Farrell (pfarrell@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/35525
            Subject: LU-12462 osc: align writeback to chunk size
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: efa7e2fc1918310b0b0ab7201250719a0c629f40

            gerrit Gerrit Updater added a comment - Patrick Farrell (pfarrell@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/35525 Subject: LU-12462 osc: align writeback to chunk size Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: efa7e2fc1918310b0b0ab7201250719a0c629f40

            Oh, that's a good idea!  I like that, I'll give it a try.

            pfarrell Patrick Farrell (Inactive) added a comment - Oh, that's a good idea!  I like that, I'll give it a try.
            tappro Mikhail Pershin added a comment - - edited

            Hm, that means we are comparing just different values - extent 'chunks' vs page index, which is wrong obviously. The 'start' and 'end' parameters much be not page index but also in 'chunks' units. Considering that it seems whole osc_cache_writeback_range() is wrong when it checks oe_start/end against start/end parameters, isn't it?

            Ah, it is not in chunks but aligned to that, right?

            OK, what if we will align start/end inside osc_cache_writeback_range() also by cl_chunkbits before using them?

            tappro Mikhail Pershin added a comment - - edited Hm, that means we are comparing just different values - extent 'chunks' vs page index, which is wrong obviously. The 'start' and 'end' parameters much be not page index but also in 'chunks' units. Considering that it seems whole osc_cache_writeback_range() is wrong when it checks oe_start/end against start/end parameters, isn't it? Ah, it is not in chunks but aligned to that, right? OK, what if we will align start/end inside osc_cache_writeback_range() also by cl_chunkbits before using them?

            Ah, good catch, askulysh.

            Do you have a thought on how to fix this?  I can think of several approaches, but I'm struggling to think what's best.

            1. It's not really necessary for the start of the first OSC extent to be chunk aligned, as long as it's smaller than one chunk.  (I think?)
            2. We could modify the lov code to give a start of zero for the first non-DOM component, but while that would work, it seems wrong - The component does not start at zero.

            I like (1), but it would require making the OSC layer aware that it's the first extent.  So that would require passing some info down, might be complicated.

            tappro, if you have an opinion too...

            pfarrell Patrick Farrell (Inactive) added a comment - Ah, good catch, askulysh . Do you have a thought on how to fix this?  I can think of several approaches, but I'm struggling to think what's best. 1. It's not really necessary for the start of the first OSC extent to be chunk aligned, as long as it's smaller than one chunk.  (I think?) 2. We could modify the lov code to give a start of zero for the first non-DOM component, but while that would work, it seems wrong - The component does not start at zero. I like (1), but it would require making the OSC layer aware that it's the first extent.  So that would require passing some info down, might be complicated. tappro , if you have an opinion too...

            The failure isn't limited by  signed vs unsigned issue.
            OST extent is created with wrong oe_start in case of existing DoM extent.
            os_start is aligned to cl_chunkbits=20, so it is 0 with 262144 index which is real start of OST extent.

            crash> osc_extent.oe_start,oe_end ffff90895c8b7d10 
              oe_start = 0 
              oe_end = 255
            
                ci_fsync = {
                  fi_start = 262144, 
                  fi_end = -1, 
                  fi_fid = 0xffff908956911b18, 
                  fi_mode = CL_FSYNC_DISCARD, 
                  fi_nr_written = 0
                }, 
            
            00000080:00200000:0.0:1560698103.488527:0:6682:0:(file.c:357:ll_file_release()) VFS Op:inode=[0x200000403:0x1350:0x0](ffff908956911b90)
            00010000:00010000:0.0:1560698103.490992:0:12953:0:(ldlm_lockd.c:1715:ldlm_handle_bl_callback()) ### client blocking AST callback handler ns: lustre-MDT0000-mdc-ffff90897640f000 lock: ffff9089ec0ee000/0xeb72e01621e5d8a8 lrc: 3/0,0 mode: PW/PW res: [0x200000403:0x1350:0x0].0x0 bits 0x40/0x0 rrc: 3 type: IBT flags: 0x400080010000 nid: local remote: 0x76683b3d6040c693 expref: -99 pid: 6682 timeout: 0 lvb_type: 0
            00010000:00010000:0.0:1560698103.490997:0:12953:0:(ldlm_request.c:1318:ldlm_cli_cancel_local()) ### client-side cancel ns: lustre-MDT0000-mdc-ffff90897640f000 lock: ffff9089ec0ee000/0xeb72e01621e5d8a8 lrc: 4/0,0 mode: PW/PW res: [0x200000403:0x1350:0x0].0x0 bits 0x40/0x40 rrc: 3 type: IBT flags: 0x408480010000 nid: local remote: 0x76683b3d6040c693 expref: -99 pid: 6682 timeout: 0 lvb_type: 0
            00000008:00010000:0.0:1560698103.491006:0:12953:0:(osc_cache.c:1113:osc_extent_make_ready()) ### extent: ffff9089f89e2a50 ns: lustre-MDT0000-mdc-ffff90897640f000 lock: ffff9089ec0ee000/0xeb72e01621e5d8a8 lrc: 4/0,0 mode: PW/PW res: [0x200000403:0x1350:0x0].0x0 bits 0x40/0x40 rrc: 3 type: IBT flags: 0x449480010000 nid: local remote: 0x76683b3d6040c693 expref: -99 pid: 6682 timeout: 0 lvb_type: 0
            00000008:00010000:0.0:1560698103.491062:0:12953:0:(osc_cache.c:867:osc_extent_finish()) ### extent: ffff9089f89e2a50 ns: lustre-MDT0000-mdc-ffff90897640f000 lock: ffff9089ec0ee000/0xeb72e01621e5d8a8 lrc: 4/0,0 mode: PW/PW res: [0x200000403:0x1350:0x0].0x0 bits 0x40/0x40 rrc: 3 type: IBT flags: 0x449480010000 nid: local remote: 0x76683b3d6040c693 expref: -99 pid: 6682 timeout: 0 lvb_type: 0
            00000008:00010000:0.0:1560698103.491085:0:12953:0:(osc_cache.c:489:__osc_extent_remove()) ### extent: ffff9089f89e2a50 ns: lustre-MDT0000-mdc-ffff90897640f000 lock: ffff9089ec0ee000/0xeb72e01621e5d8a8 lrc: 4/0,0 mode: PW/PW res: [0x200000403:0x1350:0x0].0x0 bits 0x40/0x40 rrc: 3 type: IBT flags: 0x449480010000 nid: local remote: 0x76683b3d6040c693 expref: -99 pid: 6682 timeout: 0 lvb_type: 0
            00010000:00010000:0.0:1560698103.491227:0:12953:0:(ldlm_lockd.c:1761:ldlm_handle_bl_callback()) ### client blocking callback handler END ns: lustre-MDT0000-mdc-ffff90897640f000 lock: ffff9089ec0ee000/0xeb72e01621e5d8a8 lrc: 1/0,0 mode: --/PW res: [0x200000403:0x1350:0x0].0x0 bits 0x40/0x40 rrc: 3 type: IBT flags: 0x4c49480010000 nid: local remote: 0x76683b3d6040c693 expref: -99 pid: 6682 timeout: 0 lvb_type: 0
            00000008:00020000:0.0:1560698103.551596:0:6682:0:(osc_cache.c:3074:osc_cache_writeback_range()) extent ffff90895c8b7d10@\{[0 -> 255/255], [1|0|-|cache|wiY|ffff90895ea34500], [1703936|192|+|-|ffff90895de79440|256| (null)]} 
            00000008:00020000:0.0:1560698103.563094:0:6682:0:(osc_cache.c:3074:osc_cache_writeback_range()) ### extent: ffff90895c8b7d10 ns: lustre-OST0000-osc-ffff90897640f000 lock: ffff90895de79440/0xeb72e01621e5d8fc lrc: 2/0,0 mode: PW/PW res: [0x511:0x0:0x0].0x0 rrc: 2 type: EXT [0->18446744073709551615] (req 262144->18446744073709551615) flags: 0x20000020000 nid: local remote: 0x5d533c3e80fa6ceb expref: -99 pid: 6682 timeout: 0 lvb_type: 1
            00000008:00020000:0.0:1560698103.579593:0:6682:0:(osc_cache.c:1216:osc_extent_tree_dump0()) Dump object ffff90895ea34500 extents at osc_cache_writeback_range:3074, mppr: 256.
            00000008:00020000:0.0:1560698103.586774:0:6682:0:(osc_cache.c:1221:osc_extent_tree_dump0()) extent ffff90895c8b7d10@\{[0 -> 255/255], [1|0|-|cache|wiY|ffff90895ea34500], [1703936|192|+|-|ffff90895de79440|256| (null)]} in tree 1.
            00000008:00020000:0.0:1560698103.595536:0:6682:0:(osc_cache.c:1221:osc_extent_tree_dump0()) ### extent: ffff90895c8b7d10 ns: lustre-OST0000-osc-ffff90897640f000 lock: ffff90895de79440/0xeb72e01621e5d8fc lrc: 2/0,0 mode: PW/PW res: [0x511:0x0:0x0].0x0 rrc: 2 type: EXT [0->18446744073709551615] (req 262144->18446744073709551615) flags: 0x20000020000 nid: local remote: 0x5d533c3e80fa6ceb expref: -99 pid: 6682 timeout: 0 lvb_type: 1
            00000008:00040000:0.0:1560698103.611993:0:6682:0:(osc_cache.c:3074:osc_cache_writeback_range()) ASSERTION( ext->oe_start >= start && ext->oe_end <= end ) failed: 
            
            askulysh Andriy Skulysh added a comment - The failure isn't limited by  signed vs unsigned issue. OST extent is created with wrong oe_start in case of existing DoM extent. os_start is aligned to cl_chunkbits=20, so it is 0 with 262144 index which is real start of OST extent. crash> osc_extent.oe_start,oe_end ffff90895c8b7d10  oe_start = 0  oe_end = 255 ci_fsync = { fi_start = 262144, fi_end = -1, fi_fid = 0xffff908956911b18, fi_mode = CL_FSYNC_DISCARD, fi_nr_written = 0 }, 00000080:00200000:0.0:1560698103.488527:0:6682:0:(file.c:357:ll_file_release()) VFS Op:inode=[0x200000403:0x1350:0x0](ffff908956911b90) 00010000:00010000:0.0:1560698103.490992:0:12953:0:(ldlm_lockd.c:1715:ldlm_handle_bl_callback()) ### client blocking AST callback handler ns: lustre-MDT0000-mdc-ffff90897640f000 lock: ffff9089ec0ee000/0xeb72e01621e5d8a8 lrc: 3/0,0 mode: PW/PW res: [0x200000403:0x1350:0x0].0x0 bits 0x40/0x0 rrc: 3 type: IBT flags: 0x400080010000 nid: local remote: 0x76683b3d6040c693 expref: -99 pid: 6682 timeout: 0 lvb_type: 0 00010000:00010000:0.0:1560698103.490997:0:12953:0:(ldlm_request.c:1318:ldlm_cli_cancel_local()) ### client-side cancel ns: lustre-MDT0000-mdc-ffff90897640f000 lock: ffff9089ec0ee000/0xeb72e01621e5d8a8 lrc: 4/0,0 mode: PW/PW res: [0x200000403:0x1350:0x0].0x0 bits 0x40/0x40 rrc: 3 type: IBT flags: 0x408480010000 nid: local remote: 0x76683b3d6040c693 expref: -99 pid: 6682 timeout: 0 lvb_type: 0 00000008:00010000:0.0:1560698103.491006:0:12953:0:(osc_cache.c:1113:osc_extent_make_ready()) ### extent: ffff9089f89e2a50 ns: lustre-MDT0000-mdc-ffff90897640f000 lock: ffff9089ec0ee000/0xeb72e01621e5d8a8 lrc: 4/0,0 mode: PW/PW res: [0x200000403:0x1350:0x0].0x0 bits 0x40/0x40 rrc: 3 type: IBT flags: 0x449480010000 nid: local remote: 0x76683b3d6040c693 expref: -99 pid: 6682 timeout: 0 lvb_type: 0 00000008:00010000:0.0:1560698103.491062:0:12953:0:(osc_cache.c:867:osc_extent_finish()) ### extent: ffff9089f89e2a50 ns: lustre-MDT0000-mdc-ffff90897640f000 lock: ffff9089ec0ee000/0xeb72e01621e5d8a8 lrc: 4/0,0 mode: PW/PW res: [0x200000403:0x1350:0x0].0x0 bits 0x40/0x40 rrc: 3 type: IBT flags: 0x449480010000 nid: local remote: 0x76683b3d6040c693 expref: -99 pid: 6682 timeout: 0 lvb_type: 0 00000008:00010000:0.0:1560698103.491085:0:12953:0:(osc_cache.c:489:__osc_extent_remove()) ### extent: ffff9089f89e2a50 ns: lustre-MDT0000-mdc-ffff90897640f000 lock: ffff9089ec0ee000/0xeb72e01621e5d8a8 lrc: 4/0,0 mode: PW/PW res: [0x200000403:0x1350:0x0].0x0 bits 0x40/0x40 rrc: 3 type: IBT flags: 0x449480010000 nid: local remote: 0x76683b3d6040c693 expref: -99 pid: 6682 timeout: 0 lvb_type: 0 00010000:00010000:0.0:1560698103.491227:0:12953:0:(ldlm_lockd.c:1761:ldlm_handle_bl_callback()) ### client blocking callback handler END ns: lustre-MDT0000-mdc-ffff90897640f000 lock: ffff9089ec0ee000/0xeb72e01621e5d8a8 lrc: 1/0,0 mode: --/PW res: [0x200000403:0x1350:0x0].0x0 bits 0x40/0x40 rrc: 3 type: IBT flags: 0x4c49480010000 nid: local remote: 0x76683b3d6040c693 expref: -99 pid: 6682 timeout: 0 lvb_type: 0 00000008:00020000:0.0:1560698103.551596:0:6682:0:(osc_cache.c:3074:osc_cache_writeback_range()) extent ffff90895c8b7d10@\{[0 -> 255/255], [1|0|-|cache|wiY|ffff90895ea34500], [1703936|192|+|-|ffff90895de79440|256| (null)]} 00000008:00020000:0.0:1560698103.563094:0:6682:0:(osc_cache.c:3074:osc_cache_writeback_range()) ### extent: ffff90895c8b7d10 ns: lustre-OST0000-osc-ffff90897640f000 lock: ffff90895de79440/0xeb72e01621e5d8fc lrc: 2/0,0 mode: PW/PW res: [0x511:0x0:0x0].0x0 rrc: 2 type: EXT [0->18446744073709551615] (req 262144->18446744073709551615) flags: 0x20000020000 nid: local remote: 0x5d533c3e80fa6ceb expref: -99 pid: 6682 timeout: 0 lvb_type: 1 00000008:00020000:0.0:1560698103.579593:0:6682:0:(osc_cache.c:1216:osc_extent_tree_dump0()) Dump object ffff90895ea34500 extents at osc_cache_writeback_range:3074, mppr: 256. 00000008:00020000:0.0:1560698103.586774:0:6682:0:(osc_cache.c:1221:osc_extent_tree_dump0()) extent ffff90895c8b7d10@\{[0 -> 255/255], [1|0|-|cache|wiY|ffff90895ea34500], [1703936|192|+|-|ffff90895de79440|256| (null)]} in tree 1. 00000008:00020000:0.0:1560698103.595536:0:6682:0:(osc_cache.c:1221:osc_extent_tree_dump0()) ### extent: ffff90895c8b7d10 ns: lustre-OST0000-osc-ffff90897640f000 lock: ffff90895de79440/0xeb72e01621e5d8fc lrc: 2/0,0 mode: PW/PW res: [0x511:0x0:0x0].0x0 rrc: 2 type: EXT [0->18446744073709551615] (req 262144->18446744073709551615) flags: 0x20000020000 nid: local remote: 0x5d533c3e80fa6ceb expref: -99 pid: 6682 timeout: 0 lvb_type: 1 00000008:00040000:0.0:1560698103.611993:0:6682:0:(osc_cache.c:3074:osc_cache_writeback_range()) ASSERTION( ext->oe_start >= start && ext->oe_end <= end ) failed:

            Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/35339/
            Subject: LU-12462 llite: Remove old fsync versions
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 4cb6ce1863d0498ece59dabe6d38ef12c0a5fd41

            gerrit Gerrit Updater added a comment - Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/35339/ Subject: LU-12462 llite: Remove old fsync versions Project: fs/lustre-release Branch: master Current Patch Set: Commit: 4cb6ce1863d0498ece59dabe6d38ef12c0a5fd41

            Patrick Farrell (pfarrell@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/35339
            Subject: LU-12462 llite: Remove old fsync versions
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: a030f9c6123d8e3e1d377da32ee05ea3765bf596

            gerrit Gerrit Updater added a comment - Patrick Farrell (pfarrell@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/35339 Subject: LU-12462 llite: Remove old fsync versions Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: a030f9c6123d8e3e1d377da32ee05ea3765bf596

            green, jamesanunez :

            This failure has only been seen in "full" testing, and I'm not sure how to invoke that for my patch.  Can one of you do that?

            pfarrell Patrick Farrell (Inactive) added a comment - green , jamesanunez : This failure has only been seen in "full" testing, and I'm not sure how to invoke that for my patch.  Can one of you do that?

            People

              pfarrell Patrick Farrell (Inactive)
              green Oleg Drokin
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: