Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-4581

ASSERTION( (!(page->cp_type == CPT_CACHEABLE) || (!PageWriteback(cl_page_vmpage(env, page)))) ) failed:

Details

    • Bug
    • Resolution: Fixed
    • Major
    • None
    • Lustre 2.4.1
    • None
    • client: slessp2 2.4.1-3nas
    • 3
    • 12522

    Description

      3891131.052019] LustreError: 27617:0:(cl_lock.c:1964:discard_cb()) ASSERTION( (!(page->cp_type == CPT_CACHEABLE) || (!PageWriteback(cl_page_vmpage(env, page)))) ) failed: ^M
      [3891131.067131] LustreError: 27617:0:(cl_lock.c:1964:discard_cb()) LBUG^M
      [3891131.073568] Pid: 27617, comm: geos.xco2.osse.^M
      
      
      PID: 27617  TASK: ffff8809be7a04c0  CPU: 21  COMMAND: "geos.xco2.osse."
       #0 [ffff88103321f620] panic at ffffffff81450800
       #1 [ffff88103321f6a0] lbug_with_loc at ffffffffa04c2dc3 [libcfs]
       #2 [ffff88103321f6c0] discard_cb at ffffffffa0633774 [obdclass]
       #3 [ffff88103321f6f0] cl_page_gang_lookup at ffffffffa06309d6 [obdclass]
       #4 [ffff88103321f780] cl_lock_discard_pages at ffffffffa063346a [obdclass]
       #5 [ffff88103321f7c0] osc_lock_flush at ffffffffa08fe272 [osc]
       #6 [ffff88103321f810] osc_lock_cancel at ffffffffa08fe4d9 [osc]
       #7 [ffff88103321f850] cl_lock_cancel0 at ffffffffa0631155 [obdclass]
       #8 [ffff88103321f870] cl_lock_cancel at ffffffffa0631eab [obdclass]
       #9 [ffff88103321f890] osc_lock_blocking at ffffffffa08ff01d [osc]
      #10 [ffff88103321f8c0] osc_dlm_blocking_ast0 at ffffffffa08ff8c9 [osc]
      #11 [ffff88103321f900] osc_ldlm_blocking_ast at ffffffffa08ffa2c [osc]
      #12 [ffff88103321f940] ldlm_cancel_callback at ffffffffa0742e0f [ptlrpc]
      #13 [ffff88103321f950] ldlm_cli_cancel_local at ffffffffa075121f [ptlrpc]
      #14 [ffff88103321f970] ldlm_cli_cancel_list_local at ffffffffa07545b2 [ptlrpc]
      #15 [ffff88103321f9d0] ldlm_prep_elc_req at ffffffffa07563bf [ptlrpc]
      #16 [ffff88103321fa40] ldlm_prep_enqueue_req at ffffffffa075648f [ptlrpc]
      #17 [ffff88103321fa50] osc_enqueue_base at ffffffffa08e525f [osc]
      #18 [ffff88103321faf0] osc_lock_enqueue at ffffffffa08ff440 [osc]
      #19 [ffff88103321fb60] cl_enqueue_kick at ffffffffa0632652 [obdclass]
      #20 [ffff88103321fb90] cl_enqueue_try at ffffffffa0635961 [obdclass]
      #21 [ffff88103321fbc0] lov_lock_enqueue_one at ffffffffa0995d8d [lov]
      #22 [ffff88103321fbf0] lov_lock_enqueue at ffffffffa09984bb [lov]
      #23 [ffff88103321fc60] cl_enqueue_kick at ffffffffa0632652 [obdclass]
      #24 [ffff88103321fc90] cl_enqueue_try at ffffffffa0635961 [obdclass]
      #25 [ffff88103321fcc0] cl_enqueue_locked at ffffffffa0636717 [obdclass]
      #26 [ffff88103321fcf0] cl_lock_request at ffffffffa06373e9 [obdclass]
      #27 [ffff88103321fd40] cl_glimpse_lock at ffffffffa0a6ae5d [lustre]
      #28 [ffff88103321fda0] cl_glimpse_size0 at ffffffffa0a6b31f [lustre]
      #29 [ffff88103321fdf0] ll_glimpse_size at ffffffffa0a19695 [lustre]
      #30 [ffff88103321fe10] ll_inode_revalidate_it at ffffffffa0a1eb68 [lustre]
      #31 [ffff88103321fe40] ll_getattr_it at ffffffffa0a1ebae [lustre]
      #32 [ffff88103321fe70] ll_getattr at ffffffffa0a1ecff [lustre]
      #33 [ffff88103321fed0] vfs_fstat at ffffffff81155d27
      #34 [ffff88103321fef0] sys_newfstat at ffffffff81155d6f
      #35 [ffff88103321ff80] system_call_fastpath at ffffffff8145b412
          RIP: 00002aaaacfe9394  RSP: 00007fffffff5a20  RFLAGS: 00000202
          RAX: 0000000000000005  RBX: ffffffff8145b412  RCX: 0000000000000001
          RDX: 00007fffffdafbf8  RSI: 00007fffffdafbf8  RDI: 0000000000000009
          RBP: 00007fffffdb1cc0   R8: 0000000000000000   R9: 000000000000000a
          R10: 00007fffffdac960  R11: 0000000000000246  R12: 0000000000000001
          R13: 00007fffffdb1dc0  R14: 0000000000000000  R15: 00007fffffdb1cc0
          ORIG_RAX: 0000000000000005  CS: 0033  SS: 002b
      
      

      Attachments

        Issue Links

          Activity

            [LU-4581] ASSERTION( (!(page->cp_type == CPT_CACHEABLE) || (!PageWriteback(cl_page_vmpage(env, page)))) ) failed:
            jgmitter Joseph Gmitter (Inactive) made changes -
            Link New: This issue is related to LDEV-134 [ LDEV-134 ]
            cliffw Cliff White (Inactive) made changes -
            Resolution New: Fixed [ 1 ]
            Status Original: Reopened [ 4 ] New: Resolved [ 5 ]
            cliffw Cliff White (Inactive) made changes -
            Comment [ lustre log dump ]
            cliffw Cliff White (Inactive) made changes -
            Comment [ We appear to be hitting this issue on Hyperion with 2.7 FE
            {code}
            Jul 30 16:33:37 iwc151 kernel: LustreError: 22705:0:(ldlm_resource.c:1418:ldlm_resource_dump()) Granted locks (in reverse order):
            Jul 30 16:33:37 iwc151 kernel: LustreError: 22705:0:(ldlm_resource.c:1421:ldlm_resource_dump()) ### ### ns: lustre-OST0009-osc-ffff8808726fb400 lock: ffff8808537b6b00/0x58cbc34f0b2d4dcb lrc: 2/0,0 mode: PW/PW res: [0xd4c6b8:0x0:0x0].0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->8191) flags: 0x126400020000 nid: local remote: 0xdb6a7d473dfbfe9b expref: -99 pid: 22607 timeout: 0 lvb_type: 1
            Jul 30 16:33:37 iwc151 kernel: Lustre: lustre-OST0009-osc-ffff8808726fb400: Connection restored to lustre-OST0009 (at 192.168.120.22@o2ib)
            Jul 30 16:33:37 iwc151 kernel: LustreError: 91227:0:(osc_cache.c:3115:discard_cb()) ASSERTION( (!(page->cp_type == CPT_CACHEABLE) || (!PageDirty(cl_page_vmpage(page)))) ) failed:
            Jul 30 16:33:37 iwc151 kernel: LustreError: 91227:0:(osc_cache.c:3115:discard_cb()) ASSERTION( (!(page->cp_type == CPT_CACHEABLE) || (!PageDirty(cl_page_vmpage(page)))) ) failed:
            Jul 30 16:33:37 iwc151 kernel: LustreError: 91227:0:(osc_cache.c:3115:discard_cb()) LBUG
            Jul 30 16:33:37 iwc151 kernel: LustreError: 91227:0:(osc_cache.c:3115:discard_cb()) LBUG
            Jul 30 16:33:37 iwc151 kernel: Pid: 91227, comm: ldlm_bl_79
            Jul 30 16:33:37 iwc151 kernel:
            Jul 30 16:33:37 iwc151 kernel: Call Trace:
            Jul 30 16:33:37 iwc151 kernel: [<ffffffffa045e895>] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
            Jul 30 16:33:37 iwc151 kernel: [<ffffffffa045ee97>] lbug_with_loc+0x47/0xb0 [libcfs]
            Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0dccaa6>] discard_cb+0x156/0x190 [osc]
            Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0dcce1c>] osc_page_gang_lookup+0x1ac/0x330 [osc]
            Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0dcc950>] ? discard_cb+0x0/0x190 [osc]
            Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0dcd0e4>] osc_lock_discard_pages+0x144/0x240 [osc]
            Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0dcc950>] ? discard_cb+0x0/0x190 [osc]
            Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0dc37cb>] osc_lock_flush+0x8b/0x260 [osc]
            Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0dc3c48>] osc_ldlm_blocking_ast+0x2a8/0x3c0 [osc]
            Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0c2a04c>] ldlm_cancel_callback+0x6c/0x1a0 [ptlrpc]
            Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0c395ba>] ldlm_cli_cancel_local+0x8a/0x470 [ptlrpc]
            Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0c3e040>] ldlm_cli_cancel+0x60/0x360 [ptlrpc]
            Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0dc3a7b>] osc_ldlm_blocking_ast+0xdb/0x3c0 [osc]
            Jul 30 16:33:37 iwc151 kernel: [<ffffffff8152cfae>] ? _spin_lock+0x1e/0x30
            Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0c417a0>] ldlm_handle_bl_callback+0x130/0x400 [ptlrpc]
            Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0c41ce1>] ldlm_bl_thread_main+0x271/0x3f0 [ptlrpc]
            Jul 30 16:33:37 iwc151 kernel: [<ffffffff81064c00>] ? default_wake_function+0x0/0x20
            Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0c41a70>] ? ldlm_bl_thread_main+0x0/0x3f0 [ptlrpc]
            Jul 30 16:33:37 iwc151 kernel: [<ffffffff8109e78e>] kthread+0x9e/0xc0
            Jul 30 16:33:37 iwc151 kernel: [<ffffffff8100c28a>] child_rip+0xa/0x20
            Jul 30 16:33:37 iwc151 kernel: [<ffffffff8109e6f0>] ? kthread+0x0/0xc0
            Jul 30 16:33:37 iwc151 kernel: [<ffffffff8100c280>] ? child_rip+0x0/0x20
            Jul 30 16:33:37 iwc151 kernel:
            {code} ]
            cliffw Cliff White (Inactive) made changes -
            Attachment New: iwc151.lbug.txt [ 18536 ]
            cliffw Cliff White (Inactive) made changes -
            Resolution Original: Fixed [ 1 ]
            Status Original: Resolved [ 5 ] New: Reopened [ 4 ]
            jfc John Fuchs-Chesney (Inactive) made changes -
            Resolution New: Fixed [ 1 ]
            Status Original: Open [ 1 ] New: Resolved [ 5 ]

            Looks like the patch we supplied has fixed this problem, as far as we can tell.
            ~ jfc.

            jfc John Fuchs-Chesney (Inactive) added a comment - Looks like the patch we supplied has fixed this problem, as far as we can tell. ~ jfc.
            pjones Peter Jones made changes -
            Labels Original: p4d

            Hello Mahmoud,
            I see from the comments above that that patch we supplied has been merged.
            Do you want us to keep this ticket open?
            Thanks,
            ~ jfc.

            jfc John Fuchs-Chesney (Inactive) added a comment - Hello Mahmoud, I see from the comments above that that patch we supplied has been merged. Do you want us to keep this ticket open? Thanks, ~ jfc.

            People

              jay Jinshan Xiong (Inactive)
              mhanafi Mahmoud Hanafi
              Votes:
              0 Vote for this issue
              Watchers:
              10 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: