Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-7170

recovery-random-scale test_fail_client_mds: (osc_cache.c:3140:discard_cb()) LBUG

    XMLWordPrintable

Details

    • Bug
    • Resolution: Duplicate
    • Minor
    • None
    • Lustre 2.8.0
    • None
    • client and server: lustre-master build# 3175 RHEL7 ldiskfs
    • 3
    • 9223372036854775807

    Description

      This issue was created by maloo for sarah_lw <wei3.liu@intel.com>

      This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/4af039d6-54a6-11e5-b25e-5254006e85c2.

      The sub-test test_fail_client_mds failed with the following error:

      test_fail_client_mds returned 4
      

      client 1 console

      00:28:24:[  750.886359] LustreError: 166-1: MGC10.1.4.79@tcp: Connection to MGS (at 10.1.4.75@tcp) was lost; in progress operations using this service will fail
      00:28:25:[  751.122374] Lustre: Evicted from MGS (at 10.1.4.75@tcp) after server handle changed from 0x602be9f99d06d6d0 to 0x602be9f99d149c0d
      00:28:25:[  751.171984] LustreError: 2409:0:(file.c:184:ll_close_inode_openhandle()) lustre-clilmv-ffff88007a155000: inode [0x200000bd1:0x4:0x0] mdc close failed: rc = -108
      00:28:25:[  751.719717] LustreError: 2368:0:(osc_cache.c:3140:discard_cb()) ASSERTION( (!(page->cp_type == CPT_CACHEABLE) || (!PageDirty(cl_page_vmpage(page)))) ) failed: 
      00:28:25:[  751.721020] LustreError: 2368:0:(osc_cache.c:3140:discard_cb()) LBUG
      00:28:25:[  751.722112] Pid: 2368, comm: ldlm_bl_04
      00:28:25:[  751.722463] 
      00:28:25:[  751.722463] Call Trace:
      00:28:25:[  751.722852]  [<ffffffffa05397d3>] libcfs_debug_dumpstack+0x53/0x80 [libcfs]
      00:28:25:[  751.723451]  [<ffffffffa0539d75>] lbug_with_loc+0x45/0xc0 [libcfs]
      00:28:25:[  751.723994]  [<ffffffffa0b7cb71>] discard_cb+0x111/0x150 [osc]
      00:28:25:[  751.724503]  [<ffffffffa0b8d410>] osc_page_gang_lookup+0x1e0/0x320 [osc]
      00:28:25:[  751.725070]  [<ffffffffa0b7ca60>] ? discard_cb+0x0/0x150 [osc]
      00:28:25:[  751.725559]  [<ffffffffa0b8d669>] osc_lock_discard_pages+0x119/0x218 [osc]
      00:28:25:[  751.726134]  [<ffffffffa0b7ca60>] ? discard_cb+0x0/0x150 [osc]
      00:28:25:[  751.726800]  [<ffffffffa0b763e9>] osc_lock_flush+0x89/0x280 [osc]
      00:28:27:[  751.727327]  [<ffffffffa0b769a3>] osc_ldlm_blocking_ast+0x2e3/0x3a0 [osc]
      00:28:27:[  751.727983]  [<ffffffffa0863f3d>] ldlm_cancel_callback+0x6d/0x150 [ptlrpc]
      00:28:27:[  751.728589]  [<ffffffffa0870360>] ldlm_cli_cancel_local+0xa0/0x420 [ptlrpc]
      00:28:27:[  751.729229]  [<ffffffffa08761bf>] ldlm_cli_cancel+0x6f/0x350 [ptlrpc]
      00:28:28:[  751.729799]  [<ffffffffa0b7683a>] osc_ldlm_blocking_ast+0x17a/0x3a0 [osc]
      00:28:28:[  751.730410]  [<ffffffffa0879adf>] ldlm_handle_bl_callback+0xcf/0x410 [ptlrpc]
      00:28:28:[  751.731063]  [<ffffffffa087a2b8>] ldlm_bl_thread_main+0x498/0x910 [ptlrpc]
      00:28:28:[  751.731675]  [<ffffffff810a9500>] ? default_wake_function+0x0/0x20
      00:28:28:[  751.732230]  [<ffffffffa0879e20>] ? ldlm_bl_thread_main+0x0/0x910 [ptlrpc]
      00:28:28:[  751.732856]  [<ffffffff8109726f>] kthread+0xcf/0xe0
      00:28:28:[  751.733303]  [<ffffffff810971a0>] ? kthread+0x0/0xe0
      00:28:28:[  751.733758]  [<ffffffff81614158>] ret_from_fork+0x58/0x90
      00:28:28:[  751.734262]  [<ffffffff810971a0>] ? kthread+0x0/0xe0
      00:28:28:[  751.734682] 
      00:28:28:[  751.900650] Kernel panic - not syncing: LBUG
      00:28:28:[  751.901120] CPU: 1 PID: 2368 Comm: ldlm_bl_04 Tainted: GF          O--------------   3.10.0-229.7.2.el7.x86_64 #1
      00:28:28:[  751.901120] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2007
      00:28:28:[  751.901120]  ffffffffa0556ecf 00000000af4ad058 ffff88004f073a78 ffffffff81604386
      00:28:28:[  751.901120]  ffff88004f073af8 ffffffff815fdc2a ffffffff00000008 ffff88004f073b08
      00:28:28:[  751.901120]  ffff88004f073aa8 00000000af4ad058 ffffffffa0b94004 0000000000000246
      00:28:28:[  751.901120] Call Trace:
      00:28:28:[  751.901120]  [<ffffffff81604386>] dump_stack+0x19/0x1b
      00:28:28:[  751.901120]  [<ffffffff815fdc2a>] panic+0xd8/0x1e7
      00:28:28:[  751.901120]  [<ffffffffa0539ddb>] lbug_with_loc+0xab/0xc0 [libcfs]
      00:28:28:[  751.901120]  [<ffffffffa0b7cb71>] discard_cb+0x111/0x150 [osc]
      00:28:28:[  751.901120]  [<ffffffffa0b8d410>] osc_page_gang_lookup+0x1e0/0x320 [osc]
      00:28:29:[  751.901120]  [<ffffffffa0b7ca60>] ? check_and_discard_cb+0x150/0x150 [osc]
      00:28:29:[  751.901120]  [<ffffffffa0b8d669>] osc_lock_discard_pages+0x119/0x218 [osc]
      00:28:29:[  751.901120]  [<ffffffffa0b7ca60>] ? check_and_discard_cb+0x150/0x150 [osc]
      00:28:29:[  751.901120]  [<ffffffffa0b763e9>] osc_lock_flush+0x89/0x280 [osc]
      00:28:29:[  751.901120]  [<ffffffffa0b769a3>] osc_ldlm_blocking_ast+0x2e3/0x3a0 [osc]
      00:28:29:[  751.901120]  [<ffffffffa0863f3d>] ldlm_cancel_callback+0x6d/0x150 [ptlrpc]
      00:28:29:[  751.901120]  [<ffffffffa0870360>] ldlm_cli_cancel_local+0xa0/0x420 [ptlrpc]
      00:28:29:[  751.901120]  [<ffffffffa08761bf>] ldlm_cli_cancel+0x6f/0x350 [ptlrpc]
      00:28:29:[  751.901120]  [<ffffffffa0b7683a>] osc_ldlm_blocking_ast+0x17a/0x3a0 [osc]
      00:28:29:[  751.901120]  [<ffffffffa0879adf>] ldlm_handle_bl_callback+0xcf/0x410 [ptlrpc]
      00:28:29:[  751.901120]  [<ffffffffa087a2b8>] ldlm_bl_thread_main+0x498/0x910 [ptlrpc]
      00:28:29:[  751.901120]  [<ffffffff810a9500>] ? wake_up_state+0x20/0x20
      00:28:29:[  751.901120]  [<ffffffffa0879e20>] ? ldlm_handle_bl_callback+0x410/0x410 [ptlrpc]
      00:28:29:[  751.901120]  [<ffffffff8109726f>] kthread+0xcf/0xe0
      00:28:29:[  751.901120]  [<ffffffff810971a0>] ? kthread_create_on_node+0x140/0x140
      00:28:29:[  751.901120]  [<ffffffff81614158>] ret_from_fork+0x58/0x90
      00:28:29:[  751.901120]  [<ffffffff810971a0>] ? kthread_create_on_node+0x140/0x140
      00:28:29:[  751.901120] drm_kms_helper: panic occurred, switching back to text console
      00:28:29:[  751.901120] ------------[ cut here ]------------
      00:28:29:[  751.901120] kernel BUG at arch/x86/mm/pageattr.c:216!
      

      Attachments

        Issue Links

          Activity

            People

              wc-triage WC Triage
              maloo Maloo
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: