Lustre / LU-6939

nrs_tbf.c:155:nrs_tbf_cli_reset()) ASSERTION( cli->tc_rule == ((void *)0) ) failed

Details

    • Type: Bug
    • Resolution: Fixed
    • Priority: Minor
    • Fix Version/s: Lustre 2.8.0
    • Affects Version/s: Lustre 2.8.0
    • Labels: None
    • Severity: 3
    • Rank (Obsolete): 9223372036854775807

    Description

      Just hit this in sanityn.sh test 77f, running master with a couple of unrelated patches.

      <4>[22133.708730] Lustre: DEBUG MARKER: == sanityn test 77f: check TBF JobID nrs policy == 04:53:19 (1438419199)
      <4>[22133.874062] LNet: 22701:0:(nidstrings.c:271:parse_nidrange()) can't parse nidrange: "iozone.500"
      <4>[22135.187825] Lustre: DEBUG MARKER: cancel_lru_locks osc start
      <0>[22135.320574] LustreError: 27616:0:(nrs_tbf.c:155:nrs_tbf_cli_reset()) ASSERTION( cli->tc_rule == ((void *)0) ) failed: 
      <0>[22135.322270] LustreError: 27616:0:(nrs_tbf.c:155:nrs_tbf_cli_reset()) LBUG
      <0>[22135.326308] Kernel panic - not syncing: LBUG in interrupt.
      <0>[22135.326312] 
      <4>[22135.327246] Pid: 27616, comm: ll_ost00_004 Tainted: P           ---------------    2.6.32-rhe6.6-debug #1
      <4>[22135.327671] Call Trace:
      <4>[22135.327914]  [<ffffffff8151dcd9>] ? panic+0xa7/0x16f
      <4>[22135.328502]  [<ffffffffa0c04ecd>] ? lbug_with_loc+0x8d/0xb0 [libcfs]
      <4>[22135.328893]  [<ffffffffa05e6e68>] ? nrs_tbf_cli_reset+0xb8/0x120 [ptlrpc]
      <4>[22135.330302]  [<ffffffffa05e7560>] ? nrs_tbf_res_get+0x270/0x300 [ptlrpc]
      <4>[22135.330302]  [<ffffffff81041d01>] ? native_patch+0x151/0x180
      <4>[22135.330302]  [<ffffffffa05d8d26>] ? nrs_resource_get+0x56/0x110 [ptlrpc]
      <4>[22135.330302]  [<ffffffffa05da41e>] ? nrs_resource_get_safe+0x8e/0x100 [ptlrpc]
      <4>[22135.330302]  [<ffffffffa059b120>] ? lustre_swab_niobuf_remote+0x0/0x30 [ptlrpc]
      <4>[22135.330302]  [<ffffffffa05dcf3b>] ? ptlrpc_nrs_req_hp_move+0x6b/0x210 [ptlrpc]
      <4>[22135.330302]  [<ffffffffa05bfa95>] ? req_capsule_client_get+0x15/0x20 [ptlrpc]
      <4>[22135.330302]  [<ffffffffa0575458>] ? ldlm_server_blocking_ast+0x228/0x8b0 [ptlrpc]
      <4>[22135.330302]  [<ffffffffa05f8279>] ? tgt_blocking_ast+0x1b9/0x8c0 [ptlrpc]
      <4>[22135.330302]  [<ffffffffa054823d>] ? ldlm_work_bl_ast_lock+0xdd/0x290 [ptlrpc]
      <4>[22135.330302]  [<ffffffffa058c7d7>] ? ptlrpc_set_wait+0x77/0x9d0 [ptlrpc]
      <4>[22135.330302]  [<ffffffff811766c4>] ? kmem_cache_alloc_node_trace+0x144/0x210
      <4>[22135.330302]  [<ffffffffa0583a6f>] ? ptlrpc_prep_set+0x5f/0x290 [ptlrpc]
      <4>[22135.330302]  [<ffffffff8109d704>] ? __init_waitqueue_head+0x24/0x40
      <4>[22135.330302]  [<ffffffffa0583af3>] ? ptlrpc_prep_set+0xe3/0x290 [ptlrpc]
      <4>[22135.330302]  [<ffffffffa0548160>] ? ldlm_work_bl_ast_lock+0x0/0x290 [ptlrpc]
      <4>[22135.330302]  [<ffffffffa0544eff>] ? ldlm_run_ast_work+0xcf/0x440 [ptlrpc]
      <4>[22135.330302]  [<ffffffffa0562abf>] ? ldlm_process_extent_lock+0x1bf/0xab0 [ptlrpc]
      <4>[22135.330302]  [<ffffffffa054b23e>] ? ldlm_lock_enqueue+0x3fe/0x860 [ptlrpc]
      <4>[22135.330302]  [<ffffffffa0576a57>] ? ldlm_handle_enqueue0+0x7e7/0x1520 [ptlrpc]
      <4>[22135.330302]  [<ffffffffa05fcfe1>] ? tgt_enqueue+0x61/0x230 [ptlrpc]
      <4>[22135.330302]  [<ffffffffa05fdbf2>] ? tgt_request_handle+0xa42/0x1230 [ptlrpc]
      <4>[22135.330302]  [<ffffffffa05a94a4>] ? ptlrpc_main+0xd74/0x1850 [ptlrpc]
      <4>[22135.330302]  [<ffffffffa05a8730>] ? ptlrpc_main+0x0/0x1850 [ptlrpc]
      <4>[22135.330302]  [<ffffffff8109ce4e>] ? kthread+0x9e/0xc0
      <4>[22135.330302]  [<ffffffff8100c24a>] ? child_rip+0xa/0x20
      <4>[22135.330302]  [<ffffffff8109cdb0>] ? kthread+0x0/0xc0
      <4>[22135.330302]  [<ffffffff8100c240>] ? child_rip+0x0/0x20
      

      crashdump is in /exports/crashdumps/192.168.10.224-2015-08-01-04\:53\:23/
      tag in my tree master-20150731
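
The failed assertion requires that a client is not linked to any rule at the moment it is attached to a new one, and the patch that later resolved this ticket is titled "add lock to protect TBF rule linkage". The following is a minimal user-space sketch of that general idea, not the Lustre code: `struct rule`, `struct client`, `linkage_lock`, `client_change_rule()` and `run_demo()` are hypothetical names invented for illustration, and a pthread mutex stands in for whatever lock the real patch uses. It shows an unlink-then-link sequence kept atomic so that two threads switching the same client can never observe a half-updated link.

```c
#include <assert.h>
#include <pthread.h>
#include <stddef.h>

/* Hypothetical, simplified stand-ins; not the real Lustre structures. */
struct rule {
    int rate;
    int refs;               /* number of clients currently linked to this rule */
};

struct client {
    struct rule *tc_rule;   /* NULL while the client is linked to no rule */
};

/* Assumption of this sketch: one lock guards every linkage change. */
static pthread_mutex_t linkage_lock = PTHREAD_MUTEX_INITIALIZER;

/* Detach the client from its current rule, if any. Caller holds linkage_lock. */
static void client_unlink_locked(struct client *cli)
{
    if (cli->tc_rule != NULL) {
        cli->tc_rule->refs--;
        cli->tc_rule = NULL;
    }
}

/* Attach the client to a rule. Caller holds linkage_lock. */
static void client_link_locked(struct client *cli, struct rule *r)
{
    /* Same shape as the failed assertion: the client must be unlinked here. */
    assert(cli->tc_rule == NULL);
    cli->tc_rule = r;
    r->refs++;
}

/* Switch rules with unlink and link performed under a single lock hold. */
static void client_change_rule(struct client *cli, struct rule *r)
{
    pthread_mutex_lock(&linkage_lock);
    client_unlink_locked(cli);
    client_link_locked(cli, r);
    pthread_mutex_unlock(&linkage_lock);
}

struct worker_arg {
    struct client *cli;
    struct rule *r;
};

/* Repeatedly move the shared client onto this worker's rule. */
static void *worker(void *p)
{
    struct worker_arg *a = p;

    for (int i = 0; i < 100000; i++)
        client_change_rule(a->cli, a->r);
    return NULL;
}

/* Returns 0 if the linkage is consistent after two threads fight over one client. */
int run_demo(void)
{
    struct rule r1 = { .rate = 100, .refs = 0 };
    struct rule r2 = { .rate = 200, .refs = 0 };
    struct client cli = { .tc_rule = NULL };
    struct worker_arg a1 = { &cli, &r1 };
    struct worker_arg a2 = { &cli, &r2 };
    pthread_t t1, t2;

    if (pthread_create(&t1, NULL, worker, &a1) != 0)
        return -1;
    if (pthread_create(&t2, NULL, worker, &a2) != 0) {
        pthread_join(t1, NULL);
        return -1;
    }
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);

    /* Exactly one rule must hold exactly one reference to the client. */
    if (cli.tc_rule == NULL || r1.refs + r2.refs != 1 || cli.tc_rule->refs != 1)
        return 1;
    return 0;
}
```

Calling `run_demo()` from a small `main()` and building with `-pthread` exercises the sketch. Without the mutex, the unlink and link steps of the two threads could interleave, which is the kind of window in which an assertion like the one in the trace above can fire.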

          Activity

            bogl Bob Glossman (Inactive) added a comment - on b2_7_fe: http://review.whamcloud.com/21054

            jgmitter Joseph Gmitter (Inactive) added a comment - Landed for 2.8.0

            gerrit Gerrit Updater added a comment - Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/17596/
            Subject: LU-6939 nrs: add lock to protect TBF rule linkage
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 839360d35e27087e8266b02c5dd69e5e940577c5

            emoly.liu Emoly Liu added a comment - Another failure: https://testing.hpdd.intel.com/test_sets/f2558890-ad78-11e5-9134-5254006e85c2

            gerrit Gerrit Updater added a comment - Li Xi (lixi@ddn.com) uploaded a new patch: http://review.whamcloud.com/17596
            Subject: LU-6939 nrs: add lock to protect TBF rule linkage
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 48ea72f8dbf8e4a2732e096a54558fc7f7388d0c

            yujian Jian Yu added a comment - Another instance on master branch: https://testing.hpdd.intel.com/test_sets/3e2f8460-8e7e-11e5-978b-5254006e85c2
            jamesanunez James Nunez (Inactive) added a comment (edited) - More failures on master. Logs at:
            2015-11-04 00:50:08 - https://testing.hpdd.intel.com/test_sets/1f879970-82c7-11e5-8da0-5254006e85c2
            2015-11-19 20:59:26 - https://testing.hpdd.intel.com/test_sets/3384b944-8f2e-11e5-815c-5254006e85c2
            2015-11-20 22:47:18 - https://testing.hpdd.intel.com/test_sets/889ef798-902b-11e5-a833-5254006e85c2
            2015-11-24 11:09:25 - https://testing.hpdd.intel.com/test_sets/6513da0a-92d8-11e5-afc2-5254006e85c2
            2015-12-24 10:20:27 - https://testing.hpdd.intel.com/test_sets/fbdf44ea-aa54-11e5-a386-5254006e85c2
            2015-12-28 10:06:23 - https://testing.hpdd.intel.com/test_sets/f2558890-ad78-11e5-9134-5254006e85c2
            2016-01-05 19:34:38 - https://testing.hpdd.intel.com/test_sets/3476120a-b411-11e5-8114-5254006e85c2

            jamesanunez James Nunez (Inactive) added a comment - Just hit this same LBUG with sanityn test_77e. Logs are at https://testing.hpdd.intel.com/test_sets/14ec05ac-73bb-11e5-8f32-5254006e85c2
            bogl Bob Glossman (Inactive) added a comment - another seen on master: https://testing.hpdd.intel.com/test_sets/d9169bdc-6e42-11e5-8442-5254006e85c2
            jamesanunez James Nunez (Inactive) added a comment - We've hit this once in review-zfs-part-1. Logs are at: https://testing.hpdd.intel.com/test_sets/c9807ec0-3a0a-11e5-9795-5254006e85c2

            People

              Assignee: wc-triage WC Triage
              Reporter: green Oleg Drokin
              Votes: 0
              Watchers: 13
