[LU-1480] failure on replay-single test_74: ASSERTION( cfs_atomic_read(&d->ld_ref) == 0 ) failed: Refcount is 1

Details

    • Type: Bug
    • Resolution: Fixed
    • Priority: Critical
    • Fix Version/s: Lustre 2.5.0
    • Affects Version/s: Lustre 2.4.0, Lustre 2.4.1
    • 3
    • 4293

    Description

      This issue was created by maloo for sarah <sarah@whamcloud.com>

      This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/8506fd4e-ad5b-11e1-8152-52540035b04c.

      The sub-test test_74 failed with the following error:

      test failed to respond and timed out

      Info required for matching: replay-single 74

          Activity

            adilger Andreas Dilger made changes -
            Link Original: This issue is related to EX-1460 [ EX-1460 ]
            adilger Andreas Dilger made changes -
            Link New: This issue is related to EX-1460 [ EX-1460 ]
            green Oleg Drokin made changes -
            Link New: This issue is related to LU-3228 [ LU-3228 ]
            pjones Peter Jones made changes -
            Resolution New: Fixed [ 1 ]
            Status Original: Reopened [ 4 ] New: Resolved [ 5 ]
            pjones Peter Jones added a comment -

            Landed for 2.5.0

            jlevi Jodi Levi (Inactive) made changes -
            Fix Version/s New: Lustre 2.5.0 [ 10295 ]
            Priority Original: Blocker [ 1 ] New: Critical [ 2 ]
            sarah Sarah Liu added a comment -

            Also hit this error when running interop between 2.4.0 server and 2.5 client:
            https://maloo.whamcloud.com/test_sets/a58ce5fe-19c7-11e3-bb73-52540035b04c

            server: 2.4.0
            client: lustre-master build #1652

            19:44:33:LustreError: 10863:0:(lu_object.c:1141:lu_device_fini()) ASSERTION( cfs_atomic_read(&d->ld_ref) == 0 ) failed: Refcount is 1
            19:44:33:LustreError: 10863:0:(lu_object.c:1141:lu_device_fini()) LBUG
            19:44:34:Pid: 10863, comm: umount
            19:44:34:
            19:44:35:Call Trace:
            19:44:35: [<ffffffffa0478895>] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
            19:44:35: [<ffffffffa0478e97>] lbug_with_loc+0x47/0xb0 [libcfs]
            19:44:36: [<ffffffffa05ec4b8>] lu_device_fini+0xb8/0xc0 [obdclass]
            19:44:36: [<ffffffffa05d0727>] ls_device_put+0x87/0x1e0 [obdclass]
            19:44:36: [<ffffffffa05d0a3c>] local_oid_storage_fini+0x1bc/0x270 [obdclass]
            19:44:36: [<ffffffffa0d6bd74>] mgs_fs_cleanup+0x64/0x80 [mgs]
            19:44:36: [<ffffffffa0d68ae0>] mgs_device_fini+0x1d0/0x5a0 [mgs]
            19:44:36: [<ffffffffa05ddba7>] class_cleanup+0x577/0xda0 [obdclass]
            19:44:36: [<ffffffffa05b2b36>] ? class_name2dev+0x56/0xe0 [obdclass]
            19:44:37: [<ffffffffa05df48c>] class_process_config+0x10bc/0x1c80 [obdclass]
            19:44:37: [<ffffffffa05d8cb3>] ? lustre_cfg_new+0x353/0x7e0 [obdclass]
            19:44:38: [<ffffffffa05e01c9>] class_manual_cleanup+0x179/0x6f0 [obdclass]
            19:44:38: [<ffffffffa05b2b36>] ? class_name2dev+0x56/0xe0 [obdclass]
            19:44:38: [<ffffffffa0614d7d>] server_put_super+0x46d/0xf00 [obdclass]
            19:44:38: [<ffffffff8118334b>] generic_shutdown_super+0x5b/0xe0
            19:44:38: [<ffffffff81183436>] kill_anon_super+0x16/0x60
            19:44:39: [<ffffffffa05e2026>] lustre_kill_super+0x36/0x60 [obdclass]
            19:44:39: [<ffffffff81183bd7>] deactivate_super+0x57/0x80
            19:44:39: [<ffffffff811a1c4f>] mntput_no_expire+0xbf/0x110
            19:44:39: [<ffffffff811a26bb>] sys_umount+0x7b/0x3a0
            19:44:39: [<ffffffff8100b072>] system_call_fastpath+0x16/0x1b
            19:44:39:
            19:44:40:Kernel panic - not syncing: LBUG
            19:44:40:Pid: 10863, comm: umount Not tainted 2.6.32-358.6.2.el6_lustre.g230b174.x86_64 #1
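
            The assertion in the trace above fires when a device is finalized while a reference to it is still held. As a minimal sketch of that pattern only (this is not Lustre code; `demo_device` and its helpers are hypothetical names, and C11 atomics stand in for `cfs_atomic_*`), a teardown path can check the counter before releasing the device:

```c
#include <assert.h>
#include <stdatomic.h>

/* Hypothetical stand-in for a reference-counted device.
 * This is NOT the Lustre lu_device structure. */
struct demo_device {
    atomic_int ref;   /* number of outstanding references */
};

static void demo_device_init(struct demo_device *d)
{
    atomic_init(&d->ref, 0);
}

/* Take one reference on the device. */
static void demo_device_get(struct demo_device *d)
{
    atomic_fetch_add(&d->ref, 1);
}

/* Drop one reference; dropping more than were taken is a caller bug. */
static void demo_device_put(struct demo_device *d)
{
    int old = atomic_fetch_sub(&d->ref, 1);
    assert(old > 0);
    (void)old;   /* silence unused-variable warnings under NDEBUG */
}

/* Finalize the device. Returns 0 when no references remain, otherwise
 * the number of leaked references. The real code asserts (LBUG) here
 * instead of returning a value. */
static int demo_device_fini(struct demo_device *d)
{
    return atomic_load(&d->ref);
}
```

            In this sketch, one `demo_device_get()` without a matching `demo_device_put()` makes `demo_device_fini()` return 1, which corresponds to the "Refcount is 1" message in the log.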
            
            yujian Jian Yu made changes -
            Affects Version/s New: Lustre 2.4.1 [ 10294 ]
            yujian Jian Yu added a comment -

            Lustre build: http://build.whamcloud.com/job/lustre-b2_4/44/ (2.4.1 RC1)
            Distro/Arch: RHEL6.4/x86_64 + FC18/x86_64 (Server + Client)

            sanity test 232 hit the same failure:
            https://maloo.whamcloud.com/test_sets/0cbde1d0-14ee-11e3-ac48-52540035b04c

            yujian Jian Yu made changes -
            Labels Original: yuc New: yuc2

            People

              bobijam Zhenyu Xu
              maloo Maloo
              Votes: 0
              Watchers: 13