Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-1480

failure on replay-single test_74: ASSERTION( cfs_atomic_read(&d->ld_ref) == 0 ) failed: Refcount is 1

Details

    • Bug
    • Resolution: Fixed
    • Critical
    • Lustre 2.5.0
    • Lustre 2.4.0, Lustre 2.4.1
    • 3
    • 4293

    Description

      This issue was created by maloo for sarah <sarah@whamcloud.com>

      This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/8506fd4e-ad5b-11e1-8152-52540035b04c.

      The sub-test test_74 failed with the following error:

      test failed to respond and timed out

      Info required for matching: replay-single 74

      Attachments

        Issue Links

          Activity

            [LU-1480] failure on replay-single test_74: ASSERTION( cfs_atomic_read(&d->ld_ref) == 0 ) failed: Refcount is 1
            bobijam Zhenyu Xu added a comment -

            pushed a debug patch at http://review.whamcloud.com/6105

            bobijam Zhenyu Xu added a comment - pushed a debug patch at http://review.whamcloud.com/6105
            mdiep Minh Diep added a comment -

            I hit this very frequent using fc18 client running sanity test

            mdiep Minh Diep added a comment - I hit this very frequent using fc18 client running sanity test
            yujian Jian Yu added a comment -

            Lustre Build: https://build.whamcloud.com/job/lustre-master/1340/

            Server: el6, x86_64
            clients: fc18, x86_64

            The issue occurred again and was reported in LU-3116.

            yujian Jian Yu added a comment - Lustre Build: https://build.whamcloud.com/job/lustre-master/1340/ Server: el6, x86_64 clients: fc18, x86_64 The issue occurred again and was reported in LU-3116 .
            pjones Peter Jones added a comment -

            Dropping priority as no longer occurring regularly

            pjones Peter Jones added a comment - Dropping priority as no longer occurring regularly
            bobijam Zhenyu Xu added a comment -

            haven't seen this issue for recent tests, I think we can lower the severity.

            bobijam Zhenyu Xu added a comment - haven't seen this issue for recent tests, I think we can lower the severity.
            adilger Andreas Dilger added a comment - Bobijam, this bug was reported hit 14 times in the past week, according to: https://maloo.whamcloud.com/test_sets/query?utf8=%E2%9C%93&test_set[test_set_script_id]=&test_set[status]=&test_set[query_bugs]=LU-1480&test_session[test_host]=&test_session[test_group]=&test_session[user_id]=&test_session[query_date]=&test_session[query_recent_period]=&test_node[os_type_id]=&test_node[distribution_type_id]=&test_node[architecture_type_id]=&test_node[file_system_type_id]=&test_node[lustre_branch_id]=&test_node_network[network_type_id]=&commit=Update+results The most recent is at: https://maloo.whamcloud.com/test_sets/8ab52280-3536-11e2-918f-52540035b04c Can you please check if the information you need is in one of these failures. If not, is there something that can be done to improve the debugging patch to capture the information you need?
            bobijam Zhenyu Xu added a comment -

            Didn't find it, the failure illustrated in the above maloo report is that a client cannot finish inode sync while trying to umount the mount point, not relating to the device refcount issue.

            bobijam Zhenyu Xu added a comment - Didn't find it, the failure illustrated in the above maloo report is that a client cannot finish inode sync while trying to umount the mount point, not relating to the device refcount issue.

            Bobijam, can you please look at the https://maloo.whamcloud.com/test_sets/be5714e2-2ce7-11e2-9af4-52540035b04c to see if your debugging patch contains the information you need to resolve this problem.

            adilger Andreas Dilger added a comment - Bobijam, can you please look at the https://maloo.whamcloud.com/test_sets/be5714e2-2ce7-11e2-9af4-52540035b04c to see if your debugging patch contains the information you need to resolve this problem.
            ian Ian Colle (Inactive) added a comment - https://maloo.whamcloud.com/test_sets/be5714e2-2ce7-11e2-9af4-52540035b04c
            bobijam Zhenyu Xu added a comment - - edited

            status update:

            A debugging patch has landed in master branch and waiting for re-hits with the debug message.

            bobijam Zhenyu Xu added a comment - - edited status update: A debugging patch has landed in master branch and waiting for re-hits with the debug message.

            Alex reported in LU-2070:

            please use http://review.whamcloud.com/4151 to debug

            adilger Andreas Dilger added a comment - Alex reported in LU-2070 : please use http://review.whamcloud.com/4151 to debug

            People

              bobijam Zhenyu Xu
              maloo Maloo
              Votes:
              0 Vote for this issue
              Watchers:
              13 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: