Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-4272

lu_device_fini()) ASSERTION( cfs_atomic_read(&d->ld_ref) == 0 ) failed from lovsub_device_free

Details

    • Bug
    • Resolution: Duplicate
    • Critical
    • Lustre 2.8.0
    • Lustre 2.6.0
    • 3
    • 11737

    Description

      Apparently we have a problem with inode cleaning on unmount exposed by at least nfs.

      The easiest reproducer I have right now is this:

      sh llmount.sh
      echo 0 >/proc/sys/lnet/panic_on_lbug
      service nfs start
      mount localhost:/mnt/lustre /mnt/nfs -t nfs
      touch /mnt/nfs/file
      ls -l /mnt/lustre
      cp -f /etc/passwd /mnt/nfs/file
      
      umount /mnt/nfs
      service nfs stop
      sh llmountcleanup.sh
      

      Reading the logs, It appears that after the write file dentry is deleted, but because we have locks on inode - it stays in place (unlike when we do not use nfs where it also goes away after sync).
      Then on unmount file inode is found, but is considered dirty and not cleaned, meaning the layout lock and objects remain in place.
      Now we get to kill_super and that blows up trying to clean up lov.

      Attachments

        Issue Links

          Activity

            [LU-4272] lu_device_fini()) ASSERTION( cfs_atomic_read(&d->ld_ref) == 0 ) failed from lovsub_device_free
            green Oleg Drokin added a comment -

            it was fixed by LU-6794 which is essentially the same patch.

            green Oleg Drokin added a comment - it was fixed by LU-6794 which is essentially the same patch.
            jgmitter Joseph Gmitter (Inactive) added a comment - - edited

            Oleg is testing patch from LU-6794 to see if it resolves this issue. He will close this as a duplicate if it is resolved.

            jgmitter Joseph Gmitter (Inactive) added a comment - - edited Oleg is testing patch from LU-6794 to see if it resolves this issue. He will close this as a duplicate if it is resolved.

            Oleg, if you are no longer able to reproduce this, please close it with "Cannot Reproduce" to get it off the tracking list.

            adilger Andreas Dilger added a comment - Oleg, if you are no longer able to reproduce this, please close it with "Cannot Reproduce" to get it off the tracking list.

            Oleg,
            Is this problem still occurring? If so, would you be able to provide the logs for Niu?
            Thank you!

            jlevi Jodi Levi (Inactive) added a comment - Oleg, Is this problem still occurring? If so, would you be able to provide the logs for Niu? Thank you!

            It's strange, I tried again, but it still can't be reproduced in my test environment. Oleg, could you post the log here for analysis? Thank you.

            niu Niu Yawei (Inactive) added a comment - It's strange, I tried again, but it still can't be reproduced in my test environment. Oleg, could you post the log here for analysis? Thank you.
            green Oleg Drokin added a comment -

            The problem still exists, I just tried all the steps on current master and it still fails in the exact same way

            green Oleg Drokin added a comment - The problem still exists, I just tried all the steps on current master and it still fails in the exact same way

            Has this been reproduced? Does the problem still exist or should this ticket be closed?

            jlevi Jodi Levi (Inactive) added a comment - Has this been reproduced? Does the problem still exist or should this ticket be closed?

            I can't reproduce it with current master. Oleg, did you test it with clean master?

            niu Niu Yawei (Inactive) added a comment - I can't reproduce it with current master. Oleg, did you test it with clean master?

            This might be a side effect of the LU-2613 patch landing.

            adilger Andreas Dilger added a comment - This might be a side effect of the LU-2613 patch landing.
            pjones Peter Jones added a comment -

            Niu

            Is this related to your patch for LU-2613?

            Peter

            pjones Peter Jones added a comment - Niu Is this related to your patch for LU-2613 ? Peter

            People

              niu Niu Yawei (Inactive)
              green Oleg Drokin
              Votes:
              0 Vote for this issue
              Watchers:
              10 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: