Lustre / LU-8856

ZFS-MDT 100% full. Cannot delete files.

Details

    • Type: Bug
    • Resolution: Fixed
    • Priority: Critical
    • Fix Version/s: Lustre 2.11.0, Lustre 2.10.4
    • Affects Version/s: Lustre 2.8.0
    • Environment: CentOS 6.8 2.6.32_504.30.3.el6.x86_64, Lustre 2.8.0 (g0bcd520), ZFS 0.6.5.4-1
    • Severity: 2

    Description

      End Customer: MSU (Michigan State Univ)

      A user generated a very large number of small files and exhausted the available inodes on the MDT (single MDT, no DNE). Every attempt to delete files, even as root, fails.

      I looked at LU-8787 and LU-8714, but neither seems to match this case closely enough.

      zdb -d ls15-mds-00.mdt/mdt
      Dataset ls15-mds-00.mdt/mdt [ZPL], ID 66, cr_txg 20442, 2.82T, 280362968 objects

      ls15-mds-00.mdt/mdt 2.82T 0 2.82T /ls15-mds-00.mdt/mdt

      [root@lac-373 roth]# lfs df -i
      UUID                 Inodes     IUsed  IFree  IUse%  Mounted on
      ls15-MDT0000_UUID 280362968 280362968      0   100%  /mnt/ls15[MDT:0]
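
For anyone who wants to spot this condition early, here is a minimal sketch of a filter that flags targets with no free inodes in `lfs df -i` output. It assumes the column layout shown above (UUID, Inodes, IUsed, IFree, IUse%, Mounted on); the function name `full_targets` is made up for illustration.

```shell
# Hypothetical helper: read `lfs df -i`-style text on stdin and print
# every target whose IFree column (field 4) is zero.
# Assumed columns: UUID Inodes IUsed IFree IUse% Mounted-on
full_targets() {
    # Only consider per-target rows (UUID ends in _UUID) with a numeric
    # Inodes column; this skips the header and any summary line.
    awk '$1 ~ /_UUID$/ && $2 ~ /^[0-9]+$/ && $4 == 0 {
        print $1, "inodes=" $2, "used=" $3
    }'
}

# Example invocation on a client (not executed here):
#   lfs df -i /mnt/ls15 | full_targets
```

With the output above, the only line printed would be the one for ls15-MDT0000_UUID.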

      But we can't remove any files:

      [root@lac-000 1mk5_5998]# rm tor.mat
      rm: cannot remove `tor.mat': No space left on device

      I'm going to try deregistering the changelog, which might free up enough space for the MDT to process some file deletions. If anyone has other best practices, please advise.
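
A rough sketch of the changelog step, to be run on the MDS. The device name `ls15-MDT0000` is inferred from the UUID in the `lfs df -i` output, and the changelog user ID `cl1` is purely a placeholder; the real ID has to be read from `changelog_users` first. The `run` wrapper is a hypothetical helper that only prints the commands unless `RUN=1` is set. Whether this frees enough space for deletions to proceed is not confirmed.

```shell
# Assumed values, adjust before use.
MDT=${MDT:-ls15-MDT0000}   # inferred from ls15-MDT0000_UUID above
CL_USER=${CL_USER:-cl1}    # placeholder changelog user ID

# Print each command instead of executing it unless RUN=1 is set.
run() {
    if [ "${RUN:-0}" = "1" ]; then
        "$@"
    else
        echo "would run: $*"
    fi
}

changelog_cleanup() {
    # List the registered changelog users and their current indexes.
    run lctl get_param "mdd.${MDT}.changelog_users"
    # Deregister one user so its pending changelog records can be purged.
    run lctl --device "$MDT" changelog_deregister "$CL_USER"
}
```

Calling `changelog_cleanup` with no environment set only prints the two `lctl` commands, which makes it easy to review them before running with `RUN=1`.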

            People

              Assignee: Alex Zhuravlev (bzzz)
              Reporter: Jeff Johnson (aeonjeffj, Inactive)