Details
-
Bug
-
Resolution: Fixed
-
Critical
-
Lustre 2.8.0
-
CentOS 6.8 2.6.32_504.30.3.el6.x86_64, Lustre 2.8.0 (g0bcd520), ZFS 0.6.5.4-1
Description
End Customer: MSU (Michigan State Univ)
A user generated tons of small files and exhausted the available inodes of the MDT (single MDT, no DNE). Any attempts at deleting files as root fail.
I looked at LU-8787 and LU-8714 but they don't seem to follow this closely enough.
zdb -d ls15-mds-00.mdt/mdt
Dataset ls15-mds-00.mdt/mdt [ZPL], ID 66, cr_txg 20442, 2.82T, 280362968 objects
ls15-mds-00.mdt/mdt 2.82T 0 2.82T /ls15-mds-00.mdt/mdt
[root@lac-373 roth]# lfs df -i
UUID Inodes IUsed IFree IUse% Mounted on
ls15-MDT0000_UUID 280362968 280362968 0 100% /mnt/ls15[MDT:0]
But we can't remove any files:
[root@lac-000 1mk5_5998]# rm tor.mat
rm: cannot remove `tor.mat': No space left on device
I'm going to take a stab at deregistering the changelog which might free up enough space to get the MDT able to process some file deletions. If anyone has any other 'best practices' please advise.