Details
-
Bug
-
Resolution: Fixed
-
Critical
-
Lustre 2.5.4
-
None
-
RHEL-6.6, lustre-2.5.4
-
2
-
9223372036854775807
Description
We had 4 OSTs that we deactivated because of an imbalance in utilization that was causing ENOSPC messages to our users. We identified a file that was consuming a significant amount of space that we deleted while the OSTs were deactivated. The file is no longer seen in the directory structure (the MDS processed the request), but the objects on the OSTs were not marked as free. After re-activating the OSTs, it doesn't appear that the llog was flushed, which should free up those objects.
At this time, some users are not able to run jobs because they cannot allocated any space.
We understand how this is supposed to work, but as the user in LU-4295 pointed out, it is not.
Please advise.
Well, I was trying to reproduce that locally and objects are not deleted while OSP is deactivated but they are deleted immediately when I re-activate OSP back. I used 'lctl --device <osp device> deactivate' command to deactivate an OSP. Then destroy big file that was previously created on that OST. The 'df' shows that space on related OST is not freed, after that I re-activated OSP back and 'df' shows space is returned back. Any thoughts what else may affect that?