[LU-483] Lustre quota not usable Created: 05/Jul/11  Updated: 10/Oct/11  Resolved: 10/Oct/11

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.0.0
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Patrick Valentin (Inactive) Assignee: Johann Lombardi (Inactive)
Resolution: Fixed Votes: 0
Labels: None

Attachments: File syslog.curie.gz    
Severity: 3
Rank (Obsolete): 6589

 Description   

The CEA Bull customer is complaining that quotas are not working properly with Lustre 2.0 Bull.
Until last week, quotas were disabled because they were experiencing frequent system crashes on some I/O servers.
We delivered an Efix containing the patch proposed in LU-369, after which they re-enabled quotas and ran some tests.
There are no longer any system crashes, but a user is able to write more than the quotas allow.

Here is the description of their tests:
1. starting of the fs
2. quotaoff/quotacheck/quotaon/quotacheck
3. set the quota for a user who has already exceeded his quota
4. this user is able to continue to write
5. set the user quota to a value higher than the capacity actually used
6. this user is able to write and to exceed his quota

I attached the syslog of the IO servers they provided (MDS:curie113, OSS:curie200-207).



 Comments   
Comment by Johann Lombardi (Inactive) [ 05/Jul/11 ]

> Here is the description of their tests:
> 1. starting of the fs
> 2. quotaoff/quotacheck/quotaon/quotacheck
> 3. set the quota for a user who has already exceeded his quota
> 4. this user is able to continue to write

Please note that there is no integration between the data writeback cache and quota. This means that a user can overrun his quota limit, up to #clients x #OSC x max_dirty_mb in the worst case scenario.
Could you please tell us how much data you tried to write? Do you eventually get "quota exceeded" if you write more than 32MB from one client? Could you please give us the output of lfs quota -u $username?
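
The worst-case bound described above (#clients x #OSC x max_dirty_mb) can be sketched as a small calculation. This is only an illustration: the function name and the client/OSC counts are hypothetical, and 32 MB is assumed as the per-OSC max_dirty_mb value because that is the figure used in the question above.

```python
def worst_case_overrun_mb(num_clients: int, oscs_per_client: int, max_dirty_mb: int) -> int:
    """Upper bound (in MB) on dirty data that clients may cache past the quota limit.

    Implements the product #clients x #OSC x max_dirty_mb from the comment above.
    """
    if min(num_clients, oscs_per_client, max_dirty_mb) < 0:
        raise ValueError("all inputs must be non-negative")
    return num_clients * oscs_per_client * max_dirty_mb


# Hypothetical values, not taken from the customer's system:
# one client with one OSC and max_dirty_mb assumed to be 32.
single_client = worst_case_overrun_mb(1, 1, 32)

# Hypothetical larger case: 100 clients, 16 OSCs per client, same max_dirty_mb.
many_clients = worst_case_overrun_mb(100, 16, 32)

print(single_client, many_clients)
```

With these made-up numbers the bound is 32 MB for the single client and 51200 MB for the larger case, which is why the amount of data written matters when judging whether an overrun is expected.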

Comment by Patrick Valentin (Inactive) [ 08/Jul/11 ]

Hi Johann,

I got in touch with our on-site support and it seems it was a copy of about 1 TB.
They will transmit your questions to the customer.

Patrick

Comment by Sebastien Buisson (Inactive) [ 12/Jul/11 ]

No news from the customer since last Friday.

Comment by Sebastien Buisson (Inactive) [ 21/Jul/11 ]

Hi,

Is there any way to check per-OST quota accounting? An internal file stored in the MDT or in each OST for instance?
That would help diagnose this problem.

TIA,
Sebastien.

Comment by Johann Lombardi (Inactive) [ 21/Jul/11 ]

Sure, lfs quota -uv $username /path should return that information.

Comment by Johann Lombardi (Inactive) [ 10/Oct/11 ]

Any news or shall we just close this bug? Please advise.

Comment by Patrick Valentin (Inactive) [ 10/Oct/11 ]

After one of the latest EFIX installations at CEA, during September, they ran some quota tests and quotas seem to work correctly.
This ticket can be closed as the problem was not reproduced (the system was probably not at the latest EFIX level).
Cheers

Comment by Peter Jones [ 10/Oct/11 ]

Good news! Thanks Patrick!

Generated at Sat Feb 10 01:07:29 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.