[LU-338] Lustre client reports wrong file size (zero, or too small) on first stat Created: 17/May/11  Updated: 21/Sep/11  Resolved: 24/May/11

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 1.8.6
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Christopher Morrone Assignee: Niu Yawei (Inactive)
Resolution: Duplicate Votes: 0
Labels: None
Environment:

1.8.5.0-3chaos, RHEL5.5ish (CHAOS4.4-2)


Severity: 3
Rank (Obsolete): 10167

 Description   

I'm getting reports that Lustre is reporting the wrong file size on the first stat (that they know of) after file creation. This seems to be relatively rare and difficult to reproduce, but it is happening at least a few times a week site-wide that we know about.

The reports are a bit fuzzy so far, and I am trying to get more useful info out of folks. Often on the first stat, a file's size will be reported as zero. They claim that occasionally it will be a "power of 2", but smaller than the actual file size. I didn't get a more specific number than that yet.

The claim is that the file creation and stats happened on the same client node.

(Note that our filesystem default stripe count is 2.)

I wish that I had better info than that. I will continue to try to get more information. But I seem to remember that Oak Ridge may have seen the same issue, so I wanted to get a tracker open.
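
A minimal sketch of a probe that could help collect more specific numbers for the symptom described above: write a file of known size, stat it right away on the same node, and record any mismatch. The function names, the probe file naming, and the default sizes are hypothetical and not taken from any site script. On a filesystem that behaves correctly this should report no mismatches, and since the problem is rare, a clean run does not prove much.

```python
import os


def first_stat_sizes(path, nbytes, chunk=1 << 20):
    """Write nbytes to path, close it, and return (written, size from first stat)."""
    written = 0
    buf = b"\0" * chunk
    with open(path, "wb") as f:
        while written < nbytes:
            n = min(chunk, nbytes - written)
            f.write(buf[:n])
            written += n
    # First stat after creation, issued from the same client that wrote the file.
    reported = os.stat(path).st_size
    return written, reported


def is_power_of_two(n):
    """True for 1, 2, 4, ...; False for zero and everything else."""
    return n > 0 and (n & (n - 1)) == 0


def run_trials(directory, trials=100, nbytes=3 * (1 << 20) + 17):
    """Return (trial, expected, reported, reported_is_power_of_two) per mismatch."""
    mismatches = []
    for i in range(trials):
        # Hypothetical probe file name; any unused name in the directory works.
        path = os.path.join(directory, "lu338_probe_%d_%d" % (os.getpid(), i))
        try:
            expected, reported = first_stat_sizes(path, nbytes)
            if expected != reported:
                mismatches.append(
                    (i, expected, reported, is_power_of_two(reported))
                )
        finally:
            if os.path.exists(path):
                os.unlink(path)
    return mismatches
```

Calling `run_trials()` with a directory on the affected filesystem would return an empty list when every first stat matches. The default size is deliberately not a power of two, so a power-of-two result stands out in the output.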



 Comments   
Comment by Andreas Dilger [ 17/May/11 ]

I suspect from the description that this is a duplicate of LU-274. That issue already has a patch to fix the problem.

Comment by Peter Jones [ 18/May/11 ]

Niu

Could you please confirm whether this is a duplicate of LU-274?

Thanks

Peter

Comment by Niu Yawei (Inactive) [ 18/May/11 ]

It is probably a duplicate of LU-274, but we need more information to confirm it. I think we can ask the customer to try out the patch in LU-274.

Comment by Christopher Morrone [ 18/May/11 ]

Ah, yes, that does sound like the likely culprit, thanks! You can mark this a duplicate of LU-274.

Comment by Peter Jones [ 24/May/11 ]

Duplicate of LU-274

Comment by Prakash Surya (Inactive) [ 21/Sep/11 ]

We have received a report of this issue happening again since applying
the LU-274 patch. Although that patch seems to have made this issue
occur far less frequently, it appears that it isn't completely
gone. Any suggestions as to where to go from here?

Comment by Niu Yawei (Inactive) [ 21/Sep/11 ]

I'm afraid the problem may be caused by another rare race in size glimpse (LU-287). If possible, could you apply the patch in LU-287 to see whether it resolves your problem completely?

Generated at Sat Feb 10 01:06:02 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.