[LU-631] IO errors when using automounter and Lustre Created: 24/Aug/11 Updated: 09/Jul/13 Resolved: 25/Apr/13 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 1.8.6 |
| Fix Version/s: | Lustre 2.4.0 |
| Type: | Bug | Priority: | Minor |
| Reporter: | Jeremy Filizetti | Assignee: | Hongchao Zhang |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | ptr | ||
| Environment: |
various |
||
| Severity: | 3 |
| Rank (Obsolete): | 7892 |
| Description |
|
Ever since we moved from Lustre 1.6.6 to 1.8 I've seen issues with using Original email thread from lustre-devel: |
| Comments |
| Comment by Peter Jones [ 03/Nov/11 ] |
|
Hongchao Can you please look into this one? Thanks Peter |
| Comment by Hongchao Zhang [ 04/Nov/11 ] |
|
the problem(-EIO) caused by "ls -l /lustre/xen1/tmp/testfile" is in "lov_enqueue", where "lov_prep_enqueue_set" find in "lov_prep_enqueue_set", here, we can wait these OSCs to be connected & activated, but it will need long time if the OST is recovering, |
| Comment by Jeremy Filizetti [ 06/Nov/11 ] |
|
I think the easiest way to make a satisfactory fix (to me) is to make sure that nothing is queued to the OSC before it has been set active so that we don't return -EIO from lov_prep_enqueue_set on operations that might have triggered the mount from the automounter. As for the bug you mention about not accounting for {a,c,m}time and size from all of the OSC if some of them are done should also be fixed. Maybe that should be tracked under a separate bug. |
| Comment by Peter Jones [ 23/Nov/11 ] |
|
Bobi Hongchao is out for a while. Could you please investigate this issue in his absence? Thanks Peter |
| Comment by Hongchao Zhang [ 02/Dec/11 ] |
|
the patch is tracked at http://review.whamcloud.com/#change,2469 |
| Comment by Peter Jones [ 25/Apr/13 ] |
|
Landed for 2.4 |
| Comment by Alexey Lyashkov [ 09/Jul/13 ] |
|
good patch to make MDT hang if someone will add OST which unreachable in config change time. |