[LU-5546] Client getting "revalidate FID" errors, a lot of them Created: 26/Aug/14 Updated: 24/Mar/18 Resolved: 24/Mar/18 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.5.2 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Major |
| Reporter: | Haisong Cai (Inactive) | Assignee: | Emoly Liu |
| Resolution: | Won't Fix | Votes: | 0 |
| Labels: | sdsc | ||
| Environment: |
Lustre server 2.4.2 |
||
| Severity: | 3 |
| Rank (Obsolete): | 15465 |
| Description |
|
Aug 26 07:55:01 tscc-login1 postfix/qmgr[3091]: 80B853C070: removed Aug 26 11:50:13 tscc-login1 kernel: LustreError: 18441:0:(file.c:3077:ll_inode_revalidate_fini()) dolphin: revalidate FID [0x200005416:0x15ef:0x0] error: rc = -116 |
| Comments |
| Comment by Peter Jones [ 27/Aug/14 ] |
|
Emoly Does this seem related to Thanks Peter |
| Comment by Emoly Liu [ 28/Aug/14 ] |
|
I will look into this one. |
| Comment by Emoly Liu [ 29/Aug/14 ] |
|
Hi Cai, |
| Comment by Andreas Dilger [ 29/Aug/14 ] |
|
This message shouldn't be printed onto the console at all for -ESTALE (-116), since this is a common error that can be hit during normal operation. There is some underlying problem that is causing the revalidate to be called repeatedly (400k calls in 20s by the end of the log) that also needs to be fixed. Cai, is there a client that is hung during this time? The question of NFS exports is also important. Also, it would be great if you could get a stack trace for when this is happening (ideally through systemtap, but also possibly via repeatedly trying sysrq-p) to see what the callpath is. It isn't clear why the caller isn't handling -ESTALE correctly and actually moving on to some new FID. |
| Comment by Haisong Cai (Inactive) [ 29/Aug/14 ] |
|
Hi Andreas, The error showed after we rebooted MDS (because of We have since reboot that client as well and it is now error free. thanks, |
| Comment by Peter Jones [ 24/Mar/18 ] |
|
SDSC have moved onto more current releases so I do not think any further work is needed here |