Details
-
Bug
-
Resolution: Cannot Reproduce
-
Major
-
None
-
Lustre 2.10.3, Lustre 2.10.4
-
None
-
clients: 2.10.4 clients, servers: 2.10.3 +
LU-10783(kernel update RHEL7.4)
-
3
-
9223372036854775807
Description
Hello,
Today our users started to report intermittent file access issues on Oak. I noticed the following messages on one client (2.10.4):
Jul 05 14:32:21 sh-ln01.stanford.edu kernel: LustreError: 155141:0:(xattr.c:377:ll_xattr_list()) server bug: replied size 236 > 132 Jul 05 14:32:41 sh-ln01.stanford.edu kernel: LustreError: 171588:0:(xattr.c:377:ll_xattr_list()) server bug: replied size 164 > 132 Jul 05 14:32:41 sh-ln01.stanford.edu kernel: LustreError: 171588:0:(xattr.c:377:ll_xattr_list()) Skipped 5 previous similar messages Jul 05 14:32:47 sh-ln01.stanford.edu kernel: LustreError: 176583:0:(xattr.c:377:ll_xattr_list()) server bug: replied size 172 > 132 Jul 05 14:32:47 sh-ln01.stanford.edu kernel: LustreError: 176583:0:(xattr.c:377:ll_xattr_list()) Skipped 59 previous similar messages Jul 05 14:33:23 sh-ln01.stanford.edu kernel: LustreError: 10776:0:(xattr.c:377:ll_xattr_list()) server bug: replied size 172 > 132 Jul 05 14:33:23 sh-ln01.stanford.edu kernel: LustreError: 10776:0:(xattr.c:377:ll_xattr_list()) Skipped 58 previous similar messages
These errors messages are the only Lustre Error I can see on this impacted client, however they are not very helpful as I'm not even sure it happened on Oak or another Lustre filesystem...
The impacted directories are using ACLs but only a very few, less than 10. We have other directories with >32 ACLs and haven't seen this issue.
The issue doesn't seem to be easily reproducible neither. I'm still investigating.
If you have any ideas on how to troubleshoot this, please let me know.
Thanks!
Stephane
Attachments
Issue Links
- is related to
-
LU-11074 Invalid argument reading file caps
- Resolved