[LU-4522]  ldlm_cli_enqueue and ll_inode_revalidate_fini LustreError messages on 2.4.1 clients Created: 21/Jan/14  Updated: 13/Sep/17  Resolved: 11/Mar/14

Status: Closed
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.4.1
Fix Version/s: Lustre 2.6.0

Type: Bug Priority: Minor
Reporter: Oz Rentas Assignee: Andreas Dilger
Resolution: Fixed Votes: 0
Labels: None

Issue Links:
Related
is related to LU-4705 LustreError: 89827:0:(mdc_locks.c:916... Resolved
Severity: 3
Rank (Obsolete): 12365

 Description   

From time to time, KIT are seeing ldlm_cli_enqueue and ll_inode_revalidate_fini LustreError messages. They mainly appear on login nodes of our clusters.

Here is an example of the error messages:
Dec 17 10:24:28 ic2n988 kernel: [512992.139174] LustreError: 11865:0 (mdc_locks.c:840:mdc_enqueue()) ldlm_cli_enqueue: -13
Dec 17 10:24:28 ic2n988 kernel: [512992.139183] LustreError: 11865:0:(mdc_locks.c:840:mdc_enqueue()) Skipped 2 previous similar messages
Dec 17 10:24:28 ic2n988 kernel: [512992.139202] LustreError: 11865:0:(file.c:2716:ll_inode_revalidate_fini()) pfs2wor1: revalidate FID [0x1d080001:0x86b7421d:0x0] error: rc = -13
Dec 17 10:24:28 ic2n988 kernel: [512992.139208] LustreError: 11865:0:(file.c:2716:ll_inode_revalidate_fini()) Skipped 2 previous similar messages
Dec 17 10:24:29 ic2n988 kernel: [512993.347645] LustreError: 13000:0:(file.c:2716:ll_inode_revalidate_fini()) pfs2wor1: revalidate FID [0x1d080001:0x86b7421d:0x0] error: rc = -13

The Lustre client in this case is at version 2.4.1 plus patch for LU-3645. The servers are also at version 2.4.1.

What do the messages mean, and how can we get rid of these error messages?



 Comments   
Comment by Oleg Drokin [ 21/Jan/14 ]

Could this be that login nodes have some users that are not known to MDS, similar to LU-4084 for example?

Comment by Andreas Dilger [ 24/Jan/14 ]

In Lustre 1.8 we used to return -EIDRM (Identifier Removed) so that it was clear this was an issue with missing users in the MDS user database, rather than some other kind of permission problem.

We should also avoid printing these messages on the console, since they are just a distraction. I pushed http://review.whamcloud.com/8988 to quiet the client error messages for the common -EACCES error message.

Comment by Oz Rentas [ 10/Mar/14 ]

We can go ahead and close this one. Thanks.

From the customer:
The update from Andreas Dilger at 24/Jan/14 5:30 AM provided what we expected from this case: Improving the error message or get rid of unnecessary LustreError messages. Therefore, you can close this case.

Comment by Andreas Dilger [ 11/Mar/14 ]

Patch 8828 was landed to master for 2.6.0.

Generated at Sat Feb 10 01:43:28 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.