[LU-1100] metabench failed on NFSv3/v4 over Lustre Created: 14/Feb/12  Updated: 27/Sep/12  Resolved: 25/May/12

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.1.1
Fix Version/s: Lustre 2.1.2, Lustre 1.8.8

Type: Bug
Priority: Minor
Reporter: Jian Yu
Assignee: WC Triage
Resolution: Duplicate
Votes: 0
Labels: None
Environment:

Lustre Tag: v2_1_1_0_RC2
Lustre Build: http://build.whamcloud.com/job/lustre-b2_1/41/
Distro/Arch: RHEL6/x86_64 (kernel version: 2.6.32-220.el6)
Network: IB (in-kernel OFED)
ENABLE_QUOTA=yes


Issue Links:
Duplicate
duplicates LU-1191 Test failure on test suite parallel-s... Resolved
Severity: 3
Rank (Obsolete): 5423

 Description   

Lustre Client/NFS Server: fat-intel-2-ib
NFS Clients: client-1-ib,client-4-ib

The metabench test failed on NFSv3/v4 clients as follows:

[02/13/2012 22:19:44] Entering par_create_multidir to create 4343 files in 1 dirs
Removed 1095 files in     25.466 seconds
[02/13/2012 22:20:09] FATAL error on process 7
Proc 7: Cant remove directory [/mnt/lustre/d0.metabench/TIME_CREATE_007.000]: Directory not empty
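
For illustration only, the failing pattern (create files in a directory, unlink every entry that readdir returns, then rmdir) can be sketched as below. This is not metabench's code. The function name `create_remove_rmdir` and the file naming are hypothetical. If the directory listing seen by the client is incomplete, the final rmdir would be expected to fail with ENOTEMPTY, which matches the "Directory not empty" message above.

```python
import errno
import os
import tempfile


def create_remove_rmdir(parent, nfiles):
    """Create nfiles empty files in a fresh directory under parent,
    unlink every entry the directory listing returns, then rmdir.

    Returns the sorted names still visible if rmdir fails with
    ENOTEMPTY, or an empty list if the directory was removed.
    """
    # Hypothetical directory prefix, chosen to resemble the names in the log.
    workdir = tempfile.mkdtemp(prefix="TIME_CREATE_", dir=parent)

    for i in range(nfiles):
        open(os.path.join(workdir, "f%06d" % i), "w").close()

    # Only entries returned by the listing get removed; a truncated
    # listing leaves files behind.
    for name in os.listdir(workdir):
        os.unlink(os.path.join(workdir, name))

    try:
        os.rmdir(workdir)
    except OSError as e:
        if e.errno != errno.ENOTEMPTY:
            raise
        return sorted(os.listdir(workdir))
    return []
```

Run against a directory on the NFS mount (for example `create_remove_rmdir("/mnt/lustre", 4343)`), a non-empty return value would indicate that some files were never returned by the listing.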

dmesg on NFS client client-4-ib showed:

NFS: directory d0.metabench/TIME_CREATE_007.000 contains a readdir loop.  Please contact your server vendor.  Offending cookie: 304394735309107493
NFS: directory d0.metabench/TIME_CREATE_007.001 contains a readdir loop.  Please contact your server vendor.  Offending cookie: 309878210750602462
NFS: directory d0.metabench/TIME_CREATE_007.000 contains a readdir loop.  Please contact your server vendor.  Offending cookie: 73053889
NFS: directory d0.metabench/TIME_CREATE_007.000 contains a readdir loop.  Please contact your server vendor.  Offending cookie: 73053889
NFS: directory d0.metabench/TIME_CREATE_007.000 contains a readdir loop.  Please contact your server vendor.  Offending cookie: 76752781
NFS: directory d0.metabench/TIME_CREATE_007.000 contains a readdir loop.  Please contact your server vendor.  Offending cookie: 76752781
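
When comparing runs, it can help to count how often each directory and cookie pair is reported. The following is a minimal sketch that assumes the kernel message format shown above. The names `LOOP_RE` and `summarize_readdir_loops` are hypothetical and not part of any Lustre or NFS tooling.

```python
import re
from collections import Counter

# Matches the client-side kernel message quoted in this report.
# \s+ tolerates both the one-space and two-space variants after a period.
LOOP_RE = re.compile(
    r"NFS: directory (?P<dir>\S+) contains a readdir loop\.\s+"
    r"Please contact your server vendor\.\s+"
    r"Offending cookie: (?P<cookie>\d+)"
)


def summarize_readdir_loops(lines):
    """Count readdir-loop reports per (directory, cookie) pair."""
    counts = Counter()
    for line in lines:
        m = LOOP_RE.search(line)
        if m:
            counts[(m.group("dir"), int(m.group("cookie")))] += 1
    return counts


if __name__ == "__main__":
    import sys

    # Example: dmesg | python this_script.py
    for (directory, cookie), n in sorted(summarize_readdir_loops(sys.stdin).items()):
        print(f"{directory} cookie={cookie} count={n}")
```

Repeated reports of the same cookie for one directory, as with 73053889 and 76752781 above, show up as counts greater than one.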

Maloo report: https://maloo.whamcloud.com/test_sets/3586a652-56d5-11e1-8ba4-5254004bbbd3

The issue is related to CentOS 6.2 with kernel 2.6.32-220.el6:
https://www.centos.org/modules/newbb/viewtopic.php?topic_id=35387&forum=55&post_id=152924#forumpost152924



 Comments   
Comment by Peter Jones [ 14/Feb/12 ]

Lai

Could you please look into this one?

Thanks

Peter

Comment by Peter Jones [ 14/Feb/12 ]

As this is an NFS issue rather than a Lustre issue, dropping as a blocker

Comment by Sarah Liu [ 15/Feb/12 ]

Lustre 2.1.55 also hits this issue on NFSv3/v4.

Lustre: DEBUG MARKER: == parallel-scale test metabench: metabench == 17:26:11 (1329355571)
NFS: directory d0.metabench/TIME_CREATE_003.000 contains a readdir loop. Please contact your server vendor. Offending cookie: 233958805
NFS: directory d0.metabench/TIME_CREATE_003.000 contains a readdir loop. Please contact your server vendor. Offending cookie: 233958805
Lustre: DEBUG MARKER: parallel-scale test_metabench: @@@@@@ FAIL: metabench failed! 1

Comment by Li Wei (Inactive) [ 24/May/12 ]

https://maloo.whamcloud.com/test_sets/fbe67f94-a599-11e1-8432-52540035b04c

Comment by Jian Yu [ 25/May/12 ]

This is a duplicate of LU-1191, which is fixed by the kernel update tracked under LU-1424 and landed for 1.8.8, 2.1.2 and 2.3.

Comment by Li Wei (Inactive) [ 25/May/12 ]

Yu Jian, thanks for the info.
