[LU-5524] parallel-scale-nfsv3: FAIL: setup nfs failed! Created: 20/Aug/14 Updated: 28/Aug/14 Resolved: 28/Aug/14 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.7.0, Lustre 2.5.3 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Blocker |
| Reporter: | Jian Yu | Assignee: | WC Triage |
| Resolution: | Cannot Reproduce | Votes: | 0 |
| Labels: | None | ||
| Environment: |
Lustre Build: https://build.hpdd.intel.com/job/lustre-b2_5/82/ |
||
| Severity: | 3 |
| Rank (Obsolete): | 15382 |
| Description |
|
parallel-scale-nfsv3 test failed as follows: CMD: shadow-40vm7 service nfs restart shadow-40vm7: Cannot register service: RPC: Unable to receive; errno = Connection refused shadow-40vm7: rpc.rquotad: unable to register (RQUOTAPROG, RQUOTAVERS, udp). shadow-40vm7: rpc.nfsd: writing fd to kernel failed: errno 5 (Input/output error) shadow-40vm7: rpc.nfsd: writing fd to kernel failed: errno 5 (Input/output error) shadow-40vm7: rpc.nfsd: unable to set any sockets for nfsd Shutting down NFS daemon: [ OK ] Shutting down NFS mountd: [ OK ] Shutting down NFS quotas: [ OK ] Shutting down RPC idmapd: [ OK ] Starting NFS services: [ OK ] Starting NFS quotas: [FAILED] Starting NFS mountd: [FAILED] Starting NFS daemon: [FAILED] CMD: shadow-40vm1,shadow-40vm2.shadow.whamcloud.com chkconfig --list rpcidmapd 2>/dev/null | grep -q rpcidmapd && service rpcidmapd restart || true CMD: shadow-40vm7 exportfs -o rw,async,no_root_squash *:/mnt/lustre && exportfs -v /mnt/lustre <world>(rw,async,wdelay,no_root_squash,no_subtree_check) Mounting NFS clients (version 3)... CMD: shadow-40vm1,shadow-40vm2.shadow.whamcloud.com mkdir -p /mnt/lustre CMD: shadow-40vm1,shadow-40vm2.shadow.whamcloud.com mount -t nfs -o nfsvers=3,async shadow-40vm7:/mnt/lustre /mnt/lustre shadow-40vm1: mount.nfs: Connection timed out shadow-40vm2: mount.nfs: Connection timed out parallel-scale-nfsv3 : @@@@@@ FAIL: setup nfs failed! parallel-scale-nfsv4 hit the same failure. Maloo reports: |
| Comments |
| Comment by Jian Yu [ 20/Aug/14 ] |
|
Lustre client build: https://build.hpdd.intel.com/job/lustre-b2_4/73/ (2.4.3) The same failure occurred: Is this related to the change of http://review.whamcloud.com/11246 ? |
| Comment by Jian Yu [ 21/Aug/14 ] |
|
This is blocking parallel-scale-nfsv{3,4} testing on Lustre b2_5 branch: |
| Comment by Oleg Drokin [ 22/Aug/14 ] |
|
I guess kernel update to rhel might have changed something without us noticing and broke nfs. |
| Comment by Jian Yu [ 23/Aug/14 ] |
|
Here are some instances occurred in the recent month on master branch: And here are all of the instances against parallel-scale-nfsv3 on master branch: The kernel update on RHEL6.5 is not the cause. |
| Comment by Jian Yu [ 23/Aug/14 ] |
|
By looking into the test sessions on Lustre b2_5 build #83, I found that only SLES11SP3 client + RHEL6.5 server test session hit this issue: Other test sessions did not hit this issue. Maybe this is a sporadic test environment issue? Let's wait for the test results of Lustre b2_5 build $84. |
| Comment by Jian Yu [ 24/Aug/14 ] |
|
For Lustre b2_5 build $84, it's also only the SLES11SP3 client + RHEL6.5 server test session hit this issue: |
| Comment by Jian Yu [ 28/Aug/14 ] |
|
The issue did not occur on Lustre b2_5 build #85. It seems it's a sporadic test environment issue. Let's close this ticket now. If it occurs again, please reopen this ticket. |