[LU-4592] mdt_reint_open()) @@@ OPEN & CREAT not in open replay Created: 05/Feb/14 Updated: 23/Jan/16 Resolved: 23/Jan/16 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.1.5 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Major |
| Reporter: | Oz Rentas | Assignee: | Hongchao Zhang |
| Resolution: | Done | Votes: | 0 |
| Labels: | None | ||
| Environment: |
Lustre 2.1.5 servers, LLNL Chaos clients |
||
| Attachments: |
|
| Severity: | 2 |
| Rank (Obsolete): | 12545 |
| Description |
|
As part of preparation testing, the customer performed a failover test: they rebooted the primary MDS to confirm that the standby MDS would take over without interrupting the job. The job died when the client was unable to open a file. 3 files attached. Error reported on MDS01 - ERRORs on client - |
| Comments |
| Comment by Peter Jones [ 06/Feb/14 ] |
|
Hongchao Could you please advise on this one? Thanks Peter |
| Comment by Hongchao Zhang [ 07/Feb/14 ] |
|
Hi Oz,
do you mount Lustre with ACL enabled and with "identity_upcall" disabled?

Jan 31 17:17:55 l1-mds01 kernel: : Lustre: 22128:0:(mdt_lproc.c:414:lprocfs_wr_identity_upcall()) l1-MDT0000: disable "identity_upcall" with ACL enabled maybe cause unexpected "EACCESS"
Jan 31 17:17:55 l1-mds01 kernel: : Lustre: 22128:0:(mdt_lproc.c:416:lprocfs_wr_identity_upcall()) l1-MDT0000: identity upcall set to NONE

The problem in the job is just -EACCES:

...
Rank 26 Host mu0104.localdomain FATAL ERROR 1391213863: Unable to open file /lustre/lscratch1/atorrez/out.1391213561.26 for read. (errno=Permission denied) (MPI_Error = 42)
Rank 28 Host mu0104.localdomain FATAL ERROR 1391213863: Unable to open file /lustre/lscratch1/atorrez/out.1391213561.28 for read. (errno=Permission denied) (MPI_Error = 42)
Rank 29 Host mu0104.localdomain FATAL ERROR 1391213863: Unable to open file /lustre/lscratch1/atorrez/out.1391213561.29 for read. (errno=Permission denied) (MPI_Error = 42)
Rank 47 Host mu0104.localdomain FATAL ERROR 1391213863: Unable to open file /lustre/lscratch1/atorrez/out.1391213561.47 for read. (errno=Permission denied) (MPI_Error = 42)
...

Could you please test it without ACL to check whether that is the cause? Thanks! |
| Comment by Oz Rentas [ 14/Feb/14 ] |
|
The customer reports they are not mounting with ACL support, as seen here:

Any other suggestions on where we can look?

Side note: on my system I was able to duplicate the error they received by setting identity_upcall to NONE and mounting with ACL:

[root@es0 ~]# mount |grep mdt |
| Comment by Oz Rentas [ 18/Feb/14 ] |
|
Any updates on this one? |
| Comment by Hongchao Zhang [ 19/Feb/14 ] |
|
Sorry for the delayed response. From the code, this debug line is only printed when mounted with ACL:

static int lprocfs_wr_identity_upcall(struct file *file, const char *buffer,
                                      unsigned long count, void *data)
{
        ...
        /* <---- here, "mo_acl is 1" */
        if (strcmp(hash->uc_upcall, "NONE") == 0 && mdt->mdt_opts.mo_acl)
                CWARN("%s: disable \"identity_upcall\" with ACL enabled maybe "
                      "cause unexpected \"EACCESS\"\n", mdt_obd_name(mdt));
        ...
}

Is it possible that mds00 mounts without ACL but mds01 with it? Thanks! |
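The condition quoted above can be isolated in a small standalone sketch. This is not the Lustre code: `struct mdt_state` and `upcall_acl_conflict()` are hypothetical names introduced only for illustration, and the `acl_enabled` field merely stands in for `mdt->mdt_opts.mo_acl`.

```c
#include <assert.h>
#include <stdbool.h>
#include <string.h>

/* Hypothetical stand-in for the relevant MDT state; NOT the real Lustre structs. */
struct mdt_state {
        const char *upcall;   /* identity upcall path, or the literal string "NONE" */
        bool acl_enabled;     /* plays the role of mdt->mdt_opts.mo_acl */
};

/*
 * Returns true when the combination that triggers the CWARN in
 * lprocfs_wr_identity_upcall() is present: the upcall is set to "NONE"
 * while ACL support is enabled on the MDT.
 */
static bool upcall_acl_conflict(const struct mdt_state *s)
{
        assert(s != NULL && s->upcall != NULL);
        return strcmp(s->upcall, "NONE") == 0 && s->acl_enabled;
}
```

With this sketch, `{ "NONE", true }` reports a conflict, while `{ "NONE", false }` and any state with a real upcall path do not. That is why seeing the warning in the MDS log implies that ACL was enabled on that MDT at the time.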
| Comment by Bobbie Lind (Inactive) [ 26/Mar/14 ] |
|
After being onsite with the customer, I can confirm that, according to the output of the mount command, the system appears NOT to be mounted with ACLs:

/dev/mapper/vg_l1-mdt on /lustre/l1/mdt type lustre (rw)

Re-asking Oz's question: is there another place where it might show as being mounted with ACLs that I can check the next time I'm onsite? |
| Comment by Hongchao Zhang [ 11/Apr/14 ] |
|
Currently, the mount options are not printed when showing the mount info if the mount type is "lustre" (they are shown when mounting with the "ldiskfs" type). The default mount options could contain "ACL" (it's the case on my local node, RHEL6.5/x86_64), |
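To make the point about option strings concrete, here is a minimal sketch of a check for an "acl" entry in an option list. It assumes the options are already available as a string, for example the "(rw)" printed by mount or a default-options list read from the ldiskfs device; how that string is obtained is outside the sketch, and `opts_contain_acl()` is a hypothetical helper, not part of Lustre.

```c
#include <assert.h>
#include <stdbool.h>
#include <string.h>

/*
 * Returns true if the option list contains the exact token "acl".
 * Tokens may be separated by commas, spaces, tabs, or newlines, so both
 * "rw,acl" and "user_xattr acl" are accepted. "noacl" does not match.
 */
static bool opts_contain_acl(const char *opts)
{
        assert(opts != NULL);
        const char *delims = " \t,\n";
        const char *p = opts;

        while (*p != '\0') {
                p += strspn(p, delims);            /* skip separators */
                size_t len = strcspn(p, delims);   /* length of the next token */
                if (len == 3 && strncmp(p, "acl", 3) == 0)
                        return true;
                p += len;
        }
        return false;
}
```

Applied to the "rw" string that mount reported onsite, the check returns false, which matches the observation in this ticket: the visible mount line alone cannot rule out ACL being enabled through the default mount options.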
| Comment by Hongchao Zhang [ 22/Jan/16 ] |
|
Hi Oz, |
| Comment by Oz Rentas [ 22/Jan/16 ] |
|
yes, it can be closed. thanks. |
| Comment by John Fuchs-Chesney (Inactive) [ 23/Jan/16 ] |
|
Thanks Oz and Hongchao. ~ jfc. |