
permission denied over NFS (intermittent)

Details

    • Type: Bug
    • Resolution: Fixed
    • Priority: Major
    • Fix Version/s: Lustre 2.8.0
    • Affects Version/s: Lustre 2.5.3
    • Labels: None
    • Severity: 3

    Description

      (This might be related to LU-5730 but as the symptoms are different, I'm opening a new ticket.)

      Our users are

      Attachments

        1. LU-6528-reproducer1.sh
          0.6 kB
        2. dk-dump-client.bz2
          191 kB
        3. dk-dump-mds.bz2
          2.44 MB
        4. dk-dump-client-2015-05-07.bz2
          1.40 MB
        5. dk-dump-mds-2015-05-07.bz2
          1.27 MB
        6. messages-client.bz2
          964 kB
        7. LU-6528_debug.diff
          0.4 kB
        8. dk-dump-client-2015-05-11.bz2
          1.46 MB
        9. dk-dump-mds-2015-05-11.bz2
          1.62 MB
        10. LU-6528.diff
          4 kB

          Activity

            [LU-6528] permission denied over NFS (intermittent)
            pjones Peter Jones added a comment -

            Landed for 2.8

            gerrit Gerrit Updater added a comment -

            Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/14978/
            Subject: LU-6528 mdt: keep FS capability for intent_getattr
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: a55aebd70e9073ef1d1bd7de280c8393af0ea30d

            gerrit Gerrit Updater added a comment -

            Lai Siyao (lai.siyao@intel.com) uploaded a new patch: http://review.whamcloud.com/14978
            Subject: LU-6528 mdt: keep FS capability for intent_getattr
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: b288a18408a59cf3ac8e3d9bd3ed074547689c57

            ferner Frederik Ferner (Inactive) added a comment -

            FWIW, I've been running our reproducer for nearly 24h after applying the patch. So far we have not seen the permission denied issue at all, so it looks good AFAICT.

            laisiyao Lai Siyao added a comment -

            Once it works (I'll wait until tomorrow), I'll put it on gerrit.

            ferner Frederik Ferner (Inactive) added a comment -

            I've had the reproducer running overnight and so far it hasn't triggered the issue, so it's looking good. I'll leave the reproducer running...

            I noticed the patch had been attached directly to the ticket and not uploaded to review.whamcloud.com as I would have expected. Does this mean it's not yet a good idea to consider applying it to production systems?

            ferner Frederik Ferner (Inactive) added a comment -

            I've applied the proposed patch on the MDS and I'm now running the tests... I guess we'll see tomorrow, though I'll keep running those tests continuously for at least a day...

            laisiyao Lai Siyao added a comment - edited

            I just uploaded a possible patch for this, LU-6528.diff. Could you apply it on the MDS and test again?

            laisiyao Lai Siyao added a comment -

            This patch keeps the FS capability for intent getattr. NFSD raises the FS capability so that exportfs_decode_fh() can reconnect the dentry into the dcache; if the MDS drops this capability, the reconnect path may fail with -EACCES.
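
To make the failure mode described above concrete, here is a minimal toy model of a directory search-permission check. It is only a sketch and is not Lustre or NFSD code: the function `may_search`, the directory mode, and the UIDs are hypothetical, and group permission bits are ignored. The two capability numbers are the real values from `<linux/capability.h>`. The point it illustrates is that a caller who lacks `x` permission on a parent directory is let through only while the capability is still present in the set seen by the check.

```python
import errno

# Real Linux capability numbers from <linux/capability.h>.
CAP_DAC_OVERRIDE = 1
CAP_DAC_READ_SEARCH = 2


def may_search(dir_mode, dir_uid, caller_uid, caps):
    """Toy model of a directory search (x) permission check.

    Returns 0 when the lookup is allowed and -EACCES otherwise.
    Group bits are ignored to keep the sketch short.
    """
    if caller_uid == dir_uid:
        allowed = bool(dir_mode & 0o100)  # owner execute/search bit
    else:
        allowed = bool(dir_mode & 0o001)  # "other" execute/search bit
    if allowed:
        return 0
    # Either capability lets the caller search a directory despite the mode bits.
    if caps & {CAP_DAC_OVERRIDE, CAP_DAC_READ_SEARCH}:
        return 0
    return -errno.EACCES


# Hypothetical scenario: a parent directory that only its owner may enter.
parent_mode, parent_uid = 0o700, 1000
nfs_caller_uid = 2000

# Capabilities as raised on the NFS server side of the request (assumed set).
raised = {CAP_DAC_OVERRIDE, CAP_DAC_READ_SEARCH}
# Capabilities as seen by the check if they were dropped along the way.
dropped = set()

with_caps = may_search(parent_mode, parent_uid, nfs_caller_uid, raised)
without_caps = may_search(parent_mode, parent_uid, nfs_caller_uid, dropped)
print(with_caps, without_caps)
```

In this model the same lookup succeeds with the raised set and returns -EACCES with the empty set, which mirrors the intermittent "permission denied" symptom when the reconnect path has to walk such a parent directory.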

            ferner Frederik Ferner (Inactive) added a comment - edited

            The NFS export (on a RHEL6 machine) currently contains neither "no_subtree_check" nor "subtree_check", so by my reading of the documentation no_subtree_check is in force, yes. (This is also confirmed by running exportfs -v.)

            I can try (at least on our test system) to re-export with explicit subtree_check and re-test.

            Cheers,
            Frederik
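
The reasoning about which subtree-check mode is in force can be sketched as a small helper that inspects an export option string. This is an illustration only: `subtree_check_enabled` is a hypothetical function, the option strings are made up rather than taken from the affected system, and the fallback of no_subtree_check when neither option is listed is an assumption about the installed nfs-utils, consistent with the reading of the documentation given in the comment above.

```python
def subtree_check_enabled(options, default=False):
    """Return True if subtree checking is in effect for an export option string.

    `default` models the behaviour when neither option is listed; False
    (no_subtree_check) is an assumption about the installed nfs-utils.
    """
    enabled = default
    for opt in options.split(","):
        opt = opt.strip()
        if opt == "subtree_check":
            enabled = True
        elif opt == "no_subtree_check":
            enabled = False
    return enabled


# Hypothetical option strings, as they might appear in /etc/exports.
implicit = subtree_check_enabled("rw,sync")
explicit = subtree_check_enabled("rw,sync,subtree_check")
print(implicit, explicit)
```

The output of `exportfs -v` remains the authoritative check, since it lists the options the server actually applied.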

            People

              Assignee: laisiyao Lai Siyao
              Reporter: ferner Frederik Ferner (Inactive)
              Votes: 0
              Watchers: 9
