Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-11926

Lost lease lock on migrate error

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Minor
    • Lustre 2.13.0
    • None
    • None
    • 3
    • 9223372036854775807

    Description

         All the file operations have the following locking order - parent,
         child. If a lock for a child is returned to the client, the following
         operations on this file are done by the child fid.
          
         However, the migrate is an exception - it takes the lease lock first and
         takes the PW parent lock next during the MDS_REINT.
          
         At the same time, if there is a parallel racing operation (open) which
         has taken a lock on parent (conflicting with the next MDS_REINT) and
         is trying to take a lock on child - it is blocked until
         the lease cancel comes.
          
         The lease cancel is piggy-backed on the MDS_REINT RPC and is handled
         at the end of the operation, trying to take the conflicting parent lock
         first - thus a deadlock occurs.

      Attachments

        Activity

          People

            askulysh Andriy Skulysh
            askulysh Andriy Skulysh
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: