Lustre / LU-17565

migrate racing with other operations can corrupt the filesystem


    • Type: Bug
    • Resolution: Unresolved
    • Priority: Minor

      mdt_reint_migrate() looks up names, then takes LDLM locks, but does not recheck the names once the locks are held. Another operation, such as unlink, can race with it and modify the directories inside this lookup-to-locking window. mdt_reint_migrate() then generates a distributed transaction that fails on a remote node (because the name is already gone), and the transaction is not rolled back properly.
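
The lookup-to-locking window described above, and the usual "recheck under lock" remedy, can be sketched with a toy model. This is not Lustre code and not a proposed patch: `toy_dir`, `toy_migrate()`, `toy_unlink_victim()` and the other `toy_*` names are hypothetical, a pthread mutex stands in for an LDLM lock, and the `hook` parameter exists only to replay the race deterministically in a single thread.

```c
#include <assert.h>
#include <errno.h>
#include <pthread.h>
#include <stdbool.h>
#include <string.h>

#define TOY_MAX_NAMES 8
#define TOY_NAME_LEN  32

/* Toy directory: one lock plus a flat name table (hypothetical type). */
struct toy_dir {
    pthread_mutex_t lock;
    bool used[TOY_MAX_NAMES];
    char names[TOY_MAX_NAMES][TOY_NAME_LEN];
};

static void toy_dir_init(struct toy_dir *d)
{
    memset(d, 0, sizeof(*d));
    pthread_mutex_init(&d->lock, NULL);
}

/* Return the slot holding name, or -1. Takes no lock itself. */
static int toy_find(const struct toy_dir *d, const char *name)
{
    for (int i = 0; i < TOY_MAX_NAMES; i++)
        if (d->used[i] && strcmp(d->names[i], name) == 0)
            return i;
    return -1;
}

/* Insert name; the caller must hold d->lock. */
static int toy_insert_nolock(struct toy_dir *d, const char *name)
{
    if (strlen(name) >= TOY_NAME_LEN)
        return -ENAMETOOLONG;
    if (toy_find(d, name) >= 0)
        return -EEXIST;
    for (int i = 0; i < TOY_MAX_NAMES; i++) {
        if (!d->used[i]) {
            strcpy(d->names[i], name);
            d->used[i] = true;
            return 0;
        }
    }
    return -ENOSPC;
}

static int toy_add(struct toy_dir *d, const char *name)
{
    pthread_mutex_lock(&d->lock);
    int rc = toy_insert_nolock(d, name);
    pthread_mutex_unlock(&d->lock);
    return rc;
}

static int toy_unlink(struct toy_dir *d, const char *name)
{
    pthread_mutex_lock(&d->lock);
    int idx = toy_find(d, name);
    if (idx >= 0)
        d->used[idx] = false;
    pthread_mutex_unlock(&d->lock);
    return idx >= 0 ? 0 : -ENOENT;
}

/* Test hook that plays the role of a racing unlink. */
typedef void (*toy_race_hook)(struct toy_dir *d);

static void toy_unlink_victim(struct toy_dir *d)
{
    (void)toy_unlink(d, "victim");
}

/* Move name from src to dst, rechecking the name once locks are held. */
static int toy_migrate(struct toy_dir *src, struct toy_dir *dst,
                       const char *name, toy_race_hook hook)
{
    int rc, idx;

    assert(src != dst);
    /* 1. Unlocked lookup, mirroring the window described in the report. */
    if (toy_find(src, name) < 0)
        return -ENOENT;
    /* 2. Another operation may run here, before any lock is taken. */
    if (hook)
        hook(src);
    /* 3. Take both locks (real code needs a consistent lock order). */
    pthread_mutex_lock(&src->lock);
    pthread_mutex_lock(&dst->lock);
    /* 4. Recheck under the locks; bail out before changing anything. */
    idx = toy_find(src, name);
    if (idx < 0) {
        rc = -ENOENT;
        goto out;
    }
    rc = toy_insert_nolock(dst, name);
    if (rc == 0)
        src->used[idx] = false;
out:
    pthread_mutex_unlock(&dst->lock);
    pthread_mutex_unlock(&src->lock);
    return rc;
}
```

Calling `toy_migrate(&src, &dst, "victim", toy_unlink_victim)` after adding "victim" to `src` returns `-ENOENT` and leaves `dst` untouched, because the recheck in step 4 notices the name vanished before any update was made. Without step 4, the model would go on to modify `dst` for a name that no longer exists in `src`, which is the kind of half-applied update the report describes.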

            Assignee: WC Triage (wc-triage)
            Reporter: Alex Zhuravlev (bzzz)