Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-10911 FLR2: Read only erasure coding
  3. LU-19782

FLR-EC: Do no allow writes to EC files if any OSTs are offline

    XMLWordPrintable

Details

    • Technical task
    • Resolution: Unresolved
    • Medium
    • None
    • None
    • 3
    • 9223372036854775807

    Description

      When we write to a EC protected file the EC mirror will become stale.

      If an OST is missing we can still recover the data IFF the EC mirror is not stale. I.e. via degraded read.

      But if an OST is already missing, then if we allow writing to any of the other OSTs in the data mirror, we would now cause the parities to become stale and can no longer recover using degraded read. I.e. writing to an OST while the raid set is degraded will cause data loss.

      One way to partially address this is to not allow writes to a file if/when the client knows OSTs are offline and fail the write with -EROFS.

      An OST might only be temporarily off-line, during OSS restart or similar, so when this trigger, instead of failing a write immediately we might want to delay the write for some set amount of time, in case the OST becomes on-line again before failing it and returning an error to the application.

      We might also want a configuration setting where we can control this behavior.

      Attachments

        Activity

          People

            wc-triage WC Triage
            rsahlberg@whamcloud.com Ronnie Sahlberg
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated: