Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-6631

Retry LDLM_CANCEL in ldlm_cli_cancel_req

    XMLWordPrintable

Details

    • Bug
    • Resolution: Not a Bug
    • Major
    • None
    • Lustre 2.6.0, Lustre 2.7.0
    • 3
    • 9223372036854775807

    Description

      ldlm_cli_cancel_req immediately returns if it gets -EWOULDBLOCK(-EAGAIN) from a LDLM_CANCEL request it tried to send, and the client is being evicted before long.

      Some network errors can be recovered in many cases then we can avoid eviction. (eviction is really annoying in suppress_ping environment. so I want to avoid it as far as I can)

      that's why I believe ldlm_cli_cancel_req should retry LDLM_CANCEL here.

      By the way, the patch I'll upload has worked well in Luster-1.8.8 base FEFS for a very long time and the same in Lustre-2.6 based FEFS. I haven't seen any serious issues so far. That's why I'm convinced it will work in Lustre-2.x too.

      Attachments

        Activity

          People

            wc-triage WC Triage
            nozaki Hiroya Nozaki (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: