Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-3645

Interop 2.1.5 <--> 2.4 Write operations during failover errors out instead of stalling

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Major
    • None
    • Lustre 2.4.0, Lustre 2.1.5
    • 3
    • 9382

    Description

      During acceptance testing for KIT, they tried OSS failover while running several applications. And applications got IO errors (can't create file and similar messages). This should not happen and IO should just stall till failover happens.
      The clients were running 2.4 and servers were 2.1.5. We tried with 2.1.5 clients and did not see this issue. I have attached the client and server logs.

      Attachments

        1. client_lctl_dk_20130911.tgz
          18 kB
          Kit Westneat
        2. client_messages_20130911.tgz
          228 kB
          Kit Westneat
        3. ll10987.out.gz
          0.2 kB
          Kit Westneat
        4. LU-XXXX.tgz
          463 kB
          Girish Shilamkar
        5. mds1.llog.gz
          224 kB
          Kit Westneat
        6. mds2.llog.gz
          0.2 kB
          Kit Westneat
        7. server_lctl_dk_20130911.tgz
          392 kB
          Kit Westneat
        8. ucbn003.localdomain.llog.gz
          0.2 kB
          Kit Westneat

        Activity

          People

            hongchao.zhang Hongchao Zhang
            gshilamkar Girish Shilamkar (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            10 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: