  1. Lustre
  2. LU-12754

All OSTs lost contact with OSS after disk failures

Details

    • Question/Request
    • Resolution: Unresolved
    • Critical
    • None
    • Lustre 2.7.0
    • None
    • CentOS 7.2

    Description

      I am writing to ask whether you can help our group with an emergency on our Lustre system. The system runs CentOS 7.2 with Lustre 2.7.

      There are two OSSs (oss1 and oss2) and two MDSs (mds1 and mds2), each pair running as failover servers. After disk errors were reported on both oss1 and oss2, I rebooted them, and both lost contact with all OSTs.

      I'd like to ask your advice on how to recover. We have over 400 TB of data and desperately need it back.

      Attachments

        1. dmesg.save
          777 kB
        2. messages
          9.15 MB

          People

            wc-triage WC Triage
            huan.wang Helen Wang (Inactive)
            Votes: 0
            Watchers: 3