Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-11800

MDT stuck during recovery

    XMLWordPrintable

Details

    • Bug
    • Resolution: Cannot Reproduce
    • Minor
    • None
    • Lustre 2.12.0
    • None
    • soak with 2.12-RC2 ib build #173 EL7.6
    • 3
    • 9223372036854775807

    Description

      After soak runs about 2 days, 1 MDS stuck during recovery

      MDS

      Dec 16 13:45:36 soak-10 kernel: LustreError: 12500:0:(llog_osd.c:988:llog_osd_next_block()) soaked-MDT0001-osp-MDT0002: invalid llog tail at log id [0x54cf:0x400069cc:0x2]:0 offset 16121856 bytes 32768
      Dec 16 13:45:36 soak-10 kernel: LustreError: 12500:0:(lod_dev.c:428:lod_sub_recovery_thread()) soaked-MDT0001-osp-MDT0002 get update log failed: rc = -22
      Dec 16 13:45:37 soak-10 multipathd: 360080e50001fedb80000015952012962: sdi - rdac checker reports path is ghost
      Dec 16 13:45:37 soak-10 kernel: device-mapper: multipath: Reinstating path 8:128.
      Dec 16 13:45:37 soak-10 multipathd: 8:128: reinstated
      Dec 16 13:45:37 soak-10 multipathd: 360080e50001fedb80000015952012962: queue_if_no_path enabled
      Dec 16 13:45:37 soak-10 multipathd: 360080e50001fedb80000015952012962: Recovered to normal mode
      Dec 16 13:45:37 soak-10 multipathd: 360080e50001fedb80000015952012962: remaining active paths: 1
      Dec 16 13:45:37 soak-10 kernel: device-mapper: multipath: Failing path 8:128.
      Dec 16 13:45:37 soak-10 multipathd: sdi: mark as failed
      Dec 16 13:45:37 soak-10 multipathd: 360080e50001fedb80000015952012962: Entering recovery mode: max_retries=300
      Dec 16 13:45:37 soak-10 multipathd: 360080e50001fedb80000015952012962: remaining active paths: 0
      Dec 16 13:45:37 soak-10 multipathd: 360080e50001fedb80000015952012962: Entering recovery mode: max_retries=300
      Dec 16 13:45:37 soak-10 kernel: LustreError: 12498:0:(lod_dev.c:428:lod_sub_recovery_thread()) soaked-MDT0002-osd get update log failed: rc = -108
      Dec 16 13:45:37 soak-10 kernel: LustreError: 12498:0:(lod_dev.c:428:lod_sub_recovery_thread()) Skipped 2 previous similar messages
      

      Attachments

        Issue Links

          Activity

            People

              wc-triage WC Triage
              sarah Sarah Liu
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: