Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-10419

LFSCK fails to start, hangs systems.

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: Lustre 2.11.0, Lustre 2.10.2, Lustre 2.10.3
    • Fix Version/s: Lustre 2.12.0, Lustre 2.10.5
    • Labels:
    • Environment:
      Soak performance cluster - Lustre version=2.10.2_4_gb151f34
    • Severity:
      3
    • Rank (Obsolete):
      9223372036854775807

      Description

      We do OSS failover, trigger LFSCK:

      
      

      lctl lfsck_start -M soaked-MDT0000 -s 1000 -t all -A{code]

      The lfsck start hangs, lfsck is not started, the clients wedge in state 'comp' the entire system wedges. I have dumped Lustre Logs from all MDS, attached. I have crash-dumped all the MDT nodes and the dumps are available on Spirit. lfsck_layout is unkillable.

        Attachments

        1. soak-10.lustre.log.gz
          2.57 MB
          Cliff White
        2. soak-11.lustre.log.gz
          2.22 MB
          Cliff White
        3. soak-8.lustre.log.gz
          2.14 MB
          Cliff White
        4. soak-9.lustre.log.gz
          2.33 MB
          Cliff White

          Issue Links

            Activity

              People

              • Assignee:
                yong.fan nasf (Inactive)
                Reporter:
                cliffw Cliff White (Inactive)
              • Votes:
                0 Vote for this issue
                Watchers:
                6 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: