Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-9196

MDS server for Atlas file system crashed due to memory exhaustion.

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Minor
    • None
    • Lustre 2.8.0
    • None
    • RHEL6.8 running non patched Lustre 2.8 server using ldiskfs.
    • 3
    • 9223372036854775807

    Description

      Our MDS server crashed due to memory exhaustion. Examination of the system logs show nothing out of the ordinary expect it was noticed that an IO scrub did start off on the MDS server:

      Lustre: atlas1-MDT0000-o: trigger OI scrub by RPC for the [0x20003f37b:0x3e5:0x0] with flags 0x4a, rc = 0

      Some time after that we encountered the following crash which is attached.

       

      Attachments

        Activity

          People

            yong.fan nasf (Inactive)
            simmonsja James A Simmons
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: