Lustre / LU-9196

MDS server for Atlas file system crashed due to memory exhaustion.


    • Type: Bug
    • Resolution: Fixed
    • Priority: Minor
    • None
    • Lustre 2.8.0
    • None
    • Environment: RHEL 6.8 running an unpatched Lustre 2.8 server using ldiskfs.
    • 3

      Our MDS server crashed due to memory exhaustion. The system logs show nothing out of the ordinary, except that an OI scrub started on the MDS server:

      Lustre: atlas1-MDT0000-o: trigger OI scrub by RPC for the [0x20003f37b:0x3e5:0x0] with flags 0x4a, rc = 0

      Some time after that, we encountered the crash recorded in the attachment.
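
      When correlating the crash with the logs, it can help to pull every OI scrub trigger message out of syslog along with the FID it names. The following is a minimal sketch only, assuming the message always has the same shape as the single line quoted in this report; the function name `parse_oi_scrub_line` is hypothetical and not part of any Lustre tool. The bracketed value is treated as a FID in the usual `[sequence:object id:version]` form.

```python
import re

# Pattern modelled on the one message quoted in this report.
# Assumption: other OI scrub trigger messages share this exact layout.
OI_SCRUB_RE = re.compile(
    r"Lustre: (?P<device>\S+): trigger OI scrub by RPC for the "
    r"\[(?P<seq>0x[0-9a-fA-F]+):(?P<oid>0x[0-9a-fA-F]+):(?P<ver>0x[0-9a-fA-F]+)\] "
    r"with flags (?P<flags>0x[0-9a-fA-F]+), rc = (?P<rc>-?\d+)"
)


def parse_oi_scrub_line(line):
    """Return the fields of an OI scrub trigger message, or None if no match."""
    m = OI_SCRUB_RE.search(line)
    if m is None:
        return None
    return {
        "device": m.group("device"),
        # FID components are hexadecimal: sequence, object id, version.
        "seq": int(m.group("seq"), 16),
        "oid": int(m.group("oid"), 16),
        "ver": int(m.group("ver"), 16),
        "flags": int(m.group("flags"), 16),
        "rc": int(m.group("rc")),
    }


if __name__ == "__main__":
    sample = (
        "Lustre: atlas1-MDT0000-o: trigger OI scrub by RPC for the "
        "[0x20003f37b:0x3e5:0x0] with flags 0x4a, rc = 0"
    )
    print(parse_oi_scrub_line(sample))
```

      Running the function over each line of the MDS syslog gives a list of the FIDs that triggered a scrub and when, which can then be compared against the time of the memory exhaustion.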

       

            Assignee: nasf (Inactive)
            Reporter: James A Simmons
            Votes: 0
            Watchers: 3
