  Lustre / LU-4264

Excessive slab usage on 1.8.9 server

Details

    • Bug
    • Resolution: Fixed
    • Critical
    • None
    • Lustre 1.8.9
    • None
    • 1
    • 11714

    Description

      NOAA's OSSes have been running out of memory (OOM), which causes failover. Collectl output from just before the crash suggests that the size-256 slab is consuming all of the memory:
      size-256 168M 41122M 168M 41122M 11229K 43864M 11229K 43864M 200704 0.0

      Is there a way to determine what those objects are and to reduce the amount of memory they consume? The vmcore is available if needed.
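
As a first step toward answering that question, the per-cache totals can be ranked directly from /proc/slabinfo on a live OSS. The following is a minimal sketch, assuming Python 3 and the slabinfo 2.x column order (name, active_objs, num_objs, objsize, ...). The function names are hypothetical, and the num_objs * objsize estimate ignores per-slab overhead. It shows which caches are largest; it does not identify who allocated the objects.

```python
def parse_slabinfo(text):
    """Return {cache_name: (num_objs, objsize_bytes)} from slabinfo text.

    Assumes the slabinfo 2.x layout, where each data line starts with:
    name active_objs num_objs objsize ...
    """
    caches = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines, the version banner, and the column header.
        if not line or line.startswith("#") or line.startswith("slabinfo"):
            continue
        fields = line.split()
        if len(fields) < 4:
            continue
        try:
            num_objs = int(fields[2])
            objsize = int(fields[3])
        except ValueError:
            continue  # not a data line in the expected layout
        caches[fields[0]] = (num_objs, objsize)
    return caches


def top_caches(text, n=5):
    """Return the n largest caches as (name, approx_bytes), largest first."""
    usage = [
        (name, num_objs * objsize)
        for name, (num_objs, objsize) in parse_slabinfo(text).items()
    ]
    usage.sort(key=lambda item: item[1], reverse=True)
    return usage[:n]


def report(path="/proc/slabinfo", n=5):
    """Print the n largest caches; reading the file usually requires root."""
    with open(path) as f:
        for name, nbytes in top_caches(f.read(), n):
            print("%-24s %10.1f MiB" % (name, nbytes / float(2 ** 20)))
```

Calling report() periodically on an affected server would show whether size-256 keeps growing. Because generic size-N caches are shared by many kernel subsystems, attributing the objects to a specific caller would still require the vmcore or allocation tracing.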

      Attachments

        1. collectl.out
          1.15 MB
        2. DE11223-fix-mlx4-leak.patch
          1 kB
        3. dk1.gz
          8.29 MB
        4. main.c-308.11.1.el5
          27 kB
        5. main.c-348.1.1.el5
          42 kB
        6. vmcore.log
          496 kB

          People

            green Oleg Drokin
            orentas Oz Rentas
            Votes: 1
            Watchers: 8
