  1. Lustre
  2. LU-4264

Excessive slab usage on 1.8.9 server


Details

    • Type: Bug
    • Resolution: Fixed
    • Priority: Critical
    • None
    • Lustre 1.8.9
    • None
    • 1
    • 11714

    Description

      NOAA has been hitting out-of-memory (OOM) conditions on their OSSes (object storage servers), which cause failover. Looking at collectl output from right before the crash, it appears that all the memory is being consumed by the size-256 slab:
      size-256 168M 41122M 168M 41122M 11229K 43864M 11229K 43864M 200704 0.0

      Is there a way to determine what those objects are and reduce the amount of memory they are taking? The vmcore is available if necessary.
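A first step that is sometimes useful here is to rank the slab caches by approximate footprint from /proc/slabinfo on a live server. The sketch below is only an illustration: it assumes the slabinfo 2.x column layout (name, active_objs, num_objs, objsize, ...), the helper name parse_slabinfo is hypothetical, and the SAMPLE numbers are made up rather than taken from this system. It shows which caches are largest; it does not by itself identify which code allocates the size-256 objects.

```python
# Rank slab caches by approximate memory use (num_objs * objsize).
# Assumption: input follows the /proc/slabinfo 2.x text layout.

# Made-up sample data for illustration only; not from the reported system.
SAMPLE = """\
slabinfo - version: 2.1
# name            <active_objs> <num_objs> <objsize> <objperslab> <pagesperslab> : tunables <limit> <batchcount> <sharedfactor> : slabdata <active_slabs> <num_slabs> <sharedavail>
size-512          100 200 512 8 1 : tunables 54 27 8 : slabdata 25 25 0
size-256          1000 1200 256 15 1 : tunables 120 60 8 : slabdata 80 80 0
"""


def parse_slabinfo(text):
    """Return [(cache_name, approx_bytes), ...] sorted largest first."""
    rows = []
    for line in text.splitlines():
        # Skip blank lines, the version banner, and the column-header comment.
        if not line.strip() or line.startswith("slabinfo") or line.startswith("#"):
            continue
        fields = line.split()
        if len(fields) < 4:
            continue  # not a cache row in the assumed layout
        name = fields[0]
        num_objs = int(fields[2])   # total objects allocated in the cache
        objsize = int(fields[3])    # size of one object in bytes
        rows.append((name, num_objs * objsize))
    rows.sort(key=lambda row: row[1], reverse=True)
    return rows


for name, approx_bytes in parse_slabinfo(SAMPLE):
    print(f"{name:20s} {approx_bytes / 1024:10.1f} KiB")
```

On a real server the same function could be fed the contents of /proc/slabinfo (reading it usually requires root) in place of SAMPLE.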

      Attachments

        1. collectl.out
          1.15 MB
        2. vmcore.log
          496 kB
        3. dk1.gz
          8.29 MB
        4. main.c-308.11.1.el5
          27 kB
        5. main.c-348.1.1.el5
          42 kB
        6. DE11223-fix-mlx4-leak.patch
          1 kB

          People

            Assignee: Oleg Drokin (green)
            Reporter: Oz Rentas (orentas, Inactive)
            Votes: 1
            Watchers: 8
