Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-8043

MDS running lustre 2.5.5+ OOM when running with Lustre 2.8 GA clients

Details

    • Bug
    • Resolution: Cannot Reproduce
    • Critical
    • Lustre 2.5.5
    • Lustre 2.5.5
    • None
    • Cray clients running unpatched lustre 2.8 GA clients. Server side running Lustre 2.5.5 with a patch set in a RHEL6.7 environment.
    • 3
    • 9223372036854775807

    Description

      Today we performed a test shot on our smaller Cray Aries cluster (700 nodes) with a non-patched lustre 2.8 GA client specially build for this system. The test were run against our atlas file system which is running a RHEL6.7 distro with the lustre version 2.5.5 with patches. During our test shot while running an IOR single shared file test across all nodes with the stripe count of 1008 the MDS server ran out of memory. I attached the dmesg output to this ticket.

      Attachments

        1. mylog.dk.gz
          4.50 MB
        2. vmcore-dmesg.txt
          454 kB

        Issue Links

          Activity

            [LU-8043] MDS running lustre 2.5.5+ OOM when running with Lustre 2.8 GA clients
            pjones Peter Jones made changes -
            Link Original: This issue is related to JFC-17 [ JFC-17 ]
            pjones Peter Jones made changes -
            Link Original: This issue is related to LDEV-367 [ LDEV-367 ]
            pjones Peter Jones made changes -
            Link Original: This issue is related to LDEV-341 [ LDEV-341 ]
            yujian Jian Yu made changes -
            Resolution New: Cannot Reproduce [ 5 ]
            Status Original: Open [ 1 ] New: Resolved [ 5 ]
            pjones Peter Jones made changes -
            Link Original: This issue is related to LDEV-142 [ LDEV-142 ]
            pjones Peter Jones made changes -
            Link New: This issue is related to LDEV-367 [ LDEV-367 ]
            cliffw Cliff White (Inactive) made changes -
            Remote Link New: This issue links to "Page (HPDD Community Wiki)" [ 17186 ]
            jhammond John Hammond made changes -
            Link New: This issue is related to LU-7535 [ LU-7535 ]
            pjones Peter Jones made changes -
            Link Original: This issue is related to JFC-10 [ JFC-10 ]
            pjones Peter Jones made changes -
            Link New: This issue is related to LDEV-142 [ LDEV-142 ]

            People

              bzzz Alex Zhuravlev
              simmonsja James A Simmons
              Votes:
              0 Vote for this issue
              Watchers:
              10 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: