Lustre / LU-16389

Lustre 2.12.9 ksocklnd crash with 100+GB ethernet

Details

    • Bug
    • Resolution: Won't Fix
    • Critical
    • None
    • Lustre 2.12.9
    • RHEL 8 running Lustre 2.12.9, which is nearly vanilla. This is a 200 Gb Ethernet setup.
    • 3

    Description

      Running a nearly vanilla Lustre 2.12.9, we see the following crash from time to time on our production 100 Gb Ethernet system:

      kernel:LNetError: 6003:0:(socklnd_cb.c:1985:ksocknal_connect()) ASSERTION( (wanted & (1 << 3)) != 0 ) failed:
      kernel:LNetError: 6003:0:(socklnd_cb.c:1985:ksocknal_connect()) LBUG
      kernel:Kernel panic - not syncing: LBUG in interrupt.
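
      For readers unfamiliar with the asserted expression, the following is a minimal sketch of what the check in the log tests: it passes only when bit 3 of the `wanted` mask is set. The macro and function names here are hypothetical and are not taken from the Lustre source; only the expression `(wanted & (1 << 3)) != 0` comes from the log above. Which connection type bit 3 stands for in ksocklnd is not stated in this ticket, so the sketch makes no claim about it.

      ```c
      #include <stdbool.h>

      /* Hypothetical name: the bit index taken from "(1 << 3)" in the log. */
      #define ASSERTED_BIT 3

      /*
       * Mirrors the asserted expression from the log line:
       * returns true when bit 3 of the mask is set, false otherwise.
       * The LBUG in the report corresponds to this returning false.
       */
      static bool asserted_bit_is_set(unsigned int wanted)
      {
              return (wanted & (1u << ASSERTED_BIT)) != 0;
      }
      ```

      Under this reading, a mask such as 0x8 or 0xF would satisfy the assertion, while 0x0 or 0x6 would trigger it, so the crash suggests ksocknal_connect() was reached with a `wanted` mask that did not have bit 3 set.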

          People

            pjones Peter Jones
            simmonsja James A Simmons
            Votes: 0
            Watchers: 5