Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-14297

Can't compile lustre client against MLNX OFED-5.2-1.0.4 on Centos 7.8

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Major
    • Lustre 2.12.7
    • Lustre 2.12.5
    • None
    • Dell and Lenovo hardware. MLNX OFED-5.2-1.0.4. Lustre 2.12.5. OS is Centos 7.8. Kernel is 3.10.0-1127.19.1.el7.x86_64

    Description

      Hello, I am trying to install lustre on our lnet routers which have connectx-5 cards installed in them using dkms on Centos 7.8 with kernel 3.10.0-1127.19.1.el7.x86_64. Also Mellanox just released their latest driver version OFED-5.2-1.0.4 yesterday Jan 4, 2021. When dkms tries to compile lustre, it fails with the following at end:

      configure: LNet kernel checks
      ==============================================================================
      checking whether to enable CPU affinity support... yes
      checking if Linux kernel has cpu affinity support... yes
      checking whether to enable tunable backoff TCP support... yes
      checking if Linux kernel has tunable backoff TCP support... no
      checking whether to use Compat RDMA... /bin/ofed_info
      no
      configure: error: no OFED nor kernel OpenIB gen2 headers present
      configure error, check /var/lib/dkms/lustre-client/2.12.5/build/config.log

      Building module:
      cleaning build area...(bad exit status: 2)
      make -j8 KERNELRELEASE=3.10.0-1127.19.1.el7.x86_64...(bad exit status: 2)
      Error! Bad return status for module build on kernel: 3.10.0-1127.19.1.el7.x86_64 (x86_64)
      Consult /var/lib/dkms/lustre-client/2.12.5/build/make.log for more information.

      Also, I did verify that the MLNX rpms that are supposed to be installed, are installed.
      On the machine I am trying to install on, I did check and ibstat states that both the cards have an active LinkUP:

      [root@lnet08 ~]# ibstat
      CA 'mlx5_0'
      CA type: MT4119
      Number of ports: 1
      Firmware version: 16.26.1040
      Hardware version: 0
      Node GUID: 0xb8599f03002f8318
      System image GUID: 0xb8599f03002f8318
      Port 1:
      State: Active
      Physical state: LinkUp
      Rate: 100
      Base lid: 1522
      LMC: 0
      SM lid: 1434
      Capability mask: 0x2651e848
      Port GUID: 0xb8599f03002f8318
      Link layer: InfiniBand
      CA 'mlx5_1'
      CA type: MT4119
      Number of ports: 1
      Firmware version: 16.26.1040
      Hardware version: 0
      Node GUID: 0xb8599f03002f8319
      System image GUID: 0xb8599f03002f8318
      Port 1:
      State: Active
      Physical state: LinkUp
      Rate: 56
      Base lid: 2260
      LMC: 0
      SM lid: 158
      Capability mask: 0x2651e848
      Port GUID: 0xb8599f03002f8319
      Link layer: InfiniBand

      Any ideas how to get this to work ?

      Thanks,
      Mike

      Attachments

        1. autogen.sh
          0.3 kB
        2. config.log
          208 kB
        3. lustre-version.m4
          1 kB

        Issue Links

          Activity

            People

              yujian Jian Yu
              mre64 Michael Ethier (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: