Details
-
Bug
-
Resolution: Fixed
-
Major
-
Lustre 2.12.5
-
None
-
Dell and Lenovo hardware. MLNX OFED-5.2-1.0.4. Lustre 2.12.5. OS is Centos 7.8. Kernel is 3.10.0-1127.19.1.el7.x86_64
-
3
-
9223372036854775807
Description
Hello, I am trying to install lustre on our lnet routers which have connectx-5 cards installed in them using dkms on Centos 7.8 with kernel 3.10.0-1127.19.1.el7.x86_64. Also Mellanox just released their latest driver version OFED-5.2-1.0.4 yesterday Jan 4, 2021. When dkms tries to compile lustre, it fails with the following at end:
configure: LNet kernel checks
==============================================================================
checking whether to enable CPU affinity support... yes
checking if Linux kernel has cpu affinity support... yes
checking whether to enable tunable backoff TCP support... yes
checking if Linux kernel has tunable backoff TCP support... no
checking whether to use Compat RDMA... /bin/ofed_info
no
configure: error: no OFED nor kernel OpenIB gen2 headers present
configure error, check /var/lib/dkms/lustre-client/2.12.5/build/config.log
Building module:
cleaning build area...(bad exit status: 2)
make -j8 KERNELRELEASE=3.10.0-1127.19.1.el7.x86_64...(bad exit status: 2)
Error! Bad return status for module build on kernel: 3.10.0-1127.19.1.el7.x86_64 (x86_64)
Consult /var/lib/dkms/lustre-client/2.12.5/build/make.log for more information.
Also, I did verify that the MLNX rpms that are supposed to be installed, are installed.
On the machine I am trying to install on, I did check and ibstat states that both the cards have an active LinkUP:
[root@lnet08 ~]# ibstat
CA 'mlx5_0'
CA type: MT4119
Number of ports: 1
Firmware version: 16.26.1040
Hardware version: 0
Node GUID: 0xb8599f03002f8318
System image GUID: 0xb8599f03002f8318
Port 1:
State: Active
Physical state: LinkUp
Rate: 100
Base lid: 1522
LMC: 0
SM lid: 1434
Capability mask: 0x2651e848
Port GUID: 0xb8599f03002f8318
Link layer: InfiniBand
CA 'mlx5_1'
CA type: MT4119
Number of ports: 1
Firmware version: 16.26.1040
Hardware version: 0
Node GUID: 0xb8599f03002f8319
System image GUID: 0xb8599f03002f8318
Port 1:
State: Active
Physical state: LinkUp
Rate: 56
Base lid: 2260
LMC: 0
SM lid: 158
Capability mask: 0x2651e848
Port GUID: 0xb8599f03002f8319
Link layer: InfiniBand
Any ideas how to get this to work ?
Thanks,
Mike