[LU-14025] Need lustre client for SLES15 SP2 and Mellanox OFED 5.1 Created: 13/Oct/20 Updated: 23/Jan/24 Resolved: 23/Jan/24 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.12.5 |
| Fix Version/s: | Lustre 2.12.5 |
| Type: | Bug | Priority: | Critical |
| Reporter: | Jay Lan (Inactive) | Assignee: | Jian Yu |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | build | ||
| Environment: |
Lustre client will run in SLES15 SP2 system with Mellanox OFED 5.1 |
||
| Attachments: |
|
| Severity: | 3 |
| Rank (Obsolete): | 9223372036854775807 |
| Description |
|
We are getting new systems that requires SLES15 SP2 and Mellanox OFED. Uploaded is a log-rpms file that showed compilation failure of lustre client 2.12.5 with SLES15 SP2 kernel and Mellanox OFED 5.1. The only mofed versions that support SLES15 SP2 are version 5.x. The failure looked not trivia. Probably due to some rules changes in newer version of gcc, make, rpmbuilds in SLES15 SP2. I did not see this type of failures in compiling mofed, and other external kernel modules that I built. This only happened to Lustre build. |
| Comments |
| Comment by Peter Jones [ 13/Oct/20 ] |
|
Jay Severity 1 is reserved for site down issues. It seems like this was not the intention of this ticket but please confirm Peter |
| Comment by Jay Lan (Inactive) [ 13/Oct/20 ] |
|
No, not site down. We would need it when we have new system installed. Please adjust the severity and priority as you think appropriate. BTW, since I build lustre rpms myself, I only need to have a working (for 2.12.5) patch 'submitted'. You handle release at your pace. |
| Comment by Peter Jones [ 14/Oct/20 ] |
|
Jay Are you able to build master clients ok? If I understand correctly, SLES15 SP2 requires a 5.3 kernel which I believe has been tested successfully with Ubuntu 18.04 HWE Peter |
| Comment by Jay Lan (Inactive) [ 14/Oct/20 ] |
|
I tried to compile master in sles15sp2 environment. It also failed. There were WARNING before the errors. log-rpms.master-sles15sp2-mofed512 is attached. log-rpms.master-sles15sp2-mofed512 |
| Comment by Peter Jones [ 14/Oct/20 ] |
|
Jian Could you please investigate? Thanks Peter |
| Comment by Jian Yu [ 16/Oct/20 ] |
|
Hi Jay,
The warnings did not cause the build to fail. Have you tried to install the following packages and proceed with the build? error: Failed build dependencies:
binutils-devel is needed by lustre-client-2.13.56-1.x86_64
openmpi2-devel is needed by lustre-client-2.13.56-1.x86_64
FYI, with SLES15 SP2 client support patch https://review.whamcloud.com/40265 applied to master branch, I can successfully build it with MLNX_OFED 5.1-2.3.7.1: # uname -r 5.3.18-24.24-default # rpm -qf /usr/src/ofa_kernel/default/ mlnx-ofa_kernel-devel-5.1-OFED.5.1.2.3.7.1.sles15sp2.x86_64 # cd lustre-release/ # sh ./autogen.sh # ./configure --disable-server --without-zfs --with-linux=/usr/src/linux-5.3.18-24.24 --with-linux-obj=/usr/src/linux-5.3.18-24.24-obj/x86_64/default --with-o2ib=/usr/src/ofa_kernel/default/ # make rpms <~snip~> Wrote: /tmp/rpmbuild-lustre-root-Pz1U9Cf0/RPMS/x86_64/lustre-client-2.13.56_23_g9e2f8a4-1.x86_64.rpm Wrote: /tmp/rpmbuild-lustre-root-Pz1U9Cf0/RPMS/x86_64/lustre-client-kmp-default-2.13.56_23_g9e2f8a4_k5.3.18_24.24-1.x86_64.rpm Wrote: /tmp/rpmbuild-lustre-root-Pz1U9Cf0/RPMS/x86_64/lustre-client-tests-2.13.56_23_g9e2f8a4-1.x86_64.rpm <~snip~> I will try Lustre b2_12 branch and figure out what patches are needed. |
| Comment by Jian Yu [ 16/Oct/20 ] |
|
Patches for |
| Comment by Jian Yu [ 23/Oct/20 ] |
|
Hi Jay, |
| Comment by Jay Lan (Inactive) [ 27/Oct/20 ] |
|
Hi Jian, I picked up these patches: Some errors: The log-rpms.20201027 would be uploaded. |
| Comment by Jay Lan (Inactive) [ 27/Oct/20 ] |
|
In sles15sp2 (linux 5.3), gcc version is 7.x. The '-Wno-stringop-truncation' problem seems to require gcc 8. |
| Comment by Jian Yu [ 27/Oct/20 ] |
|
Hi Jay, |
| Comment by Jay Lan (Inactive) [ 27/Oct/20 ] |
|
Hi Jian, Does any of patches you listed require gcc8 or kernel5.4? sles15sp2 is running kernel5.3 and gcc7. Thanks, |
| Comment by Jian Yu [ 27/Oct/20 ] |
|
No, Jay. # uname -r 5.3.18-24.24-default # gcc --version | head -1 gcc (SUSE Linux) 7.5.0 The commit messages for the above four |
| Comment by Jay Lan (Inactive) [ 29/Oct/20 ] |
|
Hi Jian, I needed to create a temporary workaround to address a synopsis change of rdma_reject() in mofed-5.1. The affected code is in lnet/klnds/o2iblnd/o2iblnd_cb.c. Otherwise, all worked well! I had rpms created. Thank you for your help! |
| Comment by Jian Yu [ 29/Oct/20 ] |
|
You're welcome, Jay. |
| Comment by Jay Lan (Inactive) [ 30/Oct/20 ] |
|
Ah, no, I did not have that patch in our nas-2.12.5 branch. Thank you. |