Details
-
Bug
-
Resolution: Fixed
-
Minor
-
None
-
Lustre 1.8.7
-
None
-
Scientific Linux 6.5
-
3
-
16616
Description
Hello Lustre tream!
We're seeing an issue with the ko2iblnd module not being built properly any longer with our lustre 1.8.7 source with the latest 2.6.32-504.x Linux kernel.
We have successfully been building RPM's utilizing the command below for several kernel revisions: 2.6.32_220.x, 2.6.32_279.x, 2.6.32_358.x, and 2.6.32_431.x.
./configure --with-linux=/lib/modules/KERNEL_VERSION/build --disable-lru-resize --enable-ext4 --disable-server
However, with the release of 2.6.32_504.x we are now seeing that the ko2iblnd module isn't built, and as a result we get I/O errors when loading the lustre module:
LustreError: 2897:0:(api-ni.c:1081:lnet_startup_lndnis()) Can't load LND o2ib, module ko2iblnd, rc=256
LustreError: 2897:0:(events.c:725:ptlrpc_init_portals()) network initialisation failed
Looking over Google and other Jira tickets, we found that using "--with-o2ib=yes" produces the error (which is probably just a red herring):
configure: error: can't compile with kernel OpenIB gen2 headers
Given that the other builds were successful, I am fairly certain that this is a kernel issue, but I wanted to double-check here first. I've attached the config.log for perusal.
Here is an RPM package query from the latest build to the most recent twp:
- pwd
/root/rpmbuild/RPMS/x86_64
- ls -1t lustre-modules-1.8.7-2.6.32_*|head -3
lustre-modules-1.8.7-2.6.32_504.1.3.el6.x86_64.x86_64.rpm
lustre-modules-1.8.7-2.6.32_431.29.2.el6.x86_64.x86_64.rpm
lustre-modules-1.8.7-2.6.32_431.23.3.el6.x86_64.x86_64.rpm
- ls -1t lustre-modules-1.8.7-2.6.32_*|head -3|xargs -I'{}' rpm -qlp {} |grep ko2iblnd
/lib/modules/2.6.32-431.29.2.el6.x86_64/updates/kernel/net/lustre/ko2iblnd.ko
/lib/modules/2.6.32-431.23.3.el6.x86_64/updates/kernel/net/lustre/ko2iblnd.ko
Thank you for any guidance and/or additional information.
John DeSantis
Attachments
Issue Links
- is related to
-
LU-5909 Kernel update [RHEL6.6 2.6.32-504.1.3.el6]
-
- Resolved
-
Martin,
I completely agree with striping, not just for I/O but also because recently ran into a case where the 2.0 TB file size limit was reached because the file wasn't striped over all available OST's.
I was able to get the 1.8.7 client RPM's built (using 1.8.7_wc1) with the ko2iblnd module present using your patch, the
LU-1116full patch (https://jira.hpdd.intel.com/secure/attachment/10860/LU-1116-full.patch), and the patch fromLU-2800(http://review.whamcloud.com/#/c/8607/5).I will be doing some testing over the next few days to see if there are any differences in performance with "default" values between the client versions 1.8.7 and 1.8.9.
Again, without your patch we would have been stuck. So I, too, can confirm to the @Developers that your patch has worked.
John DeSantis