We haven’t seen conf-sanity test 55 fail with
for over a year. I’ve gone back to January of 2017 and don’t see this error message for this test.
What we do see frequently is the error
which is not the same issue. We see this error message only during interop testing, when a client at version 2.9.56 (actually 2.9.55.36) or earlier runs against a server at version 2.9.57 (actually ~2.9.55.38) or later. For example, we see this failure with the following Lustre client/server combinations:
2.9.0 clients and 2.11.50.52 servers
2.9.0 clients and 2.10.52.75 servers
2.5.5-RC2 clients and 2.10.52.97 servers
2.7.3 (2_7_fe) clients and 2.9.55.41 servers
2.9.0 clients and 2.9.56.11 servers
The issue with interop testing is that the patch for LU-4017, commit 91fbc94f3eabe9a, changed the following in conf-sanity test 55:
echo checking size of lov_objid for ost index $i
- LOV_OBJID_SIZE=$(do_facet mds1 "$DEBUGFS -R 'stat lov_objid' $mdsdev 2>/dev/null" | grep ^User | awk '{print $6}')
+ LOV_OBJID_SIZE=$(do_facet mds1 "$DEBUGFS -R 'stat lov_objid' $mdsdev 2>/dev/null" |
+ grep ^User | awk -F 'Size: ' '{print $2}')
if [ "$LOV_OBJID_SIZE" != $(lov_objid_size $i) ]; then
error "lov_objid size has to be $(lov_objid_size $i), not $LOV_OBJID_SIZE"
else
Looking at a master (2.11.50) MDS on a running system, we see
Using the “old”, pre-2.9.56 grep/awk commands that print $6, we get
which explains the output we see with interop testing.
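The difference between the two parsing commands can be sketched on sample input. This is a minimal illustration only: the two "User" lines below are assumed layouts, not output captured from the systems in this ticket, and the extra "Project" column in the newer layout is an assumption about what shifted the field positions. The helper names size_by_position and size_by_label are hypothetical and do not exist in conf-sanity.

```shell
#!/bin/bash
# Illustrative "User" lines from 'debugfs stat'. These layouts are assumed
# for the sketch; they were not captured from the systems discussed here.
OLD_LINE='User:     0   Group:     0   Size: 8'
NEW_LINE='User:     0   Group:     0   Project:     0   Size: 8'

# Pre-LU-4017 parse: take the sixth whitespace-separated field.
size_by_position() {
	echo "$1" | grep ^User | awk '{print $6}'
}

# Post-LU-4017 parse: take whatever follows the "Size: " label.
size_by_label() {
	echo "$1" | grep ^User | awk -F 'Size: ' '{print $2}'
}

for line in "$OLD_LINE" "$NEW_LINE"; do
	echo "by position: $(size_by_position "$line"), by label: $(size_by_label "$line")"
done
```

On these sample lines, the positional parse returns the size only for the shorter layout, while the label-based parse returns it for both. That is consistent with old clients, which still use the positional parse, failing against newer servers.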
Thus, if we want to "fix" this issue, we would need to change which field is printed based on the server version number, for all clients at 2.9.0 and earlier, which seems unlikely.
+1 on 2.10.3 https://testing.hpdd.intel.com/test_sets/61ca2062-5067-11e8-abc3-52540065bddc