[LU-10230] sanity test_239: 4336 not synced Created: 10/Nov/17 Updated: 25/Nov/19 Resolved: 19/Feb/19 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.11.0, Lustre 2.10.4 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Minor |
| Reporter: | James Casper | Assignee: | James Nunez (Inactive) |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Environment: |
onyx, interop |
||
| Issue Links: |
|
||||||||||||
| Severity: | 3 | ||||||||||||
| Rank (Obsolete): | 9223372036854775807 | ||||||||||||
| Description |
|
session: https://testing.hpdd.intel.com/test_sessions/86f20854-dbc9-4ccd-8316-e84444480a63 Note: This looks like From test_log: CMD: onyx-41vm7 /usr/sbin/lctl get_param -n version 2>/dev/null || /usr/sbin/lctl lustre_build_version 2>/dev/null || /usr/sbin/lctl --version 2>/dev/null | cut -d' ' -f2 CMD: onyx-41vm7 lctl get_param -n osp.*MDT*.sync_changes osp.*MDT*.sync_in_flight sanity test_239: @@@@@@ FAIL: 4336 not synced Trace dump: = /usr/lib64/lustre/tests/test-framework.sh:5289:error() = /usr/lib64/lustre/tests/sanity.sh:14057:test_239() = /usr/lib64/lustre/tests/test-framework.sh:5565:run_one() = /usr/lib64/lustre/tests/test-framework.sh:5604:run_one_logged() = /usr/lib64/lustre/tests/test-framework.sh:5451:run_test() = /usr/lib64/lustre/tests/sanity.sh:14059:main() |
| Comments |
| Comment by Minh Diep [ 31/Jan/18 ] |
|
+1 on b2_10 |
| Comment by Saurabh Tandan (Inactive) [ 08/May/18 ] |
|
+1 on b2_10 for 2.10.3_132 |
| Comment by James Nunez (Inactive) [ 25/Oct/18 ] |
|
Alex, On the b2_10 branch, we landed a patch (commit 236f73509cdcc83cd) for
+ [ $(lustre_version_code $SINGLEMDS) -gt $(version_code 2.10.1) ] &&
+ do_nodes $list "lctl set_param -n osp.*.force_sync=1"
For 2.10.2 and later servers, we would set force_sync=1 during testing. On the master branch, we landed a patch (commit 0ba690a526be74c4cdffe7a7) for
+ [ $(lustre_version_code $SINGLEMDS) -gt $(version_code 2.10.53) ] &&
+ do_nodes $list "lctl set_param -n osp.*.force_sync=1"
In this case, for servers with version 2.10.54 and later, we would set force_sync=1 during testing. When we do interop testing with a 2.10.5 server and a master clients, the server version is 2.10.5 and the server version check in test 239 fails and, thus, force_sync is not set. Is this the correct behavior? Note: sanity test 239 was renamed to 239A in the master branch. |
| Comment by James Nunez (Inactive) [ 12/Feb/19 ] |
|
Here are some recent interop failures of sanity test 239A: |
| Comment by James Nunez (Inactive) [ 13/Feb/19 ] |
|
I've modified the patch for |
| Comment by James Nunez (Inactive) [ 19/Feb/19 ] |
|
Patch to modify version check landed to 2.13. If this issue persists, we can reopen this ticket. |