[LU-9322] parallel-scale test_connectathon: connectathon failed: 1 Created: 11/Apr/17  Updated: 03/Dec/18  Resolved: 03/Dec/18

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.10.0
Fix Version/s: Lustre 2.10.0

Type: Bug Priority: Minor
Reporter: James Casper Assignee: WC Triage
Resolution: Fixed Votes: 1
Labels: None
Environment:

onyx-44, Full,
RHEL7.3, DNE, ZFS, master branch, v2.10.55, b3550


Issue Links:
Related
is related to LU-9285 revert LU-8367 and LU-8972 Resolved
is related to LU-11728 parallel-scale test connectathon fail... Open
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

https://testing.hpdd.intel.com/test_sessions/4b44547b-cba5-4172-ab2c-b902e4049a92

From test_log:

test telldir cookies
expected file 94 at cookie 0, found .
special tests failed
 parallel-scale test_connectathon: @@@@@@ FAIL: connectathon failed: 1 
  Trace dump:
  = /usr/lib64/lustre/tests/test-framework.sh:4893:error()
  = /usr/lib64/lustre/tests/functions.sh:542:run_connectathon()
  = /usr/lib64/lustre/tests/parallel-scale.sh:100:test_connectathon()
  = /usr/lib64/lustre/tests/test-framework.sh:5169:run_one()
  = /usr/lib64/lustre/tests/test-framework.sh:5208:run_one_logged()
  = /usr/lib64/lustre/tests/test-framework.sh:5055:run_test()
  = /usr/lib64/lustre/tests/parallel-scale.sh:102:main()

From OST console (onyx-44vm8):

LustreError: 784:0:(ofd_dev.c:1688:ofd_create_hdl()) lustre-OST0005: invalid precreate request for 0x0:302369, last_id 302881. Likely MDS last_id corruption
LustreError: 784:0:(ofd_dev.c:1688:ofd_create_hdl()) Skipped 120 previous similar messages

From MDS console (onyx-44vm7):

LustreError: 28135:0:(osp_precreate.c:917:osp_precreate_cleanup_orphans()) lustre-OST0005-osc-MDT0000: cannot cleanup orphans: rc = -22
LustreError: 28135:0:(osp_precreate.c:917:osp_precreate_cleanup_orphans()) Skipped 120 previous similar messages

May be related to LU-8806. It also has a "cannot cleanup orphans" message, but with a different return code.



 Comments   
Comment by John Hammond [ 12/Apr/17 ]

Alex, would you suggest one of your changes https://review.whamcloud.com/#/c/25925/ or https://review.whamcloud.com/#/c/25926 here?

Comment by Alex Zhuravlev [ 12/Apr/17 ]

both, actually. the fix depends on the reverts.

Comment by James Nunez (Inactive) [ 03/Dec/18 ]

Resolving issue because the patches were reverted and we haven't seen this issue for at least the past six months.

Generated at Sat Feb 10 02:25:09 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.