[LU-5901] replay-dual test_15a: import is not in FULL state Created: 11/Nov/14  Updated: 01/Dec/14  Resolved: 01/Dec/14

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.7.0, Lustre 2.5.4
Fix Version/s: None

Type: Bug Priority: Blocker
Reporter: Jian Yu Assignee: WC Triage
Resolution: Duplicate Votes: 0
Labels: None
Environment:

Lustre Build: https://build.hpdd.intel.com/job/lustre-b2_5/100/
Distro/Arch: RHEL6.5/x86_64
FSTYPE=zfs


Issue Links:
Related
is related to LU-5079 conf-sanity test_47 timeout Resolved
Severity: 3
Rank (Obsolete): 16487

 Description   

replay-dual test 15a failed as follows:

CMD: shadow-2vm10.shadow.whamcloud.com,shadow-2vm9 PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/utils/gss:/usr/lib64/lustre/utils:/usr/lib64/openmpi/bin:/usr/bin:/bin:/usr/sbin:/sbin::/sbin:/bin:/usr/sbin: NAME=autotest_config sh rpc.sh wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid 
shadow-2vm9: CMD: shadow-2vm9.shadow.whamcloud.com lctl get_param -n at_max
shadow-2vm10: CMD: shadow-2vm10.shadow.whamcloud.com lctl get_param -n at_max
shadow-2vm9:  rpc : @@@@@@ FAIL: can't put import for mdc.lustre-MDT0000-mdc-*.mds_server_uuid into FULL state after 662 sec, have REPLAY_WAIT 
shadow-2vm10:  rpc : @@@@@@ FAIL: can't put import for mdc.lustre-MDT0000-mdc-*.mds_server_uuid into FULL state after 662 sec, have REPLAY 

Maloo reports:
https://testing.hpdd.intel.com/test_sets/0cbb451e-6908-11e4-963a-5254006e85c2
https://testing.hpdd.intel.com/test_sets/46a9d6a6-690c-11e4-9444-5254006e85c2



 Comments   
Comment by Jian Yu [ 11/Nov/14 ]

This is a regression failure introduced by Lustre b2_5 build #100.

Here is a for-test-only patch trying to reproduce the failure on Lustre b2_5 build #100: http://review.whamcloud.com/12668

Comment by Jian Yu [ 12/Nov/14 ]

The same regression failure also occurred on master branch:
https://testing.hpdd.intel.com/test_sets/aac207ae-6429-11e4-b689-5254006e85c2
https://testing.hpdd.intel.com/test_sets/474f9bc0-625b-11e4-bd8b-5254006e85c2
https://testing.hpdd.intel.com/test_sets/f0fd57f6-5f5f-11e4-a865-5254006e85c2

Comment by Jian Yu [ 12/Nov/14 ]

It was the patches http://review.whamcloud.com/11213 (master) and http://review.whamcloud.com/12365 (b2_5) for LU-5079 that caused the regressions.

Comment by Peter Jones [ 01/Dec/14 ]

As per Yu Jian this can be closed as a duplicate of LU-5079

Generated at Sat Feb 10 01:55:30 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.