[LU-10523] replay-dual test_10: Restart of mds1 failed! Created: 16/Jan/18  Updated: 09/Aug/18

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.10.3, Lustre 2.10.4
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None
Environment:

Failover,
Server and clients : 2.10.RC1, b2_10 build 68


Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

replay-dual test_10 - Restart of mds1 failed!
^^^^^^^^^^^^^ DO NOT REMOVE LINE ABOVE ^^^^^^^^^^^^^

This issue was created by maloo for Saurabh Tandan <saurabh.tandan@intel.com>

This issue relates to the following test suite run:
https://testing.hpdd.intel.com/test_sets/b07e3b80-f65d-11e7-94c7-52540065bddc

test_10 failed with the following error:

Restart of mds1 failed!

test_logs

CMD: onyx-41vm7 /usr/sbin/lctl --device lustre-MDT0000 readonly
CMD: onyx-41vm7 /usr/sbin/lctl mark mds1 REPLAY BARRIER on lustre-MDT0000
CMD: onyx-41vm7 lctl set_param fail_loc=0x80000119
fail_loc=0x80000119
CMD: onyx-41vm7 /usr/sbin/lctl dl
Failing mds1 on onyx-41vm7
+ pm -h powerman --off onyx-41vm7
Command completed successfully
reboot facets: mds1
+ pm -h powerman --on onyx-41vm7
Command completed successfully
Failover mds1 to onyx-41vm8
15:15:44 (1515597344) waiting for onyx-41vm8 network 900 secs ...
15:15:44 (1515597344) network interface is UP
CMD: onyx-41vm8 hostname
mount facets: mds1
CMD: onyx-41vm8 test -b /dev/lvm-Role_MDS/P1
CMD: onyx-41vm8 e2label /dev/lvm-Role_MDS/P1
onyx-41vm8: e2label: No such file or directory while trying to open /dev/lvm-Role_MDS/P1
onyx-41vm8: Couldn't find valid filesystem superblock.
Starting mds1:   -o loop /dev/lvm-Role_MDS/P1 /mnt/lustre-mds1
CMD: onyx-41vm8 mkdir -p /mnt/lustre-mds1; mount -t lustre   -o loop 		                   /dev/lvm-Role_MDS/P1 /mnt/lustre-mds1
onyx-41vm8: mount: /dev/lvm-Role_MDS/P1: failed to setup loop device: No such file or directory
Start of /dev/lvm-Role_MDS/P1 on mds1 failed 32
 replay-dual test_10: @@@@@@ FAIL: Restart of mds1 failed! 


 Comments   
Comment by Sarah Liu [ 03/May/18 ]

I think this is a dup of LU-9707

Comment by Sarah Liu [ 17/May/18 ]

+1 on b2_10 https://testing.hpdd.intel.com/test_sets/b840cfca-58e8-11e8-b303-52540065bddc

Generated at Sat Feb 10 02:35:51 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.