[LU-5619] Hard Failover replay-dual test_0b: mount MDS failed

Details

    • Type: Bug
    • Resolution: Cannot Reproduce
    • Priority: Minor
    • Fix Version/s: None
    • Affects Version/s: Lustre 2.7.0, Lustre 2.8.0
    • Environment: server and client: lustre-master build #2642
    • Severity: 3
    • 15719

    Description

      This issue was created by maloo for sarah <sarah@whamcloud.com>

      This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/718edf3e-37c4-11e4-a2a6-5254006e85c2.

      The sub-test test_0b failed with the following error:

      mount1 fais

      == replay-dual test 0b: lost client during waiting for next transno == 13:28:17 (1410182897)
      CMD: shadow-12vm8 sync; sync; sync
      Filesystem           1K-blocks  Used Available Use% Mounted on
      shadow-12vm12:shadow-12vm8:/lustre
                            14223104 19712  14189056   1% /mnt/lustre
      CMD: shadow-12vm5,shadow-12vm6,shadow-12vm9.shadow.whamcloud.com mcreate /mnt/lustre/fsa-\$(hostname); rm /mnt/lustre/fsa-\$(hostname)
      CMD: shadow-12vm5,shadow-12vm6,shadow-12vm9.shadow.whamcloud.com if [ -d /mnt/lustre2 ]; then mcreate /mnt/lustre2/fsa-\$(hostname); rm /mnt/lustre2/fsa-\$(hostname); fi
      CMD: shadow-12vm8 /usr/sbin/lctl --device lustre-MDT0000 notransno
      CMD: shadow-12vm8 /usr/sbin/lctl --device lustre-MDT0000 readonly
      CMD: shadow-12vm8 /usr/sbin/lctl mark mds1 REPLAY BARRIER on lustre-MDT0000
      CMD: shadow-12vm8 /usr/sbin/lctl dl
      Failing mds1 on shadow-12vm8
      + pm -h powerman --off shadow-12vm8
      Command completed successfully
      reboot facets: mds1
      + pm -h powerman --on shadow-12vm8
      Command completed successfully
      Failover mds1 to shadow-12vm12
      13:28:33 (1410182913) waiting for shadow-12vm12 network 900 secs ...
      13:28:33 (1410182913) network interface is UP
      CMD: shadow-12vm12 hostname
      mount facets: mds1
      CMD: shadow-12vm12 zpool list -H lustre-mdt1 >/dev/null 2>&1 ||
      			zpool import -f -o cachefile=none -d /dev/lvm-Role_MDS lustre-mdt1
      Starting mds1:   lustre-mdt1/mdt1 /mnt/mds1
      CMD: shadow-12vm12 mkdir -p /mnt/mds1; mount -t lustre   		                   lustre-mdt1/mdt1 /mnt/mds1
      CMD: shadow-12vm12 PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/utils/gss:/usr/lib64/lustre/utils:/usr/lib64/openmpi/bin:/usr/bin:/bin:/usr/sbin:/sbin::/sbin:/bin:/usr/sbin: NAME=autotest_config sh rpc.sh set_default_debug \"-1\" \"all -lnet -lnd -pinger\" 4 
      CMD: shadow-12vm12 zfs get -H -o value lustre:svname 		                           lustre-mdt1/mdt1 2>/dev/null
      Started lustre-MDT0000
      Starting client: shadow-12vm9.shadow.whamcloud.com:  -o user_xattr,flock shadow-12vm12:shadow-12vm8:/lustre /mnt/lustre
      CMD: shadow-12vm9.shadow.whamcloud.com mkdir -p /mnt/lustre
      CMD: shadow-12vm9.shadow.whamcloud.com mount -t lustre -o user_xattr,flock shadow-12vm12:shadow-12vm8:/lustre /mnt/lustre
      mount.lustre: mount shadow-12vm12:shadow-12vm8:/lustre at /mnt/lustre failed: Input/output error
      Is the MGS running?
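
      For reference, the "Input/output error ... Is the MGS running?" failure on the final client mount generally means the client could not reach an MGS at either NID it was given. A minimal manual check of the failover target, assuming the combined MGS/MDT0000 service really did start on shadow-12vm12 as the log above suggests (host, pool, and dataset names are taken from this log; the @tcp network type is an assumption):

      # On the failover MDS node (shadow-12vm12): confirm the MGS and MDT devices are set up
      lctl dl | egrep 'MGS|MDT0000'

      # Confirm the ZFS dataset really carries the expected Lustre target
      zfs get -H -o value lustre:svname lustre-mdt1/mdt1

      # Check whether MDT0000 is still in recovery, and for how long
      lctl get_param mdt.lustre-MDT0000.recovery_status

      # On the client (shadow-12vm9): verify the failover server is reachable over LNet
      lctl ping shadow-12vm12@tcp

      # Then retry the client mount exactly as the test does
      mount -t lustre -o user_xattr,flock shadow-12vm12:shadow-12vm8:/lustre /mnt/lustre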
      

          Activity

            Saurabh Tandan (Inactive) added a comment (edited):
            Another instance found on b2_8 for failover testing, build #6.
            https://testing.hpdd.intel.com/test_sessions/54ec62da-d99d-11e5-9ebe-5254006e85c2
            https://testing.hpdd.intel.com/test_sessions/c5a8e44c-d9c7-11e5-85dd-5254006e85c2

            Saurabh Tandan (Inactive) added a comment:
            Another instance found for hard failover: EL7 Server/Client - ZFS
            build #3305
            https://testing.hpdd.intel.com/test_sets/02982ada-bbc7-11e5-8506-5254006e85c2

            Saurabh Tandan (Inactive) added a comment:
            master, build #3264, 2.7.64 tag
            Hard Failover: EL6.7 Server/Client - ZFS
            It's blocking a series of tests (tests 0b, 1, 2, 3, 4, 5, 6, 8, 10, 19, 21a).
            https://testing.hpdd.intel.com/test_sets/2dfea442-9ebc-11e5-98a4-5254006e85c2
            Sarah Liu added a comment:
            Here is another instance which has console logs:
            https://testing.hpdd.intel.com/test_sets/cb1abc08-6a96-11e4-9c96-5254006e85c2
            Oleg Drokin added a comment:

            With no console logs attached to the run, it is hard to correlate what happens when.
            It may well be that zfs startup takes longer than usual.
            I do see the MDS mount just fine and then enter recovery, but there is no indication of how long that lasted, and we never see recovery complete before the failure is declared.

            Is it possible to fetch console logs?

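            As an illustration of the point above about not knowing how long recovery ran: a small polling loop like the following, run on the acting MDS during the failover, would record when (or whether) recovery completes within the test window. This is only a sketch; the target name lustre-MDT0000 comes from the log above, and the 900-second cap simply mirrors the test's network wait, not any official timeout.

            # Poll MDT0000 recovery status and report how long it takes to reach COMPLETE
            start=$(date +%s)
            while true; do
                status=$(lctl get_param -n mdt.lustre-MDT0000.recovery_status 2>/dev/null | awk '/^status:/ {print $2}')
                elapsed=$(( $(date +%s) - start ))
                echo "$(date '+%H:%M:%S') recovery_status=${status:-unknown} elapsed=${elapsed}s"
                [ "$status" = "COMPLETE" ] && break
                [ "$elapsed" -ge 900 ] && { echo "recovery still not complete after ${elapsed}s"; break; }
                sleep 5
            done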

            People

              Assignee: WC Triage
              Reporter: Maloo
              Votes: 0
              Watchers: 4
