Details
-
Bug
-
Resolution: Won't Fix
-
Minor
-
None
-
Lustre 2.1.1, Lustre 1.8.6
-
None
-
Server runs centos 6.2, ofed-1.5.4.1, Lustre 2.1.1.
Client runs sles11sp1, ofed-1.5.4.1, Lustre 1.8.6.
MGS/MDS uses the same device. Two OSS'es. Two clients.
-
3
-
6096
Description
My acc-sm set-ups has been used in testing 1.8.5, 1.8.6, and 1.8.7 successfully.
This is the first time I ran acc-sm against 2.1.1.
The SANITY and SANITYN passed, but all tests in REPLAY_SINGLE failed since
"@@@@@@ FAIL: Restart of mds failed".
== test 0a: empty replay == 12:05:12
Filesystem 1K-blocks Used Available Use% Mounted on
service360@o2ib:/lustre
3937056 205112 3531816 6% /mnt/nbp0-1
Failing mds on node service360
Stopping /mnt/mds (opts![]()
affected facets: mds
df pid is 13509
Failover mds to service360
12:05:26 (1333134326) waiting for service360 network 900 secs ...
12:05:26 (1333134326) network interface is UP
Starting mds: -o errors=panic,acl /dev/sdb1 /mnt/mds
service360: mount.lustre: mount /dev/sdb1 at /mnt/mds failed: Invalid argument
service360: This may have multiple causes.
service360: Are the mount options correct?
service360: Check the syslog for more info.
mount -t lustre /dev/sdb1 /mnt/mds
Start of /dev/sdb1 on mds failed 22
replay-single test_0a: @@@@@@ FAIL: Restart of mds failed!
The /var/log/message of the MGS/MDS node showed:
...
Mar 30 12:05:10 service360 kernel: Lustre: MGC10.151.26.38@o2ib: Reactivating import
Mar 30 12:05:10 service360 kernel: LustreError: 11254:0:(llog_lvfs.c:473:llog_lvfs_next_block()) Invalid llog tail at log id 17/2375643311 offset 14432
Mar 30 12:05:10 service360 kernel: LustreError: 11254:0:(mgs_handler.c:783:mgs_handle()) MGS handle cmd=502 rc=-22
...
The replay-single.test_0a.debug_log.service360.log.[12] are attached.