Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-1275

Lustre 2.1.1 REPLAY_SINGLE test_0a FAIL: Restart of mds failed

    XMLWordPrintable

Details

    • Bug
    • Resolution: Won't Fix
    • Minor
    • None
    • Lustre 2.1.1, Lustre 1.8.6
    • None
    • Server runs centos 6.2, ofed-1.5.4.1, Lustre 2.1.1.
      Client runs sles11sp1, ofed-1.5.4.1, Lustre 1.8.6.
      MGS/MDS uses the same device. Two OSS'es. Two clients.
    • 3
    • 6096

    Description

      My acc-sm set-ups has been used in testing 1.8.5, 1.8.6, and 1.8.7 successfully.
      This is the first time I ran acc-sm against 2.1.1.
      The SANITY and SANITYN passed, but all tests in REPLAY_SINGLE failed since
      "@@@@@@ FAIL: Restart of mds failed".

      == test 0a: empty replay == 12:05:12
      Filesystem 1K-blocks Used Available Use% Mounted on
      service360@o2ib:/lustre
      3937056 205112 3531816 6% /mnt/nbp0-1
      Failing mds on node service360
      Stopping /mnt/mds (opts
      affected facets: mds
      df pid is 13509
      Failover mds to service360
      12:05:26 (1333134326) waiting for service360 network 900 secs ...
      12:05:26 (1333134326) network interface is UP
      Starting mds: -o errors=panic,acl /dev/sdb1 /mnt/mds
      service360: mount.lustre: mount /dev/sdb1 at /mnt/mds failed: Invalid argument
      service360: This may have multiple causes.
      service360: Are the mount options correct?
      service360: Check the syslog for more info.
      mount -t lustre /dev/sdb1 /mnt/mds
      Start of /dev/sdb1 on mds failed 22
      replay-single test_0a: @@@@@@ FAIL: Restart of mds failed!

      The /var/log/message of the MGS/MDS node showed:
      ...
      Mar 30 12:05:10 service360 kernel: Lustre: MGC10.151.26.38@o2ib: Reactivating import
      Mar 30 12:05:10 service360 kernel: LustreError: 11254:0:(llog_lvfs.c:473:llog_lvfs_next_block()) Invalid llog tail at log id 17/2375643311 offset 14432
      Mar 30 12:05:10 service360 kernel: LustreError: 11254:0:(mgs_handler.c:783:mgs_handle()) MGS handle cmd=502 rc=-22
      ...
      The replay-single.test_0a.debug_log.service360.log.[12] are attached.

      Attachments

        Activity

          People

            mdiep Minh Diep
            jaylan Jay Lan (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: