Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-5420

Failure on test suite sanity test_17m: mount MDS failed, Input/output error

Details

    • Bug
    • Resolution: Fixed
    • Blocker
    • Lustre 2.8.0
    • Lustre 2.6.0, Lustre 2.7.0
    • client and server: lustre-b2_6-rc2 RHEL6 ldiskfs DNE mode
    • 3
    • 15076

    Description

      This issue was created by maloo for sarah <sarah@whamcloud.com>

      This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/16302020-14ed-11e4-bb6a-5254006e85c2.

      The sub-test test_17m failed with the following error:

      test failed to respond and timed out

      Hit this bug in many tests, the env is configured as 1 MDS with 2 MDTs. Didn't hit this error when the configuration is 2 MDSs with 2 MDTs
      client console:

      CMD: onyx-46vm7 mkdir -p /mnt/mds1
      CMD: onyx-46vm7 test -b /dev/lvm-Role_MDS/P1
      Starting mds1:   /dev/lvm-Role_MDS/P1 /mnt/mds1
      CMD: onyx-46vm7 mkdir -p /mnt/mds1; mount -t lustre   		                   /dev/lvm-Role_MDS/P1 /mnt/mds1
      onyx-46vm7: mount.lustre: mount /dev/mapper/lvm--Role_MDS-P1 at /mnt/mds1 failed: Input/output error
      onyx-46vm7: Is the MGS running?
      Start of /dev/lvm-Role_MDS/P1 on mds1 failed 5
      

      Attachments

        Issue Links

          Activity

            [LU-5420] Failure on test suite sanity test_17m: mount MDS failed, Input/output error
            pjones Peter Jones added a comment -

            Landed for 2.8

            pjones Peter Jones added a comment - Landed for 2.8

            Andreas Dilger (andreas.dilger@intel.com) merged in patch http://review.whamcloud.com/11258/
            Subject: LU-5420 mgc: MGC should retry for invalid import
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 77d406a0699307e8e633ef41f8984f45c09db9b8

            gerrit Gerrit Updater added a comment - Andreas Dilger (andreas.dilger@intel.com) merged in patch http://review.whamcloud.com/11258/ Subject: LU-5420 mgc: MGC should retry for invalid import Project: fs/lustre-release Branch: master Current Patch Set: Commit: 77d406a0699307e8e633ef41f8984f45c09db9b8
            di.wang Di Wang added a comment -

            For now, you can use this http://review.whamcloud.com/13838/ , but that only reverts the patch (http://review.whamcloud.com/#/c/9967/)which cause the problem, and not real fix.

            There are patches trying to fix this problem, but none of them are satisfied by everyone. So leave it to 2.8 for now.
            http://review.whamcloud.com/13693
            http://review.whamcloud.com/11258

            di.wang Di Wang added a comment - For now, you can use this http://review.whamcloud.com/13838/ , but that only reverts the patch ( http://review.whamcloud.com/#/c/9967/)which cause the problem, and not real fix. There are patches trying to fix this problem, but none of them are satisfied by everyone. So leave it to 2.8 for now. http://review.whamcloud.com/13693 http://review.whamcloud.com/11258

            I see many patches for this. Which patches are valid?

            simmonsja James A Simmons added a comment - I see many patches for this. Which patches are valid?

            Yes, my apologies.

            jlevi Jodi Levi (Inactive) added a comment - Yes, my apologies.
            pjones Peter Jones added a comment -

            Actually Jodi the patches for master are still in flight. It is simply a workaround fix that has landed to b2_7

            pjones Peter Jones added a comment - Actually Jodi the patches for master are still in flight. It is simply a workaround fix that has landed to b2_7

            Patches landed to Master.

            jlevi Jodi Levi (Inactive) added a comment - Patches landed to Master.

            Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/13838/
            Subject: LU-5420 ptlrpc: revert ptlrpc_reconnect_import() changes
            Project: fs/lustre-release
            Branch: b2_7
            Current Patch Set:
            Commit: 02739a078f54b5ccdf49456fd0d1daea90472a8d

            gerrit Gerrit Updater added a comment - Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/13838/ Subject: LU-5420 ptlrpc: revert ptlrpc_reconnect_import() changes Project: fs/lustre-release Branch: b2_7 Current Patch Set: Commit: 02739a078f54b5ccdf49456fd0d1daea90472a8d

            Oleg Drokin (oleg.drokin@intel.com) uploaded a new patch: http://review.whamcloud.com/13838
            Subject: LU-5420 revert part of LU-4913
            Project: fs/lustre-release
            Branch: b2_7
            Current Patch Set: 1
            Commit: 77856caa2468dd69cfa5796bceb22c32aacf402f

            gerrit Gerrit Updater added a comment - Oleg Drokin (oleg.drokin@intel.com) uploaded a new patch: http://review.whamcloud.com/13838 Subject: LU-5420 revert part of LU-4913 Project: fs/lustre-release Branch: b2_7 Current Patch Set: 1 Commit: 77856caa2468dd69cfa5796bceb22c32aacf402f

            Oleg Drokin (oleg.drokin@intel.com) uploaded a new patch: http://review.whamcloud.com/13832
            Subject: LU-5420 revert part of LU-4913
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 260e150f98f07fa68fb124348ca9540e77fed100

            gerrit Gerrit Updater added a comment - Oleg Drokin (oleg.drokin@intel.com) uploaded a new patch: http://review.whamcloud.com/13832 Subject: LU-5420 revert part of LU-4913 Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 260e150f98f07fa68fb124348ca9540e77fed100

            People

              di.wang Di Wang
              sarah Sarah Liu
              Votes:
              0 Vote for this issue
              Watchers:
              16 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: