Details

    • Bug
    • Resolution: Duplicate
    • Minor
    • None
    • Lustre 2.7.0, Lustre 2.8.0
    • client and server: lustre-master build # 2856
      server is zfs
    • 3
    • 17498

    Description

      This issue was created by maloo for sarah <sarah@whamcloud.com>

      Please provide additional information about the failure here.

      This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/abe0dac2-b2d8-11e4-b42d-5254006e85c2.

      The only log I could find is this:

      CMD: onyx-45vm4 mkdir -p /mnt/ost1; mount -t lustre lustre-ost1/ost1 /mnt/ost1
      onyx-45vm4: mount.lustre: mount lustre-ost1/ost1 at /mnt/ost1 failed: Cannot send after transport endpoint shutdown
      

          Activity

            [LU-6248] sanity-scrub: Failed mount OST

            Closing as a duplicate of LU-7193.

            jamesanunez James Nunez (Inactive) added a comment

            Note that the instances that Saurabh listed above are all using ldiskfs, not zfs, and, as noted, they are all interop issues. Should we open a new ticket for the ldiskfs/interop failures?

            jamesanunez James Nunez (Inactive) added a comment (edited)

            Another instance found for interop: EL6.7 Server/2.5.5 Client
            Server: master, build# 3303, RHEL 6.7
            Client: 2.5.5, b2_5_fe/62
            https://testing.hpdd.intel.com/test_sets/2af59388-bad6-11e5-9137-5254006e85c2

            standan Saurabh Tandan (Inactive) added a comment (edited)

            Another instance found for interop: EL6.7 Server/2.7.1 Client
            Server: master, build# 3303, RHEL 6.7
            Client: 2.7.1, b2_7_fe/34
            https://testing.hpdd.intel.com/test_sets/3a41f816-bad8-11e5-87b4-5254006e85c2

            standan Saurabh Tandan (Inactive) added a comment (edited)

            Another instance for interop: EL7 Server/2.7.1 Client
            Server: master, build# 3303, RHEL 7
            Client: 2.7.1, b2_7_fe/34
            https://testing.hpdd.intel.com/test_sets/606aa416-bac4-11e5-9137-5254006e85c2

            standan Saurabh Tandan (Inactive) added a comment

            Generally, we can check the fstype earlier, but even if we skip sanity-scrub for the zfs-based backend in time, we will still hit "Failed mount OST" in some subsequent test. The failure is not caused by OI scrub, whether through a fail_loc injection (in sanity-scrub.sh) or through an incorrect OI scrub repair; there must have been some trouble before the OI scrub ran.

            yong.fan nasf (Inactive) added a comment

            Saw it once here:
            https://testing.hpdd.intel.com/test_sets/784d3a32-b2a3-11e4-a8f5-5254006e85c2

            It happened before the test was SKIPPED. I then ran the test over 20 times but couldn't reproduce it.

            isaac Isaac Huang (Inactive) added a comment

            Actually it appears we are testing this a bit late:

            # build up a clean test environment.
            formatall
            setupall
            
            [ $(facet_fstype $SINGLEMDS) != "ldiskfs" ] &&
                    skip "test OI scrub only for ldiskfs" && check_and_cleanup_lustre &&
                    exit 0
            

            Should we move the checks before the reformat and remount, since for ZFS a too-small size might apparently be unsuitable or something?

            green Oleg Drokin added a comment
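The reordering suggested in this comment could look roughly like the sketch below, which checks the backend type of each facet before anything is formatted or mounted. It is only an illustration under stated assumptions: the helper bodies are hypothetical stand-ins so the snippet runs outside a Lustre test environment, `FAKE_FSTYPE` and `scrub_prologue` are invented names, the default of `mds1` for `SINGLEMDS` is assumed, and whether the real `facet_fstype` can be called before `setupall` is not established here. The `check_and_cleanup_lustre` call from the original snippet is left out because nothing has been set up yet at that point.

```shell
#!/bin/bash
# Hypothetical stand-ins for the test-framework.sh helpers so that this
# sketch runs outside a Lustre test environment; the real helpers differ.
SINGLEMDS=${SINGLEMDS:-mds1}
FAKE_FSTYPE=${FAKE_FSTYPE:-zfs}            # assumed backend for every facet
facet_fstype() { echo "$FAKE_FSTYPE"; }
skip()         { echo "SKIP: $*"; }
formatall()    { echo "formatall"; }
setupall()     { echo "setupall"; }

# Check the backend of each facet first, and only format and mount when
# every facet is ldiskfs.
scrub_prologue() {
    local facet
    for facet in "$SINGLEMDS" ost1; do
        if [ "$(facet_fstype "$facet")" != "ldiskfs" ]; then
            skip "test OI scrub only for ldiskfs"
            return 0    # the script itself would "exit 0" here
        fi
    done

    # build up a clean test environment.
    formatall
    setupall
}
```

With `FAKE_FSTYPE=zfs`, calling `scrub_prologue` prints only the SKIP line and never reaches `formatall`; with `FAKE_FSTYPE=ldiskfs` it formats and sets up as before. This only changes when the skip happens and would not by itself address a mount failure that has a different cause.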

            According to the test scripts, sanity-scrub will be skipped for zfs-based backends.

            [ $(facet_fstype $SINGLEMDS) != "ldiskfs" ] &&
                    skip "test OI scrub only for ldiskfs" && check_and_cleanup_lustre &&
                    exit 0
            [ $(facet_fstype ost1) != "ldiskfs" ] &&
                    skip "test OI scrub only for ldiskfs" && check_and_cleanup_lustre &&
                    exit 0
            
            yong.fan nasf (Inactive) added a comment

            People

              wc-triage WC Triage
              maloo Maloo