Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-9522

sanity test 27z is failing with 'test_27z failed with 6'

Details

    • Bug
    • Resolution: Fixed
    • Critical
    • Lustre 2.10.0
    • Lustre 2.10.0
    • None
    • autotest
    • 3
    • 9223372036854775807

    Description

      sanity test_27z is failing with the error

      'test_27z failed with 6' 
      

      From the test_log, we see

      Started lustre-OST0001
      /mnt/lustre-ost2/O/0/d3/1667: parent=[0x200000bd0:0x40cc:0x0] stripe=0 stripe_size=1048576 stripe_count=2
      want: stripe:1 ost:0 oid:858/0x35a seq:0
      /mnt/lustre-ost2/O/0/d3/1667: parent=[0x200000bd0:0x40cc:0x0] stripe=0 stripe_size=1048576 stripe_count=2
       sanity test_27z: @@@@@@ FAIL: FF stripe 0 != 1 
      

      This test is only failing for review-zfs-part-1.

      sanity test 27z started failing with this error on May 16, 2017. The first few failure logs are at
      https://testing.hpdd.intel.com/test_sets/4f3d47bc-3a42-11e7-b81e-5254006e85c2
      https://testing.hpdd.intel.com/test_sets/6d1be1a2-3a43-11e7-b861-5254006e85c2
      https://testing.hpdd.intel.com/test_sets/2a85c6c6-3a5e-11e7-b3f6-5254006e85c2

      Attachments

        Issue Links

          Activity

            [LU-9522] sanity test 27z is failing with 'test_27z failed with 6'
            pjones Peter Jones added a comment -

            Landed for 2.10

            pjones Peter Jones added a comment - Landed for 2.10

            Andreas Dilger (andreas.dilger@intel.com) merged in patch https://review.whamcloud.com/27163/
            Subject: LU-9522 test: sanity 27z fix update local variable
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: e6e0d59bb3af2cad16495059aa23675e6a3cffb0

            gerrit Gerrit Updater added a comment - Andreas Dilger (andreas.dilger@intel.com) merged in patch https://review.whamcloud.com/27163/ Subject: LU-9522 test: sanity 27z fix update local variable Project: fs/lustre-release Branch: master Current Patch Set: Commit: e6e0d59bb3af2cad16495059aa23675e6a3cffb0
            bogl Bob Glossman (Inactive) added a comment - - edited more on master: https://testing.hpdd.intel.com/test_sets/b573cb72-3b56-11e7-b3f6-5254006e85c2 https://testing.hpdd.intel.com/test_sets/f864ac5e-3c18-11e7-b3f6-5254006e85c2

            The test is failing due to https://review.whamcloud.com/#/c/26569/
            Change-Id: Ia971e33cc3d8a5e4ca6f821116f12c0a72bcab07

            jamesanunez James Nunez (Inactive) added a comment - The test is failing due to https://review.whamcloud.com/#/c/26569/ Change-Id: Ia971e33cc3d8a5e4ca6f821116f12c0a72bcab07

            James Nunez (james.a.nunez@intel.com) uploaded a new patch: https://review.whamcloud.com/27163
            Subject: LU-9522 test: sanity 27z fix update local variable
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 558cdb7b540e1eafed4ec637ca1b6a756bc3a0b6

            gerrit Gerrit Updater added a comment - James Nunez (james.a.nunez@intel.com) uploaded a new patch: https://review.whamcloud.com/27163 Subject: LU-9522 test: sanity 27z fix update local variable Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 558cdb7b540e1eafed4ec637ca1b6a756bc3a0b6

            I think I know what the issue is and it looks like a test only issue. The ff variable in the test is not reset for each iteration and, thus, the ff variable is not updated:

            1864         $GETSTRIPE $1 | while read obdidx oid hex seq; do
            ...
            1883 
            1884                 local ff
            1885                 #
            1886                 # Don't unmount/remount the OSTs if we don't need to do that.
            1887                 # LU-2577 changes filter_fid to be smaller, so debugfs needs
            1888                 # update too, until that use mount/ll_decode_filter_fid/mount.
            1889                 # Re-enable when debugfs will understand new filter_fid.
            1890                 #
            1891                 if [ $(facet_fstype ost$ost) == ldiskfs ]; then
            1892                         ff=$(do_facet ost$ost "$DEBUGFS -c -R 'stat $obj_file' \
            1893                                 $dev 2>/dev/null" | grep "parent=")
            1894                 fi
            1895                 if [ -z "$ff" ]; then
            1896                         stop ost$ost
            1897                         mount_fstype ost$ost
            1898                         ff=$(do_facet ost$ost $LL_DECODE_FILTER_FID \
            1899                                 $(facet_mntpt ost$ost)/$obj_file)
            1900                         unmount_fstype ost$ost
            1901                         start ost$ost $dev $OST_MOUNT_OPTS
            1902                         clients_up
            1903                 fi
            1904 
            ...
            

            I will upload a patch to make sure this is the issue.

            jamesanunez James Nunez (Inactive) added a comment - I think I know what the issue is and it looks like a test only issue. The ff variable in the test is not reset for each iteration and, thus, the ff variable is not updated: 1864 $GETSTRIPE $1 | while read obdidx oid hex seq; do ... 1883 1884 local ff 1885 # 1886 # Don 't unmount/remount the OSTs if we don' t need to do that. 1887 # LU-2577 changes filter_fid to be smaller, so debugfs needs 1888 # update too, until that use mount/ll_decode_filter_fid/mount. 1889 # Re-enable when debugfs will understand new filter_fid. 1890 # 1891 if [ $(facet_fstype ost$ost) == ldiskfs ]; then 1892 ff=$(do_facet ost$ost "$DEBUGFS -c -R 'stat $obj_file' \ 1893 $dev 2>/dev/ null " | grep " parent=") 1894 fi 1895 if [ -z "$ff" ]; then 1896 stop ost$ost 1897 mount_fstype ost$ost 1898 ff=$(do_facet ost$ost $LL_DECODE_FILTER_FID \ 1899 $(facet_mntpt ost$ost)/$obj_file) 1900 unmount_fstype ost$ost 1901 start ost$ost $dev $OST_MOUNT_OPTS 1902 clients_up 1903 fi 1904 ... I will upload a patch to make sure this is the issue.

            People

              jamesanunez James Nunez (Inactive)
              jamesanunez James Nunez (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: