sanity test 65i fails with 'find /mnt/lustre failed'

Details

    • Type: Bug
    • Resolution: Fixed
    • Priority: Minor
    • Fix Version/s: Lustre 2.13.0
    • Affects Version/s: Lustre 2.12.0, Lustre 2.13.0, Lustre 2.12.1, Lustre 2.12.2, Lustre 2.12.3, Lustre 2.12.4
    • DNE/ZFS
    • Severity: 3

    Description

      sanity test 65i is failing for DNE with ZFS configurations. In the client test log at https://testing.whamcloud.com/test_sets/4ac91f66-e875-11e8-bfe1-52540065bddc, ‘lfs find’ fails with:

      lfs find: llapi_semantic_traverse: Failed to open '/mnt/lustre/d60g.sanity/subdir2': No such file or directory (2)
      error: find failed for /mnt/lustre.
       sanity test_65i: @@@@@@ FAIL: find /mnt/lustre failed 
      

      In the console logs, the only indication of a problem is on the client (vm7) console:

       [ 3432.108946] Lustre: DEBUG MARKER: == sanity test 65i: various tests to set root directory striping ===================================== 23:16:02 (1542237362)
      [ 3436.508827] LustreError: 662:0:(llite_lib.c:2390:ll_prep_inode()) new_inode -fatal: rc -2
      [ 3438.077315] Lustre: DEBUG MARKER: /usr/sbin/lctl mark  sanity test_65i: @@@@@@ FAIL: find \/mnt\/lustre failed 
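
The rc -2 in the LustreError line is -ENOENT, which matches the "No such file or directory (2)" reported by lfs find. When checking other console logs for the same signature, something like the following could pull out the LustreError lines logged while a given test was running. This is only a sketch: the function name errors_during_test is made up for illustration, and it assumes every test start is announced by a "Lustre: DEBUG MARKER: == sanity test <name>:" line, as in the excerpt above.

```shell
#!/bin/bash
# Hypothetical helper (not part of the Lustre test framework).
# Prints LustreError lines that appear between the start marker of the
# named sanity test and the start marker of the next test.
# usage: errors_during_test <test-name> < console.log
errors_during_test() {
    local name=$1

    awk -v start="== sanity test $name:" '
        # Each test start marker opens or closes the window of interest.
        /Lustre: DEBUG MARKER: == / { in_test = (index($0, start) > 0) }
        # Inside the window, report kernel-side Lustre errors.
        in_test && /LustreError:/   { print }
    '
}
```

For example, `errors_during_test 65i < console.log` would be expected to print the ll_prep_inode() line shown above, where console.log stands for a saved copy of the client console log.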
      

      We’ve seen this test fail a total of three times since November 14, 2018.
      https://testing.whamcloud.com/test_sets/da7b847a-ebb1-11e8-86c0-52540065bddc
      https://testing.whamcloud.com/test_sets/5f393496-eba8-11e8-bfe1-52540065bddc

          Activity

            [LU-11681] sanity test 65i fails with 'find /mnt/lustre failed'

            gerrit Gerrit Updater added a comment -

            James Nunez (jnunez@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/34687
            Subject: LU-11681 tests: stop running sanity 65i
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 430ec3ba2cc71fb99e8a2fc80adae4638f4f928f

            mdiep Minh Diep added a comment - +1 on b2_12: https://testing.whamcloud.com/test_sets/503715b8-54bb-11e9-8e92-52540065bddc

            gerrit Gerrit Updater added a comment -

            Lai Siyao (lai.siyao@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/33930
            Subject: LU-11681 lmv: disable remote file statahead
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: cd9447f033d284d7bcbb0c60118af1605f740107

            gerrit Gerrit Updater added a comment -

            Lai Siyao (lai.siyao@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/33929
            Subject: LU-11681 llite: clear lsm if dir lost its LMV
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: d451f4ee49eec34f56a5bebc1795bf1f9123833e

            gerrit Gerrit Updater added a comment -

            Lai Siyao (lai.siyao@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/33928
            Subject: LU-11681 lfsck: misc fixes in inserting shard
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: abea2a3add2fd4d0022b76f53a5aacc6c25fdc96

            gerrit Gerrit Updater added a comment -

            Lai Siyao (lai.siyao@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/33927
            Subject: LU-11681 lfsck: misc fixes for dangling entry repair
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: eb81609cb56f4f3c452d6d2adcb3192a2008e09b

            gerrit Gerrit Updater added a comment -

            Lai Siyao (lai.siyao@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/33926
            Subject: LU-11681 lfsck: read LMV from bottom object
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 12a5b93757f61da46df72c56cb2bcbb157786e41

            laisiyao Lai Siyao added a comment -

            Okay, I'll look into it soon.


            adilger Andreas Dilger added a comment -

            The sanity test_60g was added in patch https://review.whamcloud.com/33401 "LU-11418 llog: refresh remote llog upon -ESTALE", which landed on 2018-11-13.

            It looks like test_60g has left behind a broken directory stub as a result of what the test does. At a minimum, the test should be modified to try to remove its test directory at the end, so that the failure is localized to the test that introduced it. We may need to use "lfs rmentry" to delete the partially created directory entry.

            There were 9 failures in the past 4 weeks, so the failure rate isn't high, but this could likely be addressed fairly easily.
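
The cleanup suggested in this comment might look roughly like the sketch below. It is not the change that landed and is not taken from sanity.sh: the function name cleanup_test_dir and the LFS variable default are assumptions for illustration, and it has not been tried against a real dangling entry. The idea is to attempt a normal recursive removal first and, only if that fails, fall back to "lfs rmentry" on whatever names are still listed in the directory.

```shell
#!/bin/bash
# Hypothetical cleanup helper; names here are illustrative only.
# LFS can be overridden to point at a specific lfs binary.
LFS=${LFS:-lfs}

cleanup_test_dir() {
    local dir=$1 name

    # Normal case: the whole test directory can be removed.
    rm -rf -- "$dir" 2>/dev/null && return 0

    # Fallback: rm could not empty the directory, possibly because an
    # entry is listed but cannot be opened. Try to drop each leftover
    # name entry with "lfs rmentry" (requires administrator rights).
    ls -A -- "$dir" 2>/dev/null | while IFS= read -r name; do
        "$LFS" rmentry "$dir/$name" ||
            echo "could not remove entry $dir/$name" >&2
    done

    # Succeeds only if the directory is now empty.
    rmdir -- "$dir"
}
```

Calling such a helper at the end of the test that creates the directory would keep any failure attributed to that test rather than to a later one such as 65i, which walks the whole mount point.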

            People

              Assignee: laisiyao Lai Siyao
              Reporter: jamesanunez James Nunez (Inactive)
              Votes: 0
              Watchers: 7