Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-6831

The ticket for tracking all DNE2 bugs

Details

    • Bug
    • Resolution: Unresolved
    • Minor
    • None
    • Lustre 2.8.0, Lustre 2.9.0
    • 3
    • 9223372036854775807

    Description

      This ticket is for tracking all of DNE2 bugs.

      Attachments

        Issue Links

          Activity

            [LU-6831] The ticket for tracking all DNE2 bugs
            simmonsja James A Simmons added a comment - For my DNE2 testing here is the list of patches I running against: http://review.whamcloud.com/#/c/14346 http://review.whamcloud.com/#/c/14747 http://review.whamcloud.com/#/c/15594 http://review.whamcloud.com/#/c/15720 http://review.whamcloud.com/#/c/15576 http://review.whamcloud.com/#/c/15730 http://review.whamcloud.com/#/c/15692 http://review.whamcloud.com/#/c/15691 http://review.whamcloud.com/#/c/15682 http://review.whamcloud.com/#/c/15690 http://review.whamcloud.com/#/c/15721 http://review.whamcloud.com/#/c/15724 http://review.whamcloud.com/#/c/15728 http://review.whamcloud.com/#/c/15770

            Yes LU-6831 helped with the revalidate FID bug.

            simmonsja James A Simmons added a comment - Yes LU-6831 helped with the revalidate FID bug.

            James: Please try this patch to see if it works? thanks. Unfortunately, I can not reproduce this problem locally.

            di.wang Di Wang (Inactive) added a comment - James: Please try this patch to see if it works? thanks. Unfortunately, I can not reproduce this problem locally.

            wangdi (di.wang@intel.com) uploaded a new patch: http://review.whamcloud.com/15720
            Subject: LU-6831 lmv: revalidate the dentry for striped dir
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 26133ac386693647ef3ee4480a6ce0cbb2773adb

            gerrit Gerrit Updater added a comment - wangdi (di.wang@intel.com) uploaded a new patch: http://review.whamcloud.com/15720 Subject: LU-6831 lmv: revalidate the dentry for striped dir Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 26133ac386693647ef3ee4480a6ce0cbb2773adb

            I see what is triggering the client evictions. I'm getting these errors on the clients:

            LustreError: 10306:0:(lmv_intent.c:234:lmv_revalidate_slaves()) sultan-clilmv-ffff8803ea284c00: nlink 1 < 2 corrupt stripe 1 [0x2800013ba:0x84ad:0x0]:[0x2400013c8:0x84ad:0x0]
            LustreError: 10306:0:(file.c:3104:ll_inode_revalidate_fini()) sultan: revalidate FID [0x2400013ce:0x3:0x0] error: rc = -5
            LustreError: 10316:0:(lmv_intent.c:234:lmv_revalidate_slaves()) sultan-clilmv-ffff8803ea284c00: nlink 1 < 2 corrupt stripe 0 [0x2400013b3:0x14262:0x0]:[0x2400013b3:0x14262:0x0]
            LustreError: 10863:0:(lmv_intent.c:234:lmv_revalidate_slaves()) sultan-clilmv-ffff8803ea284c00: nlink 1 < 2 corrupt stripe 0 [0x2400013b3:0x14262:0x0]:[0x2400013b3:0x14262:0x0]
            LustreError: 10894:0:(lmv_intent.c:234:lmv_revalidate_slaves()) sultan-clilmv-ffff8803ea284c00: nlink 1 < 2 corrupt stripe 1 [0x2800013ab:0x14262:0x0]:[0x2400013b3:0x14262:0x0]
            LustreError: 10894:0:(file.c:3104:ll_inode_revalidate_fini()) sultan: revalidate FID [0x2400013a4:0x5:0x0] error: rc = -5

            simmonsja James A Simmons added a comment - I see what is triggering the client evictions. I'm getting these errors on the clients: LustreError: 10306:0:(lmv_intent.c:234:lmv_revalidate_slaves()) sultan-clilmv-ffff8803ea284c00: nlink 1 < 2 corrupt stripe 1 [0x2800013ba:0x84ad:0x0] : [0x2400013c8:0x84ad:0x0] LustreError: 10306:0:(file.c:3104:ll_inode_revalidate_fini()) sultan: revalidate FID [0x2400013ce:0x3:0x0] error: rc = -5 LustreError: 10316:0:(lmv_intent.c:234:lmv_revalidate_slaves()) sultan-clilmv-ffff8803ea284c00: nlink 1 < 2 corrupt stripe 0 [0x2400013b3:0x14262:0x0] : [0x2400013b3:0x14262:0x0] LustreError: 10863:0:(lmv_intent.c:234:lmv_revalidate_slaves()) sultan-clilmv-ffff8803ea284c00: nlink 1 < 2 corrupt stripe 0 [0x2400013b3:0x14262:0x0] : [0x2400013b3:0x14262:0x0] LustreError: 10894:0:(lmv_intent.c:234:lmv_revalidate_slaves()) sultan-clilmv-ffff8803ea284c00: nlink 1 < 2 corrupt stripe 1 [0x2800013ab:0x14262:0x0] : [0x2400013b3:0x14262:0x0] LustreError: 10894:0:(file.c:3104:ll_inode_revalidate_fini()) sultan: revalidate FID [0x2400013a4:0x5:0x0] error: rc = -5

            James: What test did you run? Do you have the trace? I am not sure if there are such fixes. Thanks.

            di.wang Di Wang (Inactive) added a comment - James: What test did you run? Do you have the trace? I am not sure if there are such fixes. Thanks.

            Now I'm seeing clients get evicted during heavy meta data operations. Di Wang have you seen this behavior and does a patch exist to address this?

            simmonsja James A Simmons added a comment - Now I'm seeing clients get evicted during heavy meta data operations. Di Wang have you seen this behavior and does a patch exist to address this?

            Doing more testing I found that the patch from LU-6341 was not the source of the bug but it exposed the issue reported in LU-6713. Once I applied patch 15269 the problem went away.

            simmonsja James A Simmons added a comment - Doing more testing I found that the patch from LU-6341 was not the source of the bug but it exposed the issue reported in LU-6713 . Once I applied patch 15269 the problem went away.

            Nope. I found the source of the problems. It was the patch from LU-6341. Once I removed it the problems went away.

            simmonsja James A Simmons added a comment - Nope. I found the source of the problems. It was the patch from LU-6341 . Once I removed it the problems went away.

            Strange, does your build include this patch http://review.whamcloud.com/#/c/15269/ ? If it does, please remove this one. and retry? thanks

            di.wang Di Wang (Inactive) added a comment - Strange, does your build include this patch http://review.whamcloud.com/#/c/15269/ ? If it does, please remove this one. and retry? thanks
            simmonsja James A Simmons added a comment - http://review.whamcloud.com/#/c/13990 http://review.whamcloud.com/#/c/15572 http://review.whamcloud.com/#/c/15576

            People

              di.wang Di Wang (Inactive)
              di.wang Di Wang (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              14 Start watching this issue

              Dates

                Created:
                Updated: