Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-17658

sanity check when ofd assign a new sequence to osp

Details

    • Improvement
    • Resolution: Fixed
    • Minor
    • Lustre 2.17.0
    • None
    • None
    • 3
    • 9223372036854775807

    Description

      In the case of ofd loses the commit to update sequence range stored in seq_srv,
      ofd could assign the same sequence to different osp, creating corruption in the filesystem.

      Attachments

        Issue Links

          Activity

            [LU-17658] sanity check when ofd assign a new sequence to osp
            pjones Peter Jones added a comment -

            Merged for 2.17

            pjones Peter Jones added a comment - Merged for 2.17
            gerrit Gerrit Updater added a comment -

            "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/54474/
            Subject: LU-17658 fid: check on disk sequence before allocating to osp
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 373b76b345b5eaa21b201315075f01ccfc408189

            gerrit Gerrit Updater added a comment - "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/54474/ Subject: LU-17658 fid: check on disk sequence before allocating to osp Project: fs/lustre-release Branch: master Current Patch Set: Commit: 373b76b345b5eaa21b201315075f01ccfc408189
            adilger Andreas Dilger added a comment - - edited

            I also noticed that conf-sanity test_155 started taking much longer after patch https://review.whamcloud.com/54191 ("LU-17538 fid: do not use allocation set for ofd") landed on 2025-01-02 (average 400s) compared to before landing (average 50s).

            There were two different patches with conf-sanity test_155 and each had significantly different test durations, causing the subtest_trend report to be confused.

             

            adilger Andreas Dilger added a comment - - edited I also noticed that conf-sanity test_155 started taking much longer after patch https://review.whamcloud.com/54191 ( " LU-17538 fid: do not use allocation set for ofd" ) landed on 2025-01-02 (average 400s) compared to before landing (average 50s). There were two different patches with conf-sanity test_155 and each had significantly different test durations, causing the subtest_trend report to be confused.  

            "Li Dongyang <dongyangli@ddn.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/54474
            Subject: LU-17658 fid: check on disk sequence before allocating to osp
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 4f7c8306d656e0381ea8098df00821489b05a69b

            gerrit Gerrit Updater added a comment - "Li Dongyang <dongyangli@ddn.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/54474 Subject: LU-17658 fid: check on disk sequence before allocating to osp Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 4f7c8306d656e0381ea8098df00821489b05a69b

            People

              dongyang Dongyang Li
              dongyang Dongyang Li
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: