Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-6785

Interop 2.7.0<->master sanity test_56w: cannot swap layouts: Device or resource busy

Details

    • Bug
    • Resolution: Fixed
    • Critical
    • Lustre 2.8.0
    • Lustre 2.8.0
    • None
    • server: 2.7.0
      client: lustre-master build # 3071 EL7
    • 3
    • 9223372036854775807

    Description

      This issue was created by maloo for sarah_lw <wei3.liu@intel.com>

      This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/a260a392-1237-11e5-bd2d-5254006e85c2.

      The sub-test test_56w failed with the following error:

      /usr/bin/lfs migrate -i 0 /mnt/lustre/d56w.sanityw/migr_1_ost failed
      
      yes: standard output: Broken pipe
      yes: write error
      /usr/bin/lfs_migrate -y -c 6 /mnt/lustre/d56w.sanityw/file1
      /mnt/lustre/d56w.sanityw/file1: /usr/bin/lfs: /mnt/lustre/d56w.sanityw/file1: cannot swap layouts: Device or resource busy
      cannot put lease: No locks available (37)
      error: migrate: migrate stripe file '/mnt/lustre/d56w.sanityw/file1' failed
      falling back to rsync-based migration
      done
      

      Attachments

        Issue Links

          Activity

            [LU-6785] Interop 2.7.0<->master sanity test_56w: cannot swap layouts: Device or resource busy

            Landed for 2.8.0

            jgmitter Joseph Gmitter (Inactive) added a comment - Landed for 2.8.0

            Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/16238/
            Subject: LU-6785 utils: compatibility fix for lfs migrate
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: a99e42c9fa47677cd2468abfa9378d776cc40803

            gerrit Gerrit Updater added a comment - Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/16238/ Subject: LU-6785 utils: compatibility fix for lfs migrate Project: fs/lustre-release Branch: master Current Patch Set: Commit: a99e42c9fa47677cd2468abfa9378d776cc40803

            Henri Doreau (henri.doreau@cea.fr) uploaded a new patch: http://review.whamcloud.com/16238
            Subject: LU-6785 utils: compatibility fix for lfs migrate
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 0e4820031647405d939ccf43be8bc76426310b9f

            gerrit Gerrit Updater added a comment - Henri Doreau (henri.doreau@cea.fr) uploaded a new patch: http://review.whamcloud.com/16238 Subject: LU-6785 utils: compatibility fix for lfs migrate Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 0e4820031647405d939ccf43be8bc76426310b9f

            The file_lease_supported variable in lfs was supposed to address this. It is set automatically on failed attempts to get a file lease and it determines whether to fallback on group lock or not. Obviously there's something fishy, I can reproduce easily. I'm having a look.

            hdoreau Henri Doreau (Inactive) added a comment - The file_lease_supported variable in lfs was supposed to address this. It is set automatically on failed attempts to get a file lease and it determines whether to fallback on group lock or not. Obviously there's something fishy, I can reproduce easily. I'm having a look.

            It seems that this interop regression was added from http://review.whamcloud.com/10013 "LU-4840 lfs: Use file lease to implement migration" which landed back on 2015-05-28. This landed after the 2.5.54 tag was made (2015-05-17) and first appeared in the 2.7.55 tag (2015-06-10) .

            I'd guess that the new client is always trying to use the file lease lock, but that doesn't exist in 2.7.0 and earlier (interop failures are seen with 2.5.x also). Is there some mechanism by which the client can determine if the MDS has the file lease capability, and fall back to the previous group lock mechanism?

            adilger Andreas Dilger added a comment - It seems that this interop regression was added from http://review.whamcloud.com/10013 " LU-4840 lfs: Use file lease to implement migration" which landed back on 2015-05-28. This landed after the 2.5.54 tag was made (2015-05-17) and first appeared in the 2.7.55 tag (2015-06-10) . I'd guess that the new client is always trying to use the file lease lock, but that doesn't exist in 2.7.0 and earlier (interop failures are seen with 2.5.x also). Is there some mechanism by which the client can determine if the MDS has the file lease capability, and fall back to the previous group lock mechanism?

            People

              hdoreau Henri Doreau (Inactive)
              maloo Maloo
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: