Details

    • Bug
    • Resolution: Fixed
    • Blocker
    • Lustre 2.8.0
    • Lustre 2.7.0
    • None
    • 3
    • 9223372036854775807

    Description

      As mentioned in LU-6683, I ran into a situation where lctl lfsck_stop just hangs indefinitely.

      I have managed to reproduce this twice:

      Start lfsck (using lctl lfsck_start -M play01-MDT0000 -t layout); this crashes the OSS servers. Reboot the servers and restart the OSTs. Attempting to stop the lfsck in this state just hangs: I waited more than an hour and it was still hanging. Unmounting the MDT in this situation also appears to hang (after 30 minutes I power-cycled the MDS).
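      When reproducing a hang like the one described above, it can help to bound how long the stop command is allowed to run instead of waiting at a terminal. The following is a minimal sketch, not part of the original report: the helper name run_with_deadline is hypothetical, and it only uses Python's standard subprocess module. Note that a process blocked in an uninterruptible kernel wait may not react to the kill signal, so the helper deliberately avoids waiting on the child without a limit.

```python
import subprocess


def run_with_deadline(cmd, seconds):
    """Run cmd and return its exit code, or None if it did not exit in time.

    Hypothetical helper for illustration; cmd is an argument list such as
    ["lctl", "lfsck_stop", "-M", "play01-MDT0000"].
    """
    proc = subprocess.Popen(cmd)
    try:
        return proc.wait(timeout=seconds)
    except subprocess.TimeoutExpired:
        # Ask the kernel to kill the child, but never block on it forever:
        # a process stuck in an uninterruptible wait can ignore SIGKILL.
        proc.kill()
        try:
            proc.wait(timeout=1)
        except subprocess.TimeoutExpired:
            pass
        return None
```

      Calling run_with_deadline(["lctl", "lfsck_stop", "-M", "play01-MDT0000"], 60) would then return None if the stop request is still hanging after a minute. The -M argument here is assumed to mirror the lfsck_start invocation in the description.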

      Attachments

        1. 15.lctl.tgz
          631 kB
        2. lustre.dmesg.bz2
          37 kB
        3. lustre.log.bz2
          1.38 MB

          Activity

            [LU-6684] lctl lfsck_stop hangs

            yong.fan nasf (Inactive) added a comment -

            Another instance found for tag 2.7.66 for Full - EL6.7 Server/EL6.7 Client
            On master, build# 3314
            https://testing.hpdd.intel.com/test_sets/35490a0c-ca6e-11e5-9215-5254006e85c2
            Date: 02/02/2016 Time: 9:20 am MST

            Patch 18082 landed just after the new tag 2.7.66; please test the latest master.

            standan Saurabh Tandan (Inactive) added a comment -

            Another instance found for tag 2.7.66 for Full - EL6.7 Server/EL6.7 Client
            On master, build# 3314
            https://testing.hpdd.intel.com/test_sets/35490a0c-ca6e-11e5-9215-5254006e85c2
            Date: 02/02/2016 Time: 9:20 am MST

            yong.fan nasf (Inactive) added a comment - The patch has been landed to master.

            gerrit Gerrit Updater added a comment -

            Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/18082/
            Subject: LU-6684 lfsck: set the lfsck notify as interruptable
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 069a9cf551c2e985ea254a1c570b22ed1d72d914

            yujian Jian Yu added a comment - This is blocking patch review testing on master branch:
            https://testing.hpdd.intel.com/test_sets/a29caebe-c709-11e5-9b6d-5254006e85c2
            https://testing.hpdd.intel.com/test_sets/fbfee2be-c70f-11e5-a037-5254006e85c2

            bogl Bob Glossman (Inactive) added a comment - another on master: https://testing.hpdd.intel.com/test_sets/150c07e2-c575-11e5-825e-5254006e85c2

            simmonsja James A Simmons added a comment - This is also delaying the landing of several patches.
            bogl Bob Glossman (Inactive) added a comment - another on master: https://testing.hpdd.intel.com/test_sets/85d45ece-c0bc-11e5-9620-5254006e85c2

            gerrit Gerrit Updater added a comment -

            Fan Yong (fan.yong@intel.com) uploaded a new patch: http://review.whamcloud.com/18082
            Subject: LU-6684 lfsck: set the lfsck notify as interruptable
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 68c078328be253735658fcf43fa98afff936ec6c

            gerrit Gerrit Updater added a comment -

            James Nunez (james.a.nunez@intel.com) uploaded a new patch: http://review.whamcloud.com/18059
            Subject: Revert "LU-6684 lfsck: stop lfsck even if some servers offline"
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 2505fd07b29ebfddcd29f16954908f6fe4670276

            People

              yong.fan nasf (Inactive)
              ferner Frederik Ferner (Inactive)