Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-13242

sanity-pfl: WARNING: MMP writes to pool have not succeeded in over 60s; suspending pool

Details

    • Bug
    • Resolution: Duplicate
    • Minor
    • None
    • None
    • None
    • 3
    • 9223372036854775807

    Description

      This issue was created by maloo for Andreas Dilger <adilger@whamcloud.com>

      This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/c739f9ca-4cd4-11ea-aeb7-52540065bddc

      All if the server consoles show the same message:

      WARNING: MMP writes to pool 'lustre-ost8' have not succeeded in over 60s; suspending pool
      WARNING: MMP writes to pool 'lustre-ost2' have not succeeded in over 60s; suspending pool
      

      and the node panics due to config settings.

      My thinking is that this is a problem with the VM host machine, and does not indicate a problem with Lustre.

      Attachments

        Issue Links

          Activity

            [LU-13242] sanity-pfl: WARNING: MMP writes to pool have not succeeded in over 60s; suspending pool
            adilger Andreas Dilger made changes -
            Link New: This issue is related to ATM-2232 [ ATM-2232 ]
            adilger Andreas Dilger made changes -
            Resolution New: Duplicate [ 3 ]
            Status Original: Open [ 1 ] New: Resolved [ 5 ]

            Close as a duplicate of LU-10956

            adilger Andreas Dilger added a comment - Close as a duplicate of LU-10956

            this error global. I hit same errors in different tests. see LU-15261
            [13583.487544] WARNING: MMP writes to pool 'lustre-ost3' have not succeeded in over 60463 ms; suspending pool. Hrtime 13583487508131

            but it looks better to merge it into single one.

            shadow Alexey Lyashkov added a comment - this error global. I hit same errors in different tests. see LU-15261 [13583.487544] WARNING: MMP writes to pool 'lustre-ost3' have not succeeded in over 60463 ms; suspending pool. Hrtime 13583487508131 but it looks better to merge it into single one.

            I hit this problem few times a week and all my testing runs in tmpfs, so this is not slow/broken disks.

            bzzz Alex Zhuravlev added a comment - I hit this problem few times a week and all my testing runs in tmpfs, so this is not slow/broken disks.
            bzzz Alex Zhuravlev added a comment - another test, but the same cause: https://testing.whamcloud.com/test_sessions/70071ef5-bb3b-46f1-9d95-e5af24267969
            jamesanunez James Nunez (Inactive) made changes -
            Link New: This issue is related to LU-10956 [ LU-10956 ]
            mdiep Minh Diep made changes -
            Reporter Original: Maloo [ maloo ] New: Andreas Dilger [ adilger ]
            maloo Maloo created issue -

            People

              wc-triage WC Triage
              adilger Andreas Dilger
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: