[LU-13242] sanity-pfl: WARNING: MMP writes to pool have not succeeded in over 60s; suspending pool Created: 11/Feb/20  Updated: 23/Nov/21  Resolved: 23/Nov/21

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Andreas Dilger Assignee: WC Triage
Resolution: Duplicate Votes: 0
Labels: None

Issue Links:
Related
is related to LU-10956 sanity-pfl test_3: Kernel panic - not... Open
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for Andreas Dilger <adilger@whamcloud.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/c739f9ca-4cd4-11ea-aeb7-52540065bddc

All if the server consoles show the same message:

WARNING: MMP writes to pool 'lustre-ost8' have not succeeded in over 60s; suspending pool
WARNING: MMP writes to pool 'lustre-ost2' have not succeeded in over 60s; suspending pool

and the node panics due to config settings.

My thinking is that this is a problem with the VM host machine, and does not indicate a problem with Lustre.



 Comments   
Comment by Alex Zhuravlev [ 22/Nov/21 ]

another test, but the same cause:
https://testing.whamcloud.com/test_sessions/70071ef5-bb3b-46f1-9d95-e5af24267969

Comment by Alex Zhuravlev [ 22/Nov/21 ]

I hit this problem few times a week and all my testing runs in tmpfs, so this is not slow/broken disks.

Comment by Alexey Lyashkov [ 22/Nov/21 ]

this error global. I hit same errors in different tests. see LU-15261
[13583.487544] WARNING: MMP writes to pool 'lustre-ost3' have not succeeded in over 60463 ms; suspending pool. Hrtime 13583487508131

but it looks better to merge it into single one.

Comment by Andreas Dilger [ 23/Nov/21 ]

Close as a duplicate of LU-10956

Generated at Sat Feb 10 02:59:37 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.