[LU-12450] MDT ZFS deadlock (task z_wr_iss blocked for more than 120 seconds) Created: 18/Jun/19 Updated: 16/Sep/19 |
|
| Status: | Open |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.10.5, Lustre 2.12.2 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Minor |
| Reporter: | Aurelien Degremont (Inactive) | Assignee: | WC Triage |
| Resolution: | Unresolved | Votes: | 0 |
| Labels: | None |
| Severity: | 3 |
| Rank (Obsolete): | 9223372036854775807 |
| Description |
| Comments |
| Comment by Andreas Dilger [ 18/Jun/19 ] |
|
I can't say for sure, but it looks like this bug is purely in the ZFS code. As such, it is probably better filed as an issue in github/zfsonlinux. |
| Comment by Aurelien Degremont (Inactive) [ 18/Jun/19 ] |
|
I was curious to know your point of view on that. I've seen issues in the past that came from the way the OSD was using LDISKFS or ZFS, not from the underlying filesystem itself. But indeed, it looks like a ZFS issue. |
| Comment by James A Simmons [ 22/Jul/19 ] |
|
Can you try patch https://review.whamcloud.com/#/c/35524/ ? |
| Comment by Aurelien Degremont (Inactive) [ 16/Sep/19 ] |
|
For the record, this was related to a kernel bug in the 4.14 kernel branch that was fixed by this commit:

commit 8a5e7aeffc81ed2f1f5371eee7a0b019b37fb13a
Author: Waiman Long <longman@redhat.com>
Date: Sun Apr 28 17:25:38 2019 -0400

    locking/rwsem: Prevent decrement of reader count before increment

    [ Upstream commit a9e9bcb45b1525ba7aea26ed9441e8632aeeda58 ] |