Details

    • 3
    • Orion
    • 2992

    Description

      While running some single shared file IOR tests I noticed, with 'zpool iostat', that there were a lot of writes going on during the read phase. This was absolutely destroying performance and it wasn't at all clear to me why there should be any writes at all. IOR is just reading during this phase, no writes.

      On a hunch I set 'sync_on_lock_cancel=never' and the writes stopped entirely and performance improved at least 6x. I haven't looked at the code yet but I think there might be a few issues here.

      1) Why were so many lock revocations occurring during the read phase? I would have thought that each client would end up with a single concurrent read lock rather quickly. However, this behavior persisted for the 30 minutes it took to read all the data back.

      2) When revoking a read lock, or even a write lock which isn't actually covering any dirty data, there's no reason to issue the sync. I suspect this was never noticed because under ldiskfs, if there's no dirty data to sync, performing the sync is basically free. Under ZFS, however, calling sync forces the current txg to be written out. That will result in, at a minimum, the vdev labels at the beginning and end of each disk being written. Lots of syncs will result in each disk head basically seeking from the beginning to the end of the disk repeatedly. Very very bad.
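
      A minimal standalone sketch of the guard being suggested above (the type and
      function names here are hypothetical stand-ins, not the real Lustre OFD/OSD
      symbols): only issue the sync on lock cancellation when the cancelled lock is
      a write lock that actually covers dirty data.

      /* Illustration only -- hypothetical names, not the actual Lustre code. */
      #include <stdbool.h>
      #include <stdio.h>

      enum lock_mode { LOCK_PR, LOCK_PW };    /* protected read / protected write */

      struct extent_lock {
              enum lock_mode mode;
              unsigned long long start;       /* byte range covered by the lock */
              unsigned long long end;
      };

      /* Stand-in for "does this object have dirty data inside the locked range?" */
      static bool covers_dirty_data(const struct extent_lock *lock)
      {
              (void)lock;
              return false;                   /* a pure read workload leaves nothing dirty */
      }

      /* Stand-in for the expensive sync: under ZFS this forces a txg commit,
       * which rewrites the vdev labels at both ends of every disk. */
      static void backend_sync(void)
      {
              printf("sync issued: txg commit, vdev labels rewritten\n");
      }

      /* Proposed guard: PR locks, and PW locks covering no dirty data, skip the sync. */
      static void on_lock_cancel(const struct extent_lock *lock)
      {
              if (lock->mode != LOCK_PW || !covers_dirty_data(lock)) {
                      printf("no dirty data under this lock, sync skipped\n");
                      return;
              }
              backend_sync();
      }

      int main(void)
      {
              struct extent_lock pr_lock = { LOCK_PR, 0, ~0ULL };

              on_lock_cancel(&pr_lock);       /* a read lock never needs the sync */
              return 0;
      }

      Only the condition in on_lock_cancel() matters here; the rest is scaffolding so
      the sketch compiles and runs on its own.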

      Attachments

        1. 57chaos-always.tar.bz2
          4.56 MB
        2. 58chaos-always.tar.bz2
          4.51 MB
        3. 58chaos-never.tar.bz2
          4.50 MB
        4. ori642.tar.bz2
          7.08 MB

        Issue Links

          Activity

            [LU-2078] sync_on_lock_cancel too aggressive

            Patch landed.

            ian Ian Colle (Inactive) added a comment
            morrone Christopher Morrone (Inactive) added a comment - Patch landed on master: http://review.whamcloud.com/4117

            I went ahead and ran the same test with `sync_on_lock_cancel=never` as well. Here's the logs and performance numbers.

            Single Shared File Read Performance (MiB/s)
            
                    |  always |  never  |
                  --+---------+---------+
            58chaos | 1311.51 | 1265.46 |
                  --+---------+---------+
            
            prakash Prakash Surya (Inactive) added a comment

            Same logs as before, except with our 58chaos tag (reverted debug patch-set 1 and applied debug patch-set 2 as requested).

            With this tag, I saw the best performance numbers of all my tests run so far (roughly equal to the sync_on_lock_cancel=never tests):

            Single Shared File Read Performance (MiB/s)
            
                    |  always |   never  |
                  --+---------+----------+
            58chaos | 1311.51 | untested |
                  --+---------+----------+
            

            So my question is, can we safely drop the write lock as is done in the debug patch? Or is this just a proof of concept?

            Also, what exactly is that lock protecting, and why?

            prakash Prakash Surya (Inactive) added a comment

            Thanks. Here is another patch: http://review.whamcloud.com/#change,3456 Hopefully this time we'll get some improvement. Again, a log with +dlmtrace +rpctrace would be very welcome. Thanks in advance.

            bzzz Alex Zhuravlev added a comment

            This has the IOR output from the read and write phases of the test, along with Lustre logs from the OSS with +dlmtrace and +rpctrace.

            prakash Prakash Surya (Inactive) added a comment

            Thanks for the data, Prakash. I'm still looking at the dlmtrace; it's still not clear why we don't reach
            peak performance: the very first PR lock should commit the whole object and all subsequent reads
            should skip osd_sync(). So here is a debug patch: http://review.whamcloud.com/#change,3456
            Please try it; just the current orion branch plus this patch with sync_on_cancel=always should be enough.
            A log with +dlmtrace +rpctrace would be very useful. Thanks in advance.
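
            For what it's worth, a rough standalone model of this "commit once, then skip"
            expectation (the names below are hypothetical, not the real osd-zfs symbols):
            track, per object, the newest txg already committed, so the first PR lock
            cancellation pays for the commit and later ones return without syncing at all.

            /* Model only -- hypothetical names, not the actual osd-zfs code. */
            #include <stdbool.h>
            #include <stdio.h>

            struct object_state {
                    unsigned long long last_written_txg;  /* newest txg holding data for this object */
                    unsigned long long last_synced_txg;   /* newest txg already committed to disk */
            };

            static bool needs_sync(const struct object_state *obj)
            {
                    return obj->last_written_txg > obj->last_synced_txg;
            }

            static void osd_sync_model(struct object_state *obj)
            {
                    printf("committing txg %llu\n", obj->last_written_txg);
                    obj->last_synced_txg = obj->last_written_txg;
            }

            static void on_pr_lock_cancel(struct object_state *obj)
            {
                    if (!needs_sync(obj)) {
                            printf("object already committed, sync skipped\n");
                            return;
                    }
                    osd_sync_model(obj);
            }

            int main(void)
            {
                    struct object_state obj = { .last_written_txg = 42, .last_synced_txg = 0 };

                    on_pr_lock_cancel(&obj);  /* first cancellation commits the object */
                    on_pr_lock_cancel(&obj);  /* subsequent cancellations skip the sync */
                    return 0;
            }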

            bzzz Alex Zhuravlev added a comment - edited

            IOR output for the read and write phases of each test, and MDS and OSS server logs for the read phase of each test with '+dlmtrace' enabled.

            prakash Prakash Surya (Inactive) added a comment

            Alex, I was able to run a few single shared file tests today. Here's a summary of my results.

            I ran 4 separate tests. Two tests with our 55chaos tag (without http://review.whamcloud.com/3355), one with sync_on_lock_cancel=always and another with sync_on_lock_cancel=never. Also, two tests were run with our 56chaos tag (with http://review.whamcloud.com/3355), again one with "always" and another with "never".

            Also, each test was run using 64 client nodes, 1 task per node, striped over a single OST, using these IOR options "-a POSIX -CegkY".

            Single Shared File Read Performance (MiB/s)
            
                     |  always |   never |
                   --+---------+---------+
             55chaos |  469.19 | 1304.03 |
                   --+---------+---------+
             56chaos |  988.26 | 1275.59 |
                   --+---------+---------+
            

            So just from the above data, your patch seems to provide a 2x improvement for reads!

            Also, I'll attach a tarball with the IOR output and logs from the MDS and OSS with '+dlmtrace' enabled during each test's read phase.

            prakash Prakash Surya (Inactive) added a comment

            Brian, do you remember any details of the test? How many stripes? Arguments to IOR, etc.? Thanks in advance.

            bzzz Alex Zhuravlev added a comment

            People

              bzzz Alex Zhuravlev
              behlendorf Brian Behlendorf
              Votes:
              0
              Watchers:
              4

              Dates

                Created:
                Updated:
                Resolved: