[LU-4507] Server hangs and terrible performance - ZFS IOR

Details

    • Type: Bug
    • Resolution: Cannot Reproduce
    • Priority: Blocker
    • None
    • Lustre 2.6.0
    • None
    • Environment: Hyperion/LLNL
    • Severity: 3
    • 12333

    Description

      For some time now we have been observing terrible read performance when running the ZFS IOR file-per-process test. The system sees ~7 GB/s reading with ldiskfs; at higher client counts, ZFS read performance on the same test drops to ~400 MB/s, which is roughly the level of a single client.
      Observing the OSTs, we typically see one or two of the 12 OSTs under very high load while the rest sit idle. The busy OST will then time out, frequently evict several clients, and move forward. Stack dumps and errors from two servers are attached. These tests are ongoing; please advise what further data needs to be collected.

      Attachments

        1. h-agb15.errors.txt
          9 kB
        2. h-agb15.log.dump.txt
          1.66 MB
        3. h-agb21.zfs.read.txt
          1.41 MB
        4. MDTEST performance.xlsx
          35 kB

        Issue Links

          Activity

            behlendorf Brian Behlendorf added a comment - - edited

            My suggestion would be to start with the latest source from GitHub if you're doing any sort of performance work. We've made some major performance improvements in the last 6 months that you'll definitely benefit from. We try very hard to keep the master branch stable, so I would track it for performance testing.

            https://github.com/zfsonlinux/zfs/

            One of the major improvements made post-0.6.2 is that the ZFS write throttle code has been completely reworked. The previous design was causing considerable I/O starvation/contention, just like you've described above. The updated code smooths things out considerably; we're seeing more consistent I/O times and improved throughput. Here are some additional links describing this work.

            http://open-zfs.org/wiki/Features#Smoother_Write_Throttle
            http://dtrace.org/blogs/ahl/2013/12/27/zfs-fundamentals-the-write-throttle/
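
            For illustration, a minimal sketch of one way to watch whether a pool's I/O is smooth or bursty, assuming a hypothetical pool name ("ostpool") and the per-pool io kstat from the master branch that is listed later in this comment:

            #!/usr/bin/env python
            # Minimal sketch, not a supported tool: sample the per-pool io kstat
            # once a second and print read/write throughput deltas. Assumes a
            # hypothetical pool name and that the kstat's last two lines are a
            # column-name row followed by a row of counters (bytes in nread/nwritten).
            import time

            POOL = 'ostpool'   # hypothetical pool name - substitute your own

            def read_io_kstat(pool):
                with open('/proc/spl/kstat/zfs/%s/io' % pool) as f:
                    lines = [l.split() for l in f if l.strip()]
                return dict(zip(lines[-2], (int(v) for v in lines[-1])))

            prev = read_io_kstat(POOL)
            for _ in range(30):
                time.sleep(1)
                cur = read_io_kstat(POOL)
                print('read %8d KB/s  write %8d KB/s' % (
                    (cur['nread'] - prev['nread']) / 1024,
                    (cur['nwritten'] - prev['nwritten']) / 1024))
                prev = cur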

            Another thing we've been working on is improving the ARC hit rate. We've observed that, particularly with metadata-heavy workloads (which is all the MDS does), the ARC performance degrades over time and we end up needing to read from disk more. You can see this behavior pretty easily by running the arcstat.py script, which among other things can show you the current cache hit rate. Prakash has been investigating this and has proposed some promising patches which help a lot. But we're still reviewing and testing them to ensure they work as expected and don't introduce regressions for other workloads. We'd love for you to give them a spin and see how much they help your testing.

            https://github.com/zfsonlinux/zfs/pull/1967
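
            For illustration, a minimal sketch of the kind of check arcstat.py performs, assuming the usual zfsonlinux layout of /proc/spl/kstat/zfs/arcstats (name / type / data columns with hits and misses counters):

            #!/usr/bin/env python
            # Minimal sketch: compute the overall ARC hit rate from the arcstats
            # kstat, roughly the cache hit rate that arcstat.py reports.

            def read_kstat(path):
                stats = {}
                with open(path) as f:
                    for line in f:
                        fields = line.split()
                        # data lines look like: "hits    4    123456789"
                        if len(fields) == 3 and fields[2].isdigit():
                            stats[fields[0]] = int(fields[2])
                return stats

            arc = read_kstat('/proc/spl/kstat/zfs/arcstats')
            hits, misses = arc['hits'], arc['misses']
            total = hits + misses
            print('ARC accesses: %d  hit rate: %.1f%%' % (total, 100.0 * hits / max(total, 1)))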

            It's also worth running the master branch because it adds some useful tools and more entries in /proc to improve visibility. My favorite new tool is dbufstat.py. It allows you to dump all the cached dbufs and show which pool, dataset, and object they belong to. You can also see extended information about each buffer, which often allows you to infer why it's being kept in the cache. For example, for Lustre it clearly shows all the spill blocks we're forced to use because of the 512-byte dnode size. That makes it quite clear that increasing the dnode size to 1k could halve the number of I/Os we need to do for lookups. It's nice to be able to easily see that.

            There are also some new entries in /proc/spl/kstat/zfs/. They let you get a handle on how long it's taking to assign a TXG, or exactly what I/O we are issuing to disk when we get a cache miss (a short sketch for dumping these follows the list below).

            • dbufs - Stats for all dbufs in the dbuf_hash
            • <pool>/txgs - Stats for the last N txgs synced to disk
            • <pool>/reads - Stats for the last N reads issued by the ARC
            • <pool>/dmu_tx_assign - Histogram of tx assign times
            • <pool>/io - Total I/O issued for the pool
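
            For illustration, a minimal sketch that prints the tail of each of these entries for one pool plus the global dbufs kstat; the pool name is a placeholder, and the exact columns in each file vary between ZFS versions, so it shows the raw text rather than trying to parse it:

            #!/usr/bin/env python
            # Minimal sketch: dump the last few lines of the new kstat entries
            # listed above. Entries that only exist on newer builds are skipped.
            import os

            POOL = 'ostpool'   # hypothetical pool name - substitute your own
            BASE = '/proc/spl/kstat/zfs'

            entries = ['dbufs'] + [os.path.join(POOL, name)
                                   for name in ('txgs', 'reads', 'dmu_tx_assign', 'io')]

            for entry in entries:
                path = os.path.join(BASE, entry)
                print('===== %s =====' % path)
                try:
                    with open(path) as f:
                        lines = f.readlines()
                except IOError as exc:    # entry only exists on newer ZFS builds
                    print('  unavailable: %s' % exc)
                    continue
                for line in lines[-10:]:  # the most recent records are at the end
                    print(line.rstrip())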

            Basically, we've been thinking about performance with ZFS too. And now that things are running well, we've been getting the tools in place so we can clearly understand exactly what needs to be improved. I'd hoped to get a 0.6.3 tag out with all these improvements in January, but that's slipped. One of the two major blockers is convincing ourselves that Prakash's ARC changes work as designed and help the expected workloads. Once again, if you guys could help test them, that would be very helpful!


            adilger Andreas Dilger added a comment

            Brian, Isaac is starting to dig into ZFS debugging and development. The stack traces that Cliff attached here look pretty clearly like a lockup (or at least heavy contention) in ZFS:

            ll_ost_io02_0 D 0000000000000008     0 10402      2 0x00000000
            Call Trace:
             [<ffffffff81510ad5>] rwsem_down_failed_common+0x95/0x1d0
             [<ffffffff81510c33>] rwsem_down_write_failed+0x23/0x30
             [<ffffffff81283cb3>] call_rwsem_down_write_failed+0x13/0x20
             [<ffffffff81510132>] ? down_write+0x32/0x40
             [<ffffffffa04e970a>] dmu_zfetch+0x6fa/0xd70 [zfs]
             [<ffffffffa04d3281>] dbuf_read+0x6a1/0x750 [zfs]
             [<ffffffffa04db122>] dmu_buf_hold_array_by_dnode+0x162/0x560 [zfs]
             [<ffffffffa04dc167>] dmu_read+0x97/0x180 [zfs]
             [<ffffffffa0d0891f>] osd_read+0x18f/0x240 [osd_zfs]
             [<ffffffffa0d08b53>] osd_read_prep+0x183/0x240 [osd_zfs]
             [<ffffffffa0dc3ba3>] ofd_preprw_read+0x253/0x7f0 [ofd]
             [<ffffffffa0dc48ba>] ofd_preprw+0x77a/0x1480 [ofd]
             [<ffffffffa0d58e11>] obd_preprw+0x121/0x390 [ost]
             [<ffffffffa0d60569>] ost_brw_read+0xd29/0x1350 [ost]
             [<ffffffffa0d676d8>] ost_handle+0x24a8/0x44d0 [ost]
             [<ffffffffa08a00e5>] ptlrpc_server_handle_request+0x385/0xc00 [ptlrpc]
             [<ffffffffa08a144d>] ptlrpc_main+0xaed/0x1740 [ptlrpc]
            

            Any pointers for how to proceed on this? It looks like the threads are blocked for a long time reading a dnode from disk, or there is some sort of locking deadlock.
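
            For illustration, a minimal sketch of one way to triage the attached stack dumps, assuming a sysrq-t style log such as h-agb15.log.dump.txt; it tallies each Call Trace block by the first [zfs] or [osd_zfs] frame, so it is easy to see how many service threads are piled up in dmu_zfetch versus elsewhere (the script name and dump layout are assumptions):

            #!/usr/bin/env python
            # Minimal sketch: count blocked threads in a sysrq-t style stack dump
            # by the first [zfs]/[osd_zfs] frame in each "Call Trace:" block.
            # Usage (hypothetical file name): python stack_tally.py h-agb15.log.dump.txt
            import re
            import sys
            from collections import Counter

            frame_re = re.compile(r'\[<[0-9a-f]+>\]\s+\??\s*(\S+?)\+0x\S+\s+\[(zfs|osd_zfs)\]')

            counts = Counter()
            current = None      # first zfs/osd_zfs frame seen in the current trace
            with open(sys.argv[1]) as f:
                for line in f:
                    if 'Call Trace:' in line:
                        if current:
                            counts[current] += 1
                        current = None
                    else:
                        m = frame_re.search(line)
                        if m and current is None:
                            current = m.group(1)
            if current:
                counts[current] += 1

            for func, n in counts.most_common():
                print('%5d  %s' % (n, func))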


            cliffw Cliff White (Inactive) added a comment

            ZFS is also demonstrating terrible mdtest performance. I compared ZFS and ldiskfs while increasing the number of files. Increasing the file count on ZFS from 500k to 750k increased the test run time 4-6x; no such impact was observed on the ldiskfs side. Spreadsheet attached.


            People

              Assignee: isaac Isaac Huang (Inactive)
              Reporter: cliffw Cliff White (Inactive)
              Votes: 0
              Watchers: 7
