[LU-9627] Bad small-file behaviour even when local-only and on RAM-FS - Whamcloud Community JIRA

Details

Type: Bug
Resolution: Unresolved
Priority: Minor
Fix Version/s: None
Affects Version/s: Lustre 2.9.0
Labels:
None

Severity:
3
Rank (Obsolete):
9223372036854775807

Description

Hi everyone, I have noticed a curiously bad small-file creation behaviour on Lustre 2.9.55.

I know that Lustre is inefficient when handling large amounts of small files and profits from the Metadata Servers running on SSDs – but while exploring just how bad this is, I found something curious.

My use case is simple: Create 50.000 40-byte files in a single directory. The "test.py" script below will do just that.

Since I wanted to find the theoretical speed of Lustre, I used the following setup:

A single server played the role of MGS, MDT, OST and Client.
All data storage happens via ldiskfs on a ramdisk
- 16GB Metadata
- 48GB Object Data
All network accesses happen via TCP loopback

The final Lustre FS looks like this:

[-bash-4.3]$ lfs df -h
UUID bytes Used Available Use% Mounted on
ram-MDT0000_UUID 8.9G 46.1M 8.0G 1% /mnt/ram/client[MDT:0]
ram-OST0000_UUID 46.9G 53.0M 44.4G 0% /mnt/ram/client[OST:0]
filesystem_summary: 46.9G 53.0M 44.4G 0% /mnt/ram/client

Unfortunately, when running the test-script (which needs ~5 seconds on a local disk), I instead get these abysmal speeds:

[-bash-4.3]$ ./test.py /mnt/ram/client
2017-06-09 18:49:56,518 [INFO ] Creating 50k files in one directory...
2017-06-09 18:50:50,437 [INFO ] Reading 50k files...
2017-06-09 18:51:09,310 [INFO ] Deleting 50k files...
2017-06-09 18:51:20,604 [INFO ] Creation took: 53.92 seconds
2017-06-09 18:51:20,604 [INFO ] Reading took: 18.87 seconds
2017-06-09 18:51:20,604 [INFO ] Deleting took: 11.29 seconds

This tells me, that there is a rather fundamental performance issue within Lustre – and that it has nothing to do with the disk or network latency.

That – or my test script is broken – but I do not think it is.

If you're curious, here's how I set up the test scenario:

mkdir -p /mnt/ram/disk
mount -t tmpfs -o size=64G tmpfs /mnt/ram/disk
dd if=/dev/zero of=/mnt/ram/disk/mdt.img bs=1M count=16K
dd if=/dev/zero of=/mnt/ram/disk/odt.img bs=1M count=48K
losetup /dev/loop0 /mnt/ram/disk/mdt.img
losetup /dev/loop1 /mnt/ram/disk/odt.img
mkfs.lustre --mgs --mdt --fsname=ram --backfstype=ldiskfs --index=0 /dev/loop0
mkfs.lustre --ost --fsname=ram --backfstype=ldiskfs --index=0 --mgsnode=127.0.0.1@tcp0 /dev/loop1

mkdir -p /mnt/ram/mdt
mount -t lustre -o defaults,noatime /dev/loop0 /mnt/ram/mdt
mkdir -p /mnt/ram/ost
mount -t lustre -o defaults,noatime /dev/loop1 /mnt/ram/ost

mkdir -p /mnt/ram/client
mount -t lustre 127.0.0.1@tcp0:/ram /mnt/ram/client
chmod 1777 /mnt/ram/client

Thanks!

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending
- Thumbnails
- List
- Download All

test.py
1 kB
09/Jun/17 5:01 PM

Issue Links

is related to

LU-10176 Data-on-MDT phase II

Open

LU-5603 Enable inline_data feature for Lustre

Open

LU-10619 Tiny writes improvement: Size + glimpse changes

Open

LU-9409 Lustre small IO write performance improvement

Resolved

Activity

People

Assignee:: WC Triage

Reporter:: Martin Schröder (Inactive)

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 09/Jun/17 5:13 PM

Updated:: 21/Jan/22 1:02 AM