Lustre / LU-5473

Test failure sanity test_51b: test_51b failed: fnum: No space left on device

Details

    • Type: Bug
    • Resolution: Fixed
    • Priority: Major
    • Fix Version/s: Lustre 2.9.0
    • Affects Version/s: Lustre 2.7.0, Lustre 2.5.3
    • Severity: 3
    • 15250

    Description

      This issue was created by maloo for Nathaniel Clark <nathaniel.l.clark@intel.com>

      This issue relates to the following test suite runs:
      https://testing.hpdd.intel.com/test_sets/09a62af8-1feb-11e4-8610-5254006e85c2
      https://testing.hpdd.intel.com/test_sets/b21a51b0-0aff-11e4-8dbb-5254006e85c2

      The sub-test test_51b failed with the following error:

      test_51b failed with 1

      Info required for matching: sanity 51b

      Attachments

        Activity


          utopiabound Nathaniel Clark added a comment -

          11KB/inode does seem to be a good estimate for ZFS, based on the custom runs from patch 21821.

          gerrit Gerrit Updater added a comment -

          Nathaniel Clark (nathaniel.l.clark@intel.com) uploaded a new patch: http://review.whamcloud.com/21821
          Subject: LU-5473 tests: Add debug to sanity/51b
          Project: fs/lustre-release
          Branch: master
          Current Patch Set: 1
          Commit: d0f82be70ef01af4ccea83d870ddb4d9178690ac

          utopiabound Nathaniel Clark added a comment -

          This test hasn't failed since 2015-06-15:
          https://testing.hpdd.intel.com/test_sets/b3e189ee-13ca-11e5-b4b0-5254006e85c2

          Same MDS error:

          15:41:02:LustreError: 19303:0:(osd_handler.c:209:osd_trans_start()) lustre-MDT0000: failed to start transaction due to ENOSPC. Metadata overhead is underestimated or grant_ratio is too low.

          adilger Andreas Dilger added a comment -

          The test is skipped (SLOW) on review-zfs, but it is still being run during full test runs, and it is consistently passing on ZFS. The average space usage before the test run is about 11KB/inode on the MDT, and it reports plenty of free inodes before the test, which passes. The MDT filesystem is 3GB in size, so it would have enough space for 95k inodes even at the 32KB/inode we were seeing before. I think 11KB/inode is reasonable for the number of files actually created in the filesystem at this point, since it includes all of the other filesystem overhead. It would be more useful to know the average per-inode space usage once all 70k files were created, to get a better average, and/or the differential usage for just those inodes.

          https://testing.hpdd.intel.com/sub_tests/7b8bcf2a-4c80-11e5-b77f-5254006e85c2

          == sanity test 51b: exceed 64k subdirectory nlink limit == 05:28:37 (1440480517)
          UUID                   1K-blocks        Used   Available Use% Mounted on
          lustre-MDT0000_UUID      3063424       12416     3048960   0% /mnt/lustre[MDT:0]
          lustre-OST0000_UUID      2031360        5632     2023680   0% /mnt/lustre[OST:0]
          lustre-OST0001_UUID      2031360        4608     2024704   0% /mnt/lustre[OST:1]
          lustre-OST0002_UUID      2031104        3968     2025088   0% /mnt/lustre[OST:2]
          lustre-OST0003_UUID      2031360        4224     2025088   0% /mnt/lustre[OST:3]
          lustre-OST0004_UUID      2031360        5760     2023552   0% /mnt/lustre[OST:4]
          lustre-OST0005_UUID      2031104        5888     2023168   0% /mnt/lustre[OST:5]
          lustre-OST0006_UUID      2031360        6912     2022400   0% /mnt/lustre[OST:6]
          
          filesystem summary:     14219008       36992    14167680   0% /mnt/lustre
          
          UUID                      Inodes       IUsed       IFree IUse% Mounted on
          lustre-MDT0000_UUID       162442        1363      161079   1% /mnt/lustre[MDT:0]
          lustre-OST0000_UUID        76956         423       76533   1% /mnt/lustre[OST:0]
          lustre-OST0001_UUID        73555         323       73232   0% /mnt/lustre[OST:1]
          lustre-OST0002_UUID        74679         320       74359   0% /mnt/lustre[OST:2]
          lustre-OST0003_UUID        74436         324       74112   0% /mnt/lustre[OST:3]
          lustre-OST0004_UUID        71288         323       70965   0% /mnt/lustre[OST:4]
          lustre-OST0005_UUID        70901         321       70580   0% /mnt/lustre[OST:5]
          lustre-OST0006_UUID        69146         322       68824   0% /mnt/lustre[OST:6]
          
          filesystem summary:       162442        1363      161079   1% /mnt/lustre
          

          The osd_statfs->osd_objs_count_estimate() information that is being computed for ZFS is using about 21KB/inode for its free inodes estimate, which is conservative but reasonable given how few inodes are actually in use at this point.

          It wouldn't be terrible to print another lfs df and lfs df -i after the test, regardless of pass/fail result, to see what the average space usage is on ZFS, and then if it is still reasonable (i.e. going down from 11KB/inode) this bug could be closed.
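
          A minimal sketch of what printing that extra usage information could look like. This is only an illustration, not the patch that eventually landed (http://review.whamcloud.com/12185); $MOUNT, the helper name, and the MDT0000 match pattern are assumptions based on the lfs df output above.

            #!/bin/bash
            # Illustrative sketch only: print MDT space/inode usage and a rough
            # KB-per-inode figure before and after the 70k-subdirectory create.
            MOUNT=${MOUNT:-/mnt/lustre}

            mdt_kb_per_inode() {
                local used_kb used_inodes
                used_kb=$(lfs df "$MOUNT" | awk '/MDT0000/ { print $3 }')        # "Used" column (1K blocks)
                used_inodes=$(lfs df -i "$MOUNT" | awk '/MDT0000/ { print $3 }') # "IUsed" column
                echo "MDT0000: ${used_kb} KB / ${used_inodes} inodes =" \
                     "$((used_kb / used_inodes)) KB/inode"
            }

            lfs df "$MOUNT" && lfs df -i "$MOUNT"
            mdt_kb_per_inode        # before: ~9 KB/inode in the run quoted above
            # ... test_51b body: create ~70000 subdirectories ...
            lfs df "$MOUNT" && lfs df -i "$MOUNT"
            mdt_kb_per_inode        # after: shows whether the average keeps dropping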

          pjones Peter Jones added a comment -

          Nathaniel

          Isaac thinks that this failure is due to a flaw in the test script but does not have the bandwidth to dig into it atm. Are you able to investigate?

          Thanks

          Peter


          adilger Andreas Dilger added a comment -

          Only a debug patch was landed, and the test was skipped. The core problem is not fixed.

          jlevi Jodi Levi (Inactive) added a comment -

          Patch landed to Master.

          gerrit Gerrit Updater added a comment -

          Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/12185/
          Subject: LU-5473 tests: print space usage in sanity test_51b
          Project: fs/lustre-release
          Branch: master
          Current Patch Set:
          Commit: 6e45c6d3ae4c46a0312bbb95b7e9ff09761f037d

          adilger Andreas Dilger added a comment -

          Is there any way that we can figure out why the MDS is consuming 32KB per inode? I now recall a patch landing:

          commit 47c0b97421b21dab686b05d6bf829ebcaf62d5db
          Author: Isaac Huang <he.huang@intel.com>
          Date:   Tue Jul 22 16:42:03 2014 -0600
          
              LU-5391 osd-zfs: ZAP object block sizes too small
              
              Currently osd-zfs ZAP objects use 4K for both leaf
              and indirect blocks. This patch increases:
              - leaf block to 16K, which equals ZFS fzap_default_block_shift
              - indirect block to 16K, the default used by ZPL directories
              
              Signed-off-by: Isaac Huang <he.huang@intel.com>
              Change-Id: I5b476414d27822a14afb25e1307991fbd2e3a59e
              Reviewed-on: http://review.whamcloud.com/11182
          

          which might account for some of this. If the ZAP leaf and indirect blocks are being updated randomly due to many file creations, it may be that the indirect blocks are taking up a lot of space in the metadnode and saved snapshots?
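
          As a rough, assumption-laden way to check where that per-inode space is going, one could dump the MDT dataset's object table with zdb and look at the ZAP block sizes directly. This is only a sketch; the pool/dataset name below is a placeholder, not taken from this ticket.

            # Illustrative sketch: inspect the MDT dataset with zdb to see how
            # large the ZAP data and indirect blocks actually are on disk.
            # "lustre-mdt1/mdt1" is only a placeholder for the real MDT dataset.
            MDT_DATASET=${MDT_DATASET:-lustre-mdt1/mdt1}

            # Per-object dump; the iblk/dblk columns show indirect/data block
            # sizes, so directory ZAP objects created after the LU-5391 change
            # should show 16K rather than 4K.
            zdb -dd "$MDT_DATASET"

            # Pool-wide block accounting, to see how much space is metadata overhead.
            zdb -b "${MDT_DATASET%%/*}"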

          pjones Peter Jones added a comment -

          Isaac

          What do you suggest here?

          Peter

          adilger Andreas Dilger added a comment - edited

          The failing test run shows about 32KB of space used per inode on the MDT both before and after the test is run. This is more than I would have expected, which is about 4KB per ZFS inode. It is expected that the free space and free inodes would run out at the same time on a ZFS filesystem, since the ZFS inodes are not preallocated as they are on ldiskfs. The total number of inodes and the number of free inodes is an estimate that is based on the space used and number of inodes used (average bytes per inode) and the number of free blocks.

          It would be interesting to see what the average space used per inode is for other ZFS filesystems.
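
          To make that estimation concrete, here is a back-of-the-envelope sketch using the MDT numbers from the passing run quoted earlier in this ticket; it mirrors the idea described above, not the exact osd-zfs code, and the variable names are illustrative.

            # Rough version of the estimate described above, using the MDT line
            # from the earlier lfs df / lfs df -i output (12416 KB used,
            # 1363 inodes used, 3048960 KB available).
            used_kb=12416; used_inodes=1363; avail_kb=3048960

            kb_per_inode=$((used_kb / used_inodes))   # ~9 KB/inode observed so far
            est_free=$((avail_kb / kb_per_inode))     # naive estimate: ~338k more inodes

            echo "observed ${kb_per_inode} KB/inode, naive free-inode estimate ${est_free}"
            # lfs df -i actually reported IFree = 161079 for that run, i.e. roughly
            # 3048960 / 161079 ~= 19 KB/inode was assumed -- in line with the
            # ~21KB/inode figure mentioned above, and more conservative than the
            # naive per-inode average.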


          People

            Assignee: Nathaniel Clark (utopiabound)
            Reporter: Maloo (maloo)
            Votes: 0
            Watchers: 13
