[LU-4265] replay-ost-single test_6: space grew after dd (or didn't change) Created: 18/Nov/13 Updated: 29/Apr/19 Resolved: 29/Apr/19 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.6.0, Lustre 2.5.3, Lustre 2.9.0, Lustre 2.12.0, Lustre 2.13.0, Lustre 2.12.1 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Major |
| Reporter: | Maloo | Assignee: | WC Triage |
| Resolution: | Cannot Reproduce | Votes: | 0 |
| Labels: | zfs |
| Issue Links: |
| Severity: | 3 |
| Rank (Obsolete): | 11718 |
| Description |
|
This issue was created by maloo for Nathaniel Clark <nathaniel.l.clark@intel.com>.
This issue relates to the following test suite run:
The sub-test test_6 failed with the following error:
Info required for matching: replay-ost-single 6 |
| Comments |
| Comment by Jian Yu [ 05/Sep/14 ] |
|
While verifying patches http://review.whamcloud.com/11541, http://review.whamcloud.com/11411, http://review.whamcloud.com/9318 on Lustre b2_5 branch with FSTYPE=zfs, the same failure occurred: |
| Comment by Jian Yu [ 25/Sep/14 ] |
|
One more instance on Lustre b2_5 branch: https://testing.hpdd.intel.com/test_sets/cd0bb8ee-44dc-11e4-bb5a-5254006e85c2 |
| Comment by nasf (Inactive) [ 27/Sep/14 ] |
|
Another failure instance: |
| Comment by Isaac Huang (Inactive) [ 29/Oct/14 ] |
|
Another one: It appeared to be related to:
rm -f $f
sync && sleep 5 && sync # wait for delete thread
# wait till space is returned, following
# (( $before > $after_dd)) test counting on that
wait_mds_ost_sync || return 4
wait_destroy_complete || return 5
local before=$(kbytesfree)
I doubt the wait can work reliably with ZFS: frees can sometimes be delayed by a few transaction groups. Freeing space, waiting, and then reading the free space seems inherently unreliable on ZFS. |
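One way to picture the concern above is a helper that polls free space until two consecutive samples agree, instead of sleeping for a fixed time. This is only a minimal sketch and not something proposed in the ticket: `wait_free_space_stable` is a hypothetical name, and the first argument stands in for whatever command reports free space (for example `kbytesfree` in the test). Two matching samples do not prove that all delayed frees have landed, so this narrows the race rather than closing it.

```shell
# Hypothetical helper (not part of the Lustre test framework).
# Polls a free-space command until two consecutive samples match,
# or gives up after max_tries polls.
wait_free_space_stable() {
    local cmd=$1              # command that prints free space, e.g. kbytesfree
    local max_tries=${2:-30}  # assumed default: 30 polls
    local interval=${3:-1}    # assumed default: 1 second between polls
    local prev cur i

    prev=$($cmd) || return 1
    i=0
    while [ "$i" -lt "$max_tries" ]; do
        sleep "$interval"
        cur=$($cmd) || return 1
        if [ "$cur" = "$prev" ]; then
            # Value stopped changing; report it to the caller.
            echo "$cur"
            return 0
        fi
        prev=$cur
        i=$((i + 1))
    done
    # Free space never settled within max_tries polls.
    return 1
}
```

In the test snippet quoted above, such a helper could in principle replace the direct `local before=$(kbytesfree)` read, at the cost of a longer and still not fully deterministic wait.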
| Comment by Jian Yu [ 08/Dec/14 ] |
|
More failure instances on the Lustre b2_5 branch: |
| Comment by Johann Lombardi (Inactive) [ 13/Jan/15 ] |
|
Another instance on master: |
| Comment by Bob Glossman (Inactive) [ 19/Jun/15 ] |
|
another on master: |
| Comment by James Nunez (Inactive) [ 15/Jul/15 ] |
|
Another two failures on master in review-zfs-part-2: |
| Comment by Jian Yu [ 30/Jul/15 ] |
|
More failure instances on the master branch: |
| Comment by James Nunez (Inactive) [ 27/Dec/15 ] |
|
Another failure on master: |
| Comment by James Nunez (Inactive) [ 24/Apr/19 ] |
|
It looks like this test is failing again/still. I’ve gone back to January 2018 and looked at all the times replay-ost-single test 6 failed with the “space grew after dd:” error, and we’ve seen six failures, all in ZFS testing: Some of the failures look questionable or, possibly, the error message should be modified. For example, when “before” and “after” are the same value: |
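The point about the error message can be sketched as a check that reports "unchanged" separately from "grew". This is an illustration only, not the test's actual code: `check_space_after_dd` is a hypothetical name, the message wording is invented here, and the KB unit is assumed from the name `kbytesfree`.

```shell
# Hypothetical check (not the actual replay-ost-single code).
# Compares free space before and after dd and distinguishes the
# "unchanged" case from the "grew" case.
check_space_after_dd() {
    local before=$1   # free space before dd (KB assumed)
    local after=$2    # free space after dd (KB assumed)

    if [ "$after" -lt "$before" ]; then
        echo "ok: free space shrank by $((before - after))"
        return 0
    elif [ "$after" -eq "$before" ]; then
        echo "space did not change after dd: before=$before after=$after"
        return 1
    else
        echo "space grew after dd: before=$before after=$after"
        return 2
    fi
}
```

Separate return codes let a caller treat the two failure modes differently, for example retrying the "unchanged" case while still failing immediately when space grew.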
| Comment by Andreas Dilger [ 29/Apr/19 ] |
|
This stopped being hit in 2015. |