[LU-16507] sanity test_413a: division by 0 (error token is "min") Created: 26/Jan/23  Updated: 30/Jan/23

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.15.3
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Issue Links:
Related
is related to LU-14824 sanity test_413a: timeout Resolved
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for jianyu <yujian@whamcloud.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/aff94fc3-25d0-4fe0-829f-14d1db24c79a

test_413a failed with the following error:

== sanity test 413a: QoS mkdir with 'lfs mkdir -i -1' ==== 08:28:10 (1674635290)
lmv.lustre-clilmv-ffff99cd41c88800.qos_maxage=1
CMD: trevis-128vm4 /usr/sbin/lctl get_param -n lod.lustre-MDT0000-mdtlov.qos_maxage
CMD: trevis-128vm4,trevis-128vm5 /usr/sbin/lctl set_param lod.*.mdt_qos_maxage=1
lod.lustre-MDT0000-mdtlov.mdt_qos_maxage=1
lod.lustre-MDT0002-mdtlov.mdt_qos_maxage=1
lod.lustre-MDT0001-mdtlov.mdt_qos_maxage=1
lod.lustre-MDT0003-mdtlov.mdt_qos_maxage=1
CMD: trevis-128vm4,trevis-128vm5 /usr/sbin/lctl set_param lod.*.mdt_qos_maxage=5

Check for uneven MDTs: 
weight diff=1% must be > 100% ...Fill MDT0 with 1000 files: loop 0
weight diff=9% must be > 100% ...Fill MDT0 with 1000 files: loop 1
weight diff=21% must be > 100% ...Fill MDT0 with 1000 files: loop 2
weight diff=33% must be > 100% ...Fill MDT0 with 1000 files: loop 3
weight diff=86% must be > 100% ...Fill MDT0 with 1000 files: loop 4
weight diff=39% must be > 100% ...Fill MDT0 with 1000 files: loop 5
weight diff=39% must be > 100% ...Fill MDT0 with 1000 files: loop 6
weight diff=39% must be > 100% ...Fill MDT0 with 1000 files: loop 7
weight diff=98% must be > 100% ...Fill MDT0 with 1000 files: loop 8
weight diff=99% must be > 100% ...Fill MDT0 with 1000 files: loop 9
weight diff=99% must be > 100% ...Fill MDT0 with 1000 files: loop 10
/usr/lib64/lustre/tests/sanity.sh: line 25478: (max - min) * 100 / min: division by 0 (error token is "min")

Test session details:
clients: https://build.whamcloud.com/job/lustre-reviews/91863 - 4.18.0-348.7.1.el8_5.x86_64
servers: https://build.whamcloud.com/job/lustre-reviews/91863 - 4.18.0-348.23.1.el8_lustre.x86_64

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
sanity test_413a - test_413a returned 1



 Comments   
Comment by Andreas Dilger [ 27/Jan/23 ]

This issue was previously fixed by patch https://review.whamcloud.com/45955 "LU-14824 test: sanity 413a/b unlink timeout" that had to be reverted due to it causing test timeouts from looping forever while filling the MDT:

weight diff=101% must be > 120% ...Fill MDT0 with 200 files: loop 0
weight diff=101% must be > 120% ...Fill MDT0 with 200 files: loop 0
weight diff=101% must be > 120% ...Fill MDT0 with 200 files: loop 0
Comment by Andreas Dilger [ 30/Jan/23 ]

I've pushed https://review.whamcloud.com/49799 "LU-14824 test: sanity 413a/b unlink timeout v2" that fixes the issues in the original patch.

Generated at Sat Feb 10 03:27:36 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.