[LU-9522] sanity test 27z is failing with 'test_27z failed with 6' Created: 17/May/17  Updated: 19/May/17  Resolved: 19/May/17

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.10.0
Fix Version/s: Lustre 2.10.0

Type: Bug Priority: Critical
Reporter: James Nunez (Inactive) Assignee: James Nunez (Inactive)
Resolution: Fixed Votes: 0
Labels: None
Environment: autotest

Issue Links:
Related
is related to LU-8998 Progressive File Layout (PFL) Resolved
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

sanity test_27z is failing with the error

'test_27z failed with 6' 

From the test_log, we see

Started lustre-OST0001
/mnt/lustre-ost2/O/0/d3/1667: parent=[0x200000bd0:0x40cc:0x0] stripe=0 stripe_size=1048576 stripe_count=2
want: stripe:1 ost:0 oid:858/0x35a seq:0
/mnt/lustre-ost2/O/0/d3/1667: parent=[0x200000bd0:0x40cc:0x0] stripe=0 stripe_size=1048576 stripe_count=2
 sanity test_27z: @@@@@@ FAIL: FF stripe 0 != 1 

This test is only failing for review-zfs-part-1.

sanity test 27z started failing with this error on May 16, 2017. The first few failure logs are at
https://testing.hpdd.intel.com/test_sets/4f3d47bc-3a42-11e7-b81e-5254006e85c2
https://testing.hpdd.intel.com/test_sets/6d1be1a2-3a43-11e7-b861-5254006e85c2
https://testing.hpdd.intel.com/test_sets/2a85c6c6-3a5e-11e7-b3f6-5254006e85c2



 Comments   
Comment by James Nunez (Inactive) [ 17/May/17 ]

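I think I know what the issue is, and it looks like a test-only issue. The ff variable in the test is not reset at the start of each loop iteration. On a non-ldiskfs OST the debugfs branch is skipped, so on later iterations ff still holds the value from the previous iteration, the -z "$ff" check is false, and ff is never updated: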

        $GETSTRIPE $1 | while read obdidx oid hex seq; do
...

                local ff
                #
                # Don't unmount/remount the OSTs if we don't need to do that.
                # LU-2577 changes filter_fid to be smaller, so debugfs needs
                # update too, until that use mount/ll_decode_filter_fid/mount.
                # Re-enable when debugfs will understand new filter_fid.
                #
                if [ $(facet_fstype ost$ost) == ldiskfs ]; then
                        ff=$(do_facet ost$ost "$DEBUGFS -c -R 'stat $obj_file' \
                                $dev 2>/dev/null" | grep "parent=")
                fi
                if [ -z "$ff" ]; then
                        stop ost$ost
                        mount_fstype ost$ost
                        ff=$(do_facet ost$ost $LL_DECODE_FILTER_FID \
                                $(facet_mntpt ost$ost)/$obj_file)
                        unmount_fstype ost$ost
                        start ost$ost $dev $OST_MOUNT_OPTS
                        clients_up
                fi

...

I will upload a patch to make sure this is the issue.
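
For illustration, here is a minimal standalone sketch of the suspected behavior, assuming the test runs under bash. In bash, a bare "local ff" on a variable that is already local to the current function does not clear its value, so a declaration inside a loop body keeps whatever the previous iteration assigned. The names decode_stripe, check_stale and check_reset are hypothetical stand-ins and are not part of sanity.sh or test-framework.sh. The explicit reset shown is one possible fix and is not necessarily what the uploaded patch does.

```shell
#!/bin/bash
# Hypothetical stand-in for the ll_decode_filter_fid step; not a Lustre helper.
decode_stripe() {
	echo "stripe=$1"
}

# Suspected buggy pattern: "local ff" without an assignment inside the loop
# keeps the value set during the previous iteration, so the -z check fails
# and ff is not refreshed.
check_stale() {
	local idx
	for idx in 0 1; do
		local ff
		if [ -z "$ff" ]; then
			ff=$(decode_stripe "$idx")
		fi
		echo "want stripe=$idx, got $ff"
	done
}

# One possible fix (an assumption): clear ff explicitly on every iteration.
check_reset() {
	local idx
	for idx in 0 1; do
		local ff=""
		if [ -z "$ff" ]; then
			ff=$(decode_stripe "$idx")
		fi
		echo "want stripe=$idx, got $ff"
	done
}

check_stale
check_reset
```

In this sketch the second iteration of check_stale reports a mismatch between the wanted and decoded stripe index, which is the same shape as the "FF stripe 0 != 1" failure in the test log, while check_reset decodes a fresh value each time.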

Comment by Gerrit Updater [ 17/May/17 ]

James Nunez (james.a.nunez@intel.com) uploaded a new patch: https://review.whamcloud.com/27163
Subject: LU-9522 test: sanity 27z fix update local variable
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 558cdb7b540e1eafed4ec637ca1b6a756bc3a0b6

Comment by James Nunez (Inactive) [ 17/May/17 ]

The test is failing due to https://review.whamcloud.com/#/c/26569/
Change-Id: Ia971e33cc3d8a5e4ca6f821116f12c0a72bcab07

Comment by Bob Glossman (Inactive) [ 18/May/17 ]

More failures on master:
https://testing.hpdd.intel.com/test_sets/b573cb72-3b56-11e7-b3f6-5254006e85c2
https://testing.hpdd.intel.com/test_sets/f864ac5e-3c18-11e7-b3f6-5254006e85c2

Comment by Gerrit Updater [ 19/May/17 ]

Andreas Dilger (andreas.dilger@intel.com) merged in patch https://review.whamcloud.com/27163/
Subject: LU-9522 test: sanity 27z fix update local variable
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: e6e0d59bb3af2cad16495059aa23675e6a3cffb0

Comment by Peter Jones [ 19/May/17 ]

Landed for 2.10
