Details
-
Bug
-
Resolution: Fixed
-
Minor
-
Lustre 2.4.0
-
None
-
3
-
7315
Description
This issue was created by maloo for Li Wei <liwei@whamcloud.com>
This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/a27c80dc-9050-11e2-8311-52540035b04c.
The sub-test test_90 failed with the following error:
lfs getsripe does not report the affected lustre-OST000d_UUID for fd
Info required for matching: replay-single 90
== replay-single test 90: lfs find identifies the missing striped file segments == 18:07:06 (1363655226) Create the files CMD: c01 lctl get_param -n obdfilter.lustre-OST000d.uuid Fail ost14 lustre-OST000d_UUID, display the list of affected files CMD: c01 grep -c /mnt/ost14' ' /proc/mounts Stopping /mnt/ost14 (opts:) on c01 CMD: c01 umount -d /mnt/ost14 CMD: c01 lsmod | grep lnet > /dev/null && lctl dl | grep ' ST ' General Query: lfs find /mnt/lustre/d0.replay-single/d90 /mnt/lustre/d0.replay-single/d90 /mnt/lustre/d0.replay-single/d90/f7 /mnt/lustre/d0.replay-single/d90/f13 /mnt/lustre/d0.replay-single/d90/f8 /mnt/lustre/d0.replay-single/d90/all /mnt/lustre/d0.replay-single/d90/f9 /mnt/lustre/d0.replay-single/d90/f11 /mnt/lustre/d0.replay-single/d90/f15 /mnt/lustre/d0.replay-single/d90/f12 /mnt/lustre/d0.replay-single/d90/f6 /mnt/lustre/d0.replay-single/d90/f2 /mnt/lustre/d0.replay-single/d90/f4 /mnt/lustre/d0.replay-single/d90/f14 /mnt/lustre/d0.replay-single/d90/f3 /mnt/lustre/d0.replay-single/d90/f10 /mnt/lustre/d0.replay-single/d90/f5 /mnt/lustre/d0.replay-single/d90/f0 /mnt/lustre/d0.replay-single/d90/f1 Querying files on shutdown ost14: lfs find --obd lustre-OST000d_UUID /mnt/lustre/d0.replay-single/d90/f13 /mnt/lustre/d0.replay-single/d90/all replay-single test_90: @@@@@@ FAIL: lfs find does not report the affected lustre-OST000d_UUID for fd Trace dump: = /usr/lib64/lustre/tests/test-framework.sh:3977:error_noexit() = /usr/lib64/lustre/tests/replay-single.sh:2776:test_90() = /usr/lib64/lustre/tests/test-framework.sh:4255:run_one() = /usr/lib64/lustre/tests/test-framework.sh:4288:run_one_logged() = /usr/lib64/lustre/tests/test-framework.sh:4143:run_test() = /usr/lib64/lustre/tests/replay-single.sh:2792:main() Dumping lctl log to /logdir/test_logs/2013-03-18/lustre-reviews-el6-x86_64--review--1_1_1__14094__-70104848885040-152208/replay-single.test_90.*.1363655230.log CMD: c01,c02,c03,c04,c05,c06,c08,c09 /usr/sbin/lctl dk > /logdir/test_logs/2013-03-18/lustre-reviews-el6-x86_64--review--1_1_1__14094__-70104848885040-152208/replay-single.test_90.debug_log.\$(hostname -s).1363655230.log; dmesg > /logdir/test_logs/2013-03-18/lustre-reviews-el6-x86_64--review--1_1_1__14094__-70104848885040-152208/replay-single.test_90.dmesg.\$(hostname -s).1363655230.log Check getstripe: /usr/bin/lfs getstripe -r --obd lustre-OST000d_UUID /mnt/lustre/d0.replay-single/d90/f13 lmm_stripe_count: 1 lmm_stripe_size: 1048576 lmm_layout_gen: 0 lmm_stripe_offset: 13 obdidx objid objid group 13 3138 0xc42 0 * /mnt/lustre/d0.replay-single/d90/all lmm_stripe_count: 16 lmm_stripe_size: 1048576 lmm_layout_gen: 0 lmm_stripe_offset: 15 obdidx objid objid group 13 3137 0xc41 0 * /mnt/lustre/d0.replay-single/d90/all replay-single test_90: @@@@@@ FAIL: lfs getsripe does not report the affected lustre-OST000d_UUID for fd Trace dump: = /usr/lib64/lustre/tests/test-framework.sh:3977:error_noexit() = /usr/lib64/lustre/tests/replay-single.sh:2787:test_90() = /usr/lib64/lustre/tests/test-framework.sh:4255:run_one() = /usr/lib64/lustre/tests/test-framework.sh:4288:run_one_logged() = /usr/lib64/lustre/tests/test-framework.sh:4143:run_test() = /usr/lib64/lustre/tests/replay-single.sh:2792:main() Dumping lctl log to /logdir/test_logs/2013-03-18/lustre-reviews-el6-x86_64--review--1_1_1__14094__-70104848885040-152208/replay-single.test_90.*.1363655235.log CMD: c01,c02,c03,c04,c05,c06,c08,c09 /usr/sbin/lctl dk > /logdir/test_logs/2013-03-18/lustre-reviews-el6-x86_64--review--1_1_1__14094__-70104848885040-152208/replay-single.test_90.debug_log.\$(hostname -s).1363655235.log; dmesg > /logdir/test_logs/2013-03-18/lustre-reviews-el6-x86_64--review--1_1_1__14094__-70104848885040-152208/replay-single.test_90.dmesg.\$(hostname -s).1363655235.log Failover ost14 to c01 18:07:26 (1363655246) waiting for c01 network 900 secs ... 18:07:26 (1363655246) network interface is UP CMD: c01 hostname CMD: c01 test -b /dev/lvm-OSS/P6 Starting ost14: /dev/lvm-OSS/P6 /mnt/ost14 CMD: c01 mkdir -p /mnt/ost14; mount -t lustre /dev/lvm-OSS/P6 /mnt/ost14 CMD: c01 PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/utils/gss:/usr/lib64/lustre/utils:/usr/lib64/openmpi/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin: NAME=autotest_config sh rpc.sh set_default_debug \"0x33f0404\" \" 0xffb7e3ff\" 32 CMD: c01 e2label /dev/lvm-OSS/P6 2>/dev/null Started lustre-OST000d Resetting fail_loc on all nodes...CMD: c01,c02,c03,c04,c05,c06,c08,c09 lctl set_param -n fail_loc=0 2>/dev/null || true done. CMD: c01,c02,c03,c04,c05,c06,c08 rc=\$([ -f /proc/sys/lnet/catastrophe ] && echo \$(< /proc/sys/lnet/catastrophe) || echo 0); if [ \$rc -ne 0 ]; then echo \$(hostname): \$rc; fi exit \$rc
The patch has landed to master.