[LU-1841] Test failure on test suite replay-single, subtest test_61c Created: 05/Sep/12  Updated: 29/May/17  Resolved: 29/May/17

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Cannot Reproduce Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 10376

 Description   

This issue was created by maloo for Oleg Drokin <green@whamcloud.com>

This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/b80d0642-f703-11e1-b320-52540035b04c.

The sub-test test_61c failed with the following error:

Restart of ost1 failed!

Info required for matching: replay-single 61c

Seems to be some race in the test script, fail_timeout code woke up one second after userspace attempted to unmount/remount (and failed with 'already mounted'). here's part of OST log:

14:51:15:Lustre: DEBUG MARKER: == replay-single test 61c: test race mds llog sync vs llog cleanup == 14:51:14 (1346795474)
14:51:15:Lustre: DEBUG MARKER: lctl set_param fail_loc=0x80000222
14:51:26:LustreError: 25495:0:(fail.c:133:__cfs_fail_timeout_set()) cfs_fail_timeout id 222 sleeping for 30000ms
14:51:26:Lustre: DEBUG MARKER: grep -c /mnt/ost1' ' /proc/mounts
14:51:38:Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null && lctl dl | grep ' ST '
14:51:49:Lustre: DEBUG MARKER: hostname
14:51:49:Lustre: DEBUG MARKER: test -b /dev/lvm-OSS/P1
14:51:49:Lustre: DEBUG MARKER: mkdir -p /mnt/ost1; mount -t lustre   		                   /dev/lvm-OSS/P1 /mnt/ost1
14:51:49:Lustre: DEBUG MARKER: /usr/sbin/lctl mark  replay-single test_61c: @@@@@@ FAIL: Restart of ost1 failed! 
14:51:49:Lustre: DEBUG MARKER: replay-single test_61c: @@@@@@ FAIL: Restart of ost1 failed!
14:51:49:Lustre: DEBUG MARKER: /usr/sbin/lctl dk > /logdir/test_logs/2012-09-04/lustre-reviews-el6-x86_64__8865__-7f9a93b0cf40/replay-single.test_61c.debug_log.$(hostname -s).1346795506.log;
14:51:49:         dmesg > /logdir/test_logs/2012-09-04/lustre-reviews-el6-x86_64__8865__-7f9a93b0cf40/re
14:51:50:LustreError: 25495:0:(fail.c:137:__cfs_fail_timeout_set()) cfs_fail_timeout id 222 awake


 Comments   
Comment by Andreas Dilger [ 29/May/17 ]

Close old ticket.

Generated at Sat Feb 10 01:20:09 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.