Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-1841

Test failure on test suite replay-single, subtest test_61c

    XMLWordPrintable

Details

    • Bug
    • Resolution: Cannot Reproduce
    • Minor
    • None
    • None
    • None
    • 3
    • 10376

    Description

      This issue was created by maloo for Oleg Drokin <green@whamcloud.com>

      This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/b80d0642-f703-11e1-b320-52540035b04c.

      The sub-test test_61c failed with the following error:

      Restart of ost1 failed!

      Info required for matching: replay-single 61c

      Seems to be some race in the test script, fail_timeout code woke up one second after userspace attempted to unmount/remount (and failed with 'already mounted'). here's part of OST log:

      14:51:15:Lustre: DEBUG MARKER: == replay-single test 61c: test race mds llog sync vs llog cleanup == 14:51:14 (1346795474)
      14:51:15:Lustre: DEBUG MARKER: lctl set_param fail_loc=0x80000222
      14:51:26:LustreError: 25495:0:(fail.c:133:__cfs_fail_timeout_set()) cfs_fail_timeout id 222 sleeping for 30000ms
      14:51:26:Lustre: DEBUG MARKER: grep -c /mnt/ost1' ' /proc/mounts
      14:51:38:Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null && lctl dl | grep ' ST '
      14:51:49:Lustre: DEBUG MARKER: hostname
      14:51:49:Lustre: DEBUG MARKER: test -b /dev/lvm-OSS/P1
      14:51:49:Lustre: DEBUG MARKER: mkdir -p /mnt/ost1; mount -t lustre   		                   /dev/lvm-OSS/P1 /mnt/ost1
      14:51:49:Lustre: DEBUG MARKER: /usr/sbin/lctl mark  replay-single test_61c: @@@@@@ FAIL: Restart of ost1 failed! 
      14:51:49:Lustre: DEBUG MARKER: replay-single test_61c: @@@@@@ FAIL: Restart of ost1 failed!
      14:51:49:Lustre: DEBUG MARKER: /usr/sbin/lctl dk > /logdir/test_logs/2012-09-04/lustre-reviews-el6-x86_64__8865__-7f9a93b0cf40/replay-single.test_61c.debug_log.$(hostname -s).1346795506.log;
      14:51:49:         dmesg > /logdir/test_logs/2012-09-04/lustre-reviews-el6-x86_64__8865__-7f9a93b0cf40/re
      14:51:50:LustreError: 25495:0:(fail.c:137:__cfs_fail_timeout_set()) cfs_fail_timeout id 222 awake
      

      Attachments

        Activity

          People

            wc-triage WC Triage
            maloo Maloo
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: