Lustre / LU-2297

Test failure on test suite replay-single, subtest test_74: client umount hang

Details

    • Bug
    • Resolution: Fixed
    • Blocker
    • Lustre 2.4.0
    • Lustre 2.4.0
    • 3
    • 5490

    Description

      This issue was created by maloo for sarah <sarah@whamcloud.com>

      This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/c2d37650-2819-11e2-aa14-52540035b04c.

      The sub-test test_74 failed with the following error:

      test failed to respond and timed out

      client umount hang

      12:25:18:Lustre: DEBUG MARKER: == replay-single test 74: Ensure applications don't fail waiting for OST recovery == 12:25:18 (1352147118)
      12:25:30:Lustre: DEBUG MARKER: running=$(grep -c /mnt/lustre' ' /proc/mounts);
      12:25:30:if [ $running -ne 0 ] ; then
      12:25:30:echo Stopping client $(hostname) /mnt/lustre opts:;
      12:25:30:lsof /mnt/lustre || need_kill=no;
      12:25:30:if [ x != x -a x$need_kill != xno ]; then
      12:25:30:    pids=$(lsof -t /mnt/lustre | sort -u);
      12:25:30:    if 
      12:29:22:INFO: task umount:29383 blocked for more than 120 seconds.
      12:29:22:"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
      12:29:22:umount        D 0000000000000000     0 29383  29376 0x00000080
      12:29:22: ffff880079f97b68 0000000000000082 ffff8800ffffffff 00001d5805116b96
      12:29:22: ffff880079f97ad8 ffff880037f6ae50 00000000004059ee ffffffffaf68090e
      12:29:22: ffff88003a921058 ffff880079f97fd8 000000000000fb88 ffff88003a921058
      12:29:22:Call Trace:
      12:29:22: [<ffffffff8109cd49>] ? ktime_get_ts+0xa9/0xe0
      12:29:22: [<ffffffff811141f0>] ? sync_page+0x0/0x50
      12:29:22: [<ffffffff814fe0f3>] io_schedule+0x73/0xc0
      12:29:22: [<ffffffff8111422d>] sync_page+0x3d/0x50
      12:29:22: [<ffffffff814feaaf>] __wait_on_bit+0x5f/0x90
      12:29:22: [<ffffffff81114463>] wait_on_page_bit+0x73/0x80
      12:29:22: [<ffffffff81092110>] ? wake_bit_function+0x0/0x50
      12:29:22: [<ffffffff8112a965>] ? pagevec_lookup_tag+0x25/0x40
      12:29:22: [<ffffffff811148db>] wait_on_page_writeback_range+0xfb/0x190
      12:29:22: [<ffffffff8111499f>] filemap_fdatawait+0x2f/0x40
      12:29:22: [<ffffffff811a4874>] sync_inodes_sb+0x114/0x190
      12:29:22: [<ffffffff811aa312>] __sync_filesystem+0x82/0x90
      12:29:22: [<ffffffff811aa51b>] sync_filesystem+0x4b/0x70
      12:29:22: [<ffffffff8117d317>] generic_shutdown_super+0x27/0xe0
      12:29:22: [<ffffffff8117d436>] kill_anon_super+0x16/0x60
      12:29:22: [<ffffffffa052a94a>] lustre_kill_super+0x4a/0x60 [obdclass]
      12:29:22: [<ffffffff8117e4b0>] deactivate_super+0x70/0x90
      12:29:22: [<ffffffff8119a4ef>] mntput_no_expire+0xbf/0x110
      12:29:22: [<ffffffff8119af8b>] sys_umount+0x7b/0x3a0
      12:29:22: [<ffffffff810d6b12>] ? audit_syscall_entry+0x272/0x2a0
      12:29:23: [<ffffffff8100b0f2>] system_call_fastpath+0x16/0x1b
      12:31:14:INFO: task umount:29383 blocked for more than 120 seconds.
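
The call trace above shows umount blocked under sync_filesystem in filemap_fdatawait, that is, waiting for page writeback to complete. When comparing several such hung-task reports it can help to reduce each trace to its symbol names. The following is only a minimal sketch, not part of the test framework: the names FRAME_RE, parse_frames and SAMPLE are hypothetical, and the regular expression assumes the frame layout seen in this log (address in "[<...>]", optional "? " for frames the unwinder marks as unreliable, optional module name in brackets).

```python
import re

# One frame of a kernel call trace as it appears in this console log, e.g.
#   "12:29:22: [<ffffffff814fe0f3>] io_schedule+0x73/0xc0"
# Assumption: a leading "? " marks a frame the unwinder is unsure about,
# and a trailing "[name]" gives the kernel module that owns the symbol.
FRAME_RE = re.compile(
    r"\[<[0-9a-f]+>\]\s+"
    r"(?P<unreliable>\?\s+)?"
    r"(?P<symbol>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?:\s+\[(?P<module>\w+)\])?"
)


def parse_frames(log_text):
    """Return (symbol, module, reliable) tuples for every frame in log_text."""
    frames = []
    for line in log_text.splitlines():
        match = FRAME_RE.search(line)
        if match:
            frames.append(
                (
                    match.group("symbol"),
                    match.group("module"),  # None for core-kernel symbols
                    match.group("unreliable") is None,
                )
            )
    return frames


# Three lines copied from the trace in this report.
SAMPLE = """\
12:29:22: [<ffffffff811141f0>] ? sync_page+0x0/0x50
12:29:22: [<ffffffff814fe0f3>] io_schedule+0x73/0xc0
12:29:22: [<ffffffffa052a94a>] lustre_kill_super+0x4a/0x60 [obdclass]
"""

if __name__ == "__main__":
    for symbol, module, reliable in parse_frames(SAMPLE):
        marker = "" if reliable else "? "
        owner = f" [{module}]" if module else ""
        print(f"{marker}{symbol}{owner}")
```

Keeping only the reliable frames (those without "?") gives the path from sys_umount down to io_schedule, which is usually the part worth comparing between reports.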
      

          People

            bobijam Zhenyu Xu
            maloo Maloo