Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-7022

recovery-small test_100: hung on umount

    XMLWordPrintable

Details

    • Bug
    • Resolution: Unresolved
    • Minor
    • None
    • None
    • None
    • 3
    • 9223372036854775807

    Description

      This issue was created by maloo for Bob Glossman <bob.glossman@intel.com>

      This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/cbf89e7c-4675-11e5-bedf-5254006e85c2.

      The sub-test test_100 failed with the following error:

      test failed to respond and timed out
      

      syslog from ost shows:

      obd refcount = 4. Is it stuck?
      Aug 19 01:10:10 onyx-30vm4 kernel: Lustre: lustre-OST0000: Not available for connect from 10.2.4.97@tcp (stopping)
      Aug 19 01:10:10 onyx-30vm4 kernel: Lustre: Skipped 77 previous similar messages
      Aug 19 01:13:53 onyx-30vm4 kernel: INFO: task umount:9683 blocked for more than 120 seconds.
      Aug 19 01:13:53 onyx-30vm4 kernel:      Tainted: P           ---------------    2.6.32-504.30.3.el6_lustre.g107be2b.x86_64 #1
      Aug 19 01:13:53 onyx-30vm4 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
      Aug 19 01:13:53 onyx-30vm4 kernel: umount        D 0000000000000001     0  9683   9682 0x00000080
      Aug 19 01:13:53 onyx-30vm4 kernel: ffff880043057a78 0000000000000082 0000000000000000 ffff880043057a18
      Aug 19 01:13:53 onyx-30vm4 kernel: ffff8800430579d8 ffffffffa21c8983 0000100ed9610762 0000000000000000
      Aug 19 01:13:53 onyx-30vm4 kernel: ffff8800657dd044 000000010108d4b9 ffff88006308e5f8 ffff880043057fd8
      Aug 19 01:13:53 onyx-30vm4 kernel: Call Trace:
      Aug 19 01:13:53 onyx-30vm4 kernel: [<ffffffff8152b222>] schedule_timeout+0x192/0x2e0
      Aug 19 01:13:53 onyx-30vm4 kernel: [<ffffffff81087540>] ? process_timeout+0x0/0x10
      Aug 19 01:13:53 onyx-30vm4 kernel: [<ffffffffa2157c66>] obd_exports_barrier+0xb6/0x190 [obdclass]
      Aug 19 01:13:53 onyx-30vm4 kernel: [<ffffffffa0a4556f>] ofd_device_fini+0x5f/0x250 [ofd]
      Aug 19 01:13:53 onyx-30vm4 kernel: [<ffffffffa21747b2>] class_cleanup+0x572/0xd30 [obdclass]
      Aug 19 01:13:53 onyx-30vm4 kernel: [<ffffffffa2154726>] ? class_name2dev+0x56/0xe0 [obdclass]
      Aug 19 01:13:53 onyx-30vm4 kernel: [<ffffffffa2176e06>] class_process_config+0x1e96/0x2800 [obdclass]
      Aug 19 01:13:53 onyx-30vm4 kernel: [<ffffffffa1e11c01>] ? libcfs_debug_msg+0x41/0x50 [libcfs]
      Aug 19 01:13:53 onyx-30vm4 kernel: [<ffffffffa2177c2f>] class_manual_cleanup+0x4bf/0x8e0 [obdclass]
      Aug 19 01:13:53 onyx-30vm4 kernel: [<ffffffffa2154726>] ? class_name2dev+0x56/0xe0 [obdclass]
      Aug 19 01:13:53 onyx-30vm4 kernel: [<ffffffffa21b10b2>] server_put_super+0x9e2/0xeb0 [obdclass]
      Aug 19 01:13:53 onyx-30vm4 kernel: [<ffffffff811ac776>] ? invalidate_inodes+0xf6/0x190
      Aug 19 01:13:53 onyx-30vm4 kernel: [<ffffffff81190b7b>] generic_shutdown_super+0x5b/0xe0
      Aug 19 01:13:53 onyx-30vm4 kernel: [<ffffffff81190c66>] kill_anon_super+0x16/0x60
      Aug 19 01:13:53 onyx-30vm4 kernel: [<ffffffffa217aae6>] lustre_kill_super+0x36/0x60 [obdclass]
      Aug 19 01:13:53 onyx-30vm4 kernel: [<ffffffff81191407>] deactivate_super+0x57/0x80
      Aug 19 01:13:53 onyx-30vm4 kernel: [<ffffffff811b10df>] mntput_no_expire+0xbf/0x110
      Aug 19 01:13:53 onyx-30vm4 kernel: [<ffffffff811b1c2b>] sys_umount+0x7b/0x3a0
      Aug 19 01:13:53 onyx-30vm4 kernel: [<ffffffff8100b0d2>] system_call_fastpath+0x16/0x1b
      Aug 19 01:14:25 onyx-30vm4 kernel: Lustre: lustre-OST0000 is waiting for obd_unlinked_exports more than 256 seconds. The obd refcount = 4. Is it stuck?
      

      Info required for matching: recovery-small 100

      Attachments

        Issue Links

          Activity

            People

              wc-triage WC Triage
              maloo Maloo
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated: