Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-302

ll_ost_io_* threads hung

    XMLWordPrintable

Details

    • 3
    • 10122

    Description

      While running runtests test, the ll_ost_io_* threads hung as follows:

      Lustre: DEBUG MARKER: copying files from /etc /bin to /mnt/lustre/runtest.5368/etc /bin at Tue May 10 02:14:07 PDT 2011
      Lustre: Service thread pid 6575 was inactive for 40.00s. The thread might be hung, or it might only be slow and will resume later. Dumping the stack trace for debugging purposes:
      Pid: 6575, comm: ll_ost_io_03
      
      Call Trace:
       [<ffffffff8006466c>] __down_read+0x7a/0x92
       [<ffffffff88bb2b0f>] ldiskfs_ext_walk_space+0xdf/0x2d0 [ldiskfs]
       [<ffffffff88c0bf10>] ldiskfs_ext_new_extent_cb+0x0/0x650 [fsfilt_ldiskfs]
       [<ffffffff8006456b>] __down_write_nested+0x12/0x92
       [<ffffffff88c0846d>] fsfilt_map_nblocks+0xfd/0x150 [fsfilt_ldiskfs]
       [<ffffffff88c69a7d>] filter_direct_io+0x46d/0xd50 [obdfilter]
       [<ffffffff88c08be7>] fsfilt_ldiskfs_setattr+0x1a7/0x250 [fsfilt_ldiskfs]
       [<ffffffff88c6c840>] filter_commitrw_write+0x1800/0x2be0 [obdfilter]
       [<ffffffff8005c33c>] cache_alloc_refill+0x106/0x186
       [<ffffffff88c24eed>] ost_checksum_bulk+0x37d/0x5a0 [ost]
       [<ffffffff88c2bd09>] ost_brw_write+0x1c99/0x2480 [ost]
       [<ffffffff8001aa2d>] vsnprintf+0x5df/0x627
       [<ffffffff88945f25>] lustre_msg_get_opc+0x35/0xf0 [ptlrpc]
       [<ffffffff889460d8>] lustre_msg_check_version_v2+0x8/0x20 [ptlrpc]
       [<ffffffff88c2f09e>] ost_handle+0x2bae/0x55b0 [ost]
       [<ffffffff8890019a>] lock_res_and_lock+0xba/0xd0 [ptlrpc]
       [<ffffffff887b4a87>] libcfs_next_nidstring+0x37/0x50 [libcfs]
       [<ffffffff889556f9>] ptlrpc_server_handle_request+0x989/0xe00 [ptlrpc]
       [<ffffffff88955e55>] ptlrpc_wait_event+0x2e5/0x310 [ptlrpc]
       [<ffffffff8008c86f>] __wake_up_common+0x3e/0x68
       [<ffffffff88956de6>] ptlrpc_main+0xf66/0x1120 [ptlrpc]
       [<ffffffff8005dfb1>] child_rip+0xa/0x11
       [<ffffffff88955e80>] ptlrpc_main+0x0/0x1120 [ptlrpc]
       [<ffffffff8005dfa7>] child_rip+0x0/0x11
      
      LustreError: dumping log to /tmp/lustre-log.1305018888.6575
      

      Please refer to the following Maloo report for more logs:
      https://maloo.whamcloud.com/test_sets/ecc7177c-7aec-11e0-b5bf-52540025f9af

      The issue is blocking the testing on b1_8 branch.

      Attachments

        Activity

          People

            ys Yang Sheng
            yujian Jian Yu
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: