Lustre / LU-1766

dd never finishes sometimes

Details

    • Bug
    • Resolution: Duplicate
    • Blocker
    • None
    • Lustre 2.3.0
    • None
    • 3
    • 6348

    Description

      While writing a file with dd, I saw the following console message on the OST:

      LNet: Service thread pid 3461 was inactive for 40.00s. The thread might be hung, or it might only be slow and will resume later. Dumping the stack trace for debugging purposes:
      Pid: 3461, comm: ll_ost_io01_001
      
      Call Trace:
       [<ffffffffa0079d85>] jbd2_log_wait_commit+0xc5/0x140 [jbd2]
       [<ffffffff810920d0>] ? autoremove_wake_function+0x0/0x40
       [<ffffffffa0ab248d>] fsfilt_ldiskfs_commit_wait+0x6d/0xf0 [fsfilt_ldiskfs]
       [<ffffffffa0bfa23d>] filter_preprw_write+0xc4d/0x22f0 [obdfilter]
       [<ffffffffa0e0346b>] ? cfs_percpt_lock+0x5b/0x130 [libcfs]
       [<ffffffffa0e769cb>] ? lolnd_send+0x2b/0xb0 [lnet]
       [<ffffffffa0e03374>] ? cfs_percpt_unlock+0x24/0xc0 [libcfs]
       [<ffffffffa0600aab>] ? null_alloc_rs+0x1ab/0x3b0 [ptlrpc]
       [<ffffffffa05edc44>] ? sptlrpc_svc_alloc_rs+0x74/0x2d0 [ptlrpc]
       [<ffffffffa0bfc6e0>] filter_preprw+0x80/0xa0 [obdfilter]
       [<ffffffffa04ea81c>] obd_preprw+0x12c/0x3d0 [ost]
       [<ffffffffa04f198a>] ost_brw_write+0x87a/0x1600 [ost]
       [<ffffffff8127cea6>] ? vsnprintf+0x2b6/0x5f0
       [<ffffffffa05bf07c>] ? lustre_msg_get_version+0x8c/0x100 [ptlrpc]
       [<ffffffffa05bf1d8>] ? lustre_msg_check_version+0xe8/0x100 [ptlrpc]
       [<ffffffffa04f802c>] ost_handle+0x360c/0x4850 [ost]
       [<ffffffffa0df5541>] ? libcfs_debug_msg+0x41/0x50 [libcfs]
       [<ffffffffa0df1344>] ? libcfs_id2str+0x74/0xb0 [libcfs]
       [<ffffffffa05ce87d>] ptlrpc_server_handle_request+0x40d/0xea0 [ptlrpc]
       [<ffffffffa0de565e>] ? cfs_timer_arm+0xe/0x10 [libcfs]
       [<ffffffffa05c5d07>] ? ptlrpc_wait_event+0xa7/0x2a0 [ptlrpc]
       [<ffffffff810533f3>] ? __wake_up+0x53/0x70
       [<ffffffffa05cfe69>] ptlrpc_main+0xb59/0x1860 [ptlrpc]
       [<ffffffffa05cf310>] ? ptlrpc_main+0x0/0x1860 [ptlrpc]
       [<ffffffff8100c14a>] child_rip+0xa/0x20
       [<ffffffffa05cf310>] ? ptlrpc_main+0x0/0x1860 [ptlrpc]
       [<ffffffffa05cf310>] ? ptlrpc_main+0x0/0x1860 [ptlrpc]
       [<ffffffff8100c140>] ? child_rip+0x0/0x20
      

      After this message appeared, the dd process never finished.

      I also verified that this problem is easy to reproduce when the OST is almost full.
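
To make the call trace above easier to read, here is a minimal sketch that filters out the frames marked with `?`. In Linux kernel traces that marker means the address was found on the stack but is not a verified part of the call chain. The helper name `reliable_frames`, the `SAMPLE` excerpt (a few lines copied from the trace), and the `"kernel"` label for frames without a module are illustrative assumptions, not part of any Lustre tooling.

```python
import re

# Matches one frame line of a kernel call trace, for example:
#   [<ffffffffa0079d85>] jbd2_log_wait_commit+0xc5/0x140 [jbd2]
#   [<ffffffff810920d0>] ? autoremove_wake_function+0x0/0x40
FRAME_RE = re.compile(
    r"\[<(?P<addr>[0-9a-f]+)>\]\s+"          # frame address
    r"(?P<unreliable>\?\s+)?"                # optional "?" marker
    r"(?P<symbol>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"  # symbol+offset/size
    r"(?:\s+\[(?P<module>\w+)\])?"           # optional module name
)


def reliable_frames(trace_text):
    """Return (symbol, module) pairs for frames not marked with '?'.

    Frames with no module suffix are labeled "kernel" (an assumption
    made here for readability).
    """
    frames = []
    for line in trace_text.splitlines():
        match = FRAME_RE.search(line)
        if match and not match.group("unreliable"):
            frames.append((match.group("symbol"), match.group("module") or "kernel"))
    return frames


# Short excerpt copied from the trace in this report.
SAMPLE = """\
 [<ffffffffa0079d85>] jbd2_log_wait_commit+0xc5/0x140 [jbd2]
 [<ffffffff810920d0>] ? autoremove_wake_function+0x0/0x40
 [<ffffffffa0ab248d>] fsfilt_ldiskfs_commit_wait+0x6d/0xf0 [fsfilt_ldiskfs]
 [<ffffffffa0bfa23d>] filter_preprw_write+0xc4d/0x22f0 [obdfilter]
 [<ffffffff8100c14a>] child_rip+0xa/0x20
"""

if __name__ == "__main__":
    # Frames are printed innermost first, as in the original trace.
    for symbol, module in reliable_frames(SAMPLE):
        print(f"{symbol} [{module}]")
```

Applied to the full trace, the same filter leaves the chain from `ptlrpc_main` through `ost_handle`, `ost_brw_write`, `obd_preprw`, `filter_preprw`, and `filter_preprw_write` down to `fsfilt_ldiskfs_commit_wait` and `jbd2_log_wait_commit`, which is where the service thread was waiting when the stack was dumped.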

      Attachments

        1. stack.txt
          225 kB
          Jinshan Xiong

        Activity

          People

            hongchao.zhang Hongchao Zhang
            jay Jinshan Xiong (Inactive)
            Votes: 0
            Watchers: 2