  1. Lustre
  2. LU-8392

sanity test_27z: soft lockup - CPU#0 stuck for 22s! [ptlrpcd_rcv:6145]


Details

    • Bug
    • Resolution: Duplicate
    • Minor
    • None
    • Lustre 2.9.0
    • None
    • 3
    • 9223372036854775807

    Description

      This issue was created by maloo for John Hammond <john.hammond@intel.com>

      This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/2fcd5716-48f6-11e6-bf87-5254006e85c2.

      The sub-test test_27z failed with the following error:

      test failed to respond and timed out
      
      11:07:16:[ 1632.062003] BUG: soft lockup - CPU#0 stuck for 22s! [ptlrpcd_rcv:6145]
      11:07:16:[ 1632.062003] Modules linked in: lustre(OE) obdecho(OE) mgc(OE) lov(OE) osc(OE) mdc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc_gss(OE) ptlrpc(OE) obdclass(OE) ksocklnd(OE) lnet(OE) sha512_generic crypto_null libcfs(OE) rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache xprtrdma ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod crc_t10dif crct10dif_generic crct10dif_common ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core ib_addr ppdev pcspkr virtio_balloon parport_pc parport i2c_piix4 nfsd nfs_acl lockd auth_rpcgss grace sunrpc ip_tables ext4 mbcache jbd2 ata_generic pata_acpi virtio_blk cirrus syscopyarea sysfillrect sysimgblt drm_kms_helper 8139too ttm ata_piix serio_raw virtio_pci virtio_ring virtio libata 8139cp mii drm i2c_core floppy
      11:07:16:[ 1632.062003] CPU: 0 PID: 6145 Comm: ptlrpcd_rcv Tainted: G           OE  ------------   3.10.0-327.22.2.el7.x86_64 #1
      11:07:16:[ 1632.062003] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2007
      11:07:16:[ 1632.062003] task: ffff880078184500 ti: ffff880078e58000 task.ti: ffff880078e58000
      11:07:16:[ 1632.062003] RIP: 0010:[<ffffffff8163dab2>]  [<ffffffff8163dab2>] _raw_spin_lock+0x32/0x50
      11:07:16:[ 1632.062003] RSP: 0018:ffff880078e5b9f0  EFLAGS: 00000202
      11:07:16:[ 1632.062003] RAX: 0000000000002fb8 RBX: 0000000000000000 RCX: 000000000000701c
      11:07:16:[ 1632.062003] RDX: 000000000000701e RSI: 000000000000701e RDI: ffff880079cb4d00
      11:07:16:[ 1632.062003] RBP: ffff880078e5b9f0 R08: 0000000000000000 R09: 0000000000000208
      11:07:16:[ 1632.062003] R10: 0000000000000009 R11: ffff880078e5b85e R12: ffff880078e5bfd8
      11:07:16:[ 1632.062003] R13: ffffffff812fd8e3 R14: ffff880078e5b9f0 R15: ffffffffa05cf498
      11:07:16:[ 1632.062003] FS:  0000000000000000(0000) GS:ffff88007fc00000(0000) knlGS:0000000000000000
      11:07:16:[ 1632.062003] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
      11:07:16:[ 1632.062003] CR2: 00007f5378b2fed0 CR3: 000000007856d000 CR4: 00000000000006f0
      11:07:16:[ 1632.062003] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
      11:07:16:[ 1632.062003] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
      11:07:16:[ 1632.062003] Stack:
      11:07:16:[ 1632.062003]  ffff880078e5ba18 ffffffffa05db498 0000020000000100 0000000000000000
      11:07:16:[ 1632.062003]  ffffffffffffffff ffff880078e5ba90 ffffffffa065a296 000200000a090430
      11:07:16:[ 1632.062003]  000200000a09042e ffff880044ae7c00 ffffffffa06916e0 0000000000020000
      11:07:16:[ 1632.062003] Call Trace:
      11:07:16:[ 1632.062003]  [<ffffffffa05db498>] cfs_percpt_lock+0x58/0x110 [libcfs]
      11:07:16:[ 1632.062003]  [<ffffffffa065a296>] lnet_send+0xb6/0xc90 [lnet]
      11:07:16:[ 1632.062003]  [<ffffffff811c178e>] ? kmem_cache_alloc_trace+0x1ce/0x1f0
      11:07:16:[ 1632.062003]  [<ffffffffa065b0b5>] LNetPut+0x245/0x7a0 [lnet]
      11:07:16:[ 1632.062003]  [<ffffffffa0919aa3>] ptl_send_buf+0x183/0x500 [ptlrpc]
      11:07:16:[ 1632.062003]  [<ffffffffa091b5b1>] ptl_send_rpc+0x611/0xda0 [ptlrpc]
      11:07:16:[ 1632.062003]  [<ffffffffa0910ff0>] ptlrpc_send_new_req+0x460/0xa60 [ptlrpc]
      11:07:16:[ 1632.062003]  [<ffffffffa0914358>] ptlrpc_check_set.part.23+0x9a8/0x1dd0 [ptlrpc]
      11:07:16:[ 1632.062003]  [<ffffffffa09157db>] ptlrpc_check_set+0x5b/0xe0 [ptlrpc]
      11:07:16:[ 1632.062003]  [<ffffffffa09403bb>] ptlrpcd_check+0x4eb/0x5e0 [ptlrpc]
      11:07:16:[ 1632.062003]  [<ffffffffa094076b>] ptlrpcd+0x2bb/0x560 [ptlrpc]
      11:07:16:[ 1632.062003]  [<ffffffff810b88d0>] ? wake_up_state+0x20/0x20
      11:07:16:[ 1632.062003]  [<ffffffffa09404b0>] ? ptlrpcd_check+0x5e0/0x5e0 [ptlrpc]
      11:07:16:[ 1632.062003]  [<ffffffff810a5aef>] kthread+0xcf/0xe0
      11:07:16:[ 1632.062003]  [<ffffffff810a5a20>] ? kthread_create_on_node+0x140/0x140
      11:07:16:[ 1632.062003]  [<ffffffff816467d8>] ret_from_fork+0x58/0x90
      11:07:16:[ 1632.062003]  [<ffffffff810a5a20>] ? kthread_create_on_node+0x140/0x140
      

      Info required for matching: sanity 27z
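      When triaging a trace like the one above, it can help to reduce the call trace to a list of symbols and owning modules. The following is a minimal sketch, not part of the original report: the helper name `parse_frames`, the `FRAME_RE` pattern, and the `"kernel"` label for frames without a module suffix are all illustrative assumptions. Frames marked with `?` are reported as unreliable, since the kernel prints that marker for addresses found on the stack that the unwinder could not confirm as part of the call chain.

```python
import re

# Matches frames such as:
#   [<ffffffffa065a296>] lnet_send+0xb6/0xc90 [lnet]
#   [<ffffffff811c178e>] ? kmem_cache_alloc_trace+0x1ce/0x1f0
# The pattern is an assumption based on the log format shown in this issue.
FRAME_RE = re.compile(
    r"\[<(?P<addr>[0-9a-f]+)>\]\s+"
    r"(?P<unreliable>\?\s+)?"
    r"(?P<symbol>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?:\s+\[(?P<module>\w+)\])?"
)


def parse_frames(log_text):
    """Return (symbol, module, reliable) for each frame after 'Call Trace:'."""
    frames = []
    in_trace = False
    for line in log_text.splitlines():
        if "Call Trace:" in line:
            in_trace = True
            continue
        if not in_trace:
            continue  # skips the RIP line and register dump before the trace
        match = FRAME_RE.search(line)
        if match is None:
            continue
        frames.append((
            match.group("symbol"),
            match.group("module") or "kernel",  # hypothetical label: no module suffix
            match.group("unreliable") is None,
        ))
    return frames


# A few lines copied from the log in this issue.
SAMPLE = """\
11:07:16:[ 1632.062003] Call Trace:
11:07:16:[ 1632.062003]  [<ffffffffa05db498>] cfs_percpt_lock+0x58/0x110 [libcfs]
11:07:16:[ 1632.062003]  [<ffffffffa065a296>] lnet_send+0xb6/0xc90 [lnet]
11:07:16:[ 1632.062003]  [<ffffffff811c178e>] ? kmem_cache_alloc_trace+0x1ce/0x1f0
11:07:16:[ 1632.062003]  [<ffffffffa0914358>] ptlrpc_check_set.part.23+0x9a8/0x1dd0 [ptlrpc]
"""

if __name__ == "__main__":
    for symbol, module, reliable in parse_frames(SAMPLE):
        marker = "" if reliable else " (unreliable)"
        print(f"{module}: {symbol}{marker}")
```

      Applied to the full trace, this would show the stuck thread spinning in `_raw_spin_lock` beneath `cfs_percpt_lock`, reached from `lnet_send` via the ptlrpc send path.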


            People

              sbuisson Sebastien Buisson (Inactive)
              maloo Maloo
              Votes: 0
              Watchers: 10
