Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-4807

repeating DQACQ failed with -37

    XMLWordPrintable

Details

    • Bug
    • Resolution: Duplicate
    • Critical
    • None
    • Lustre 2.4.1
    • None
    • Lustre-2.4.1
    • 3
    • 13222

    Description

      Mar 25 00:29:51 ddnoss4 kernel: LustreError: 4503:0:(qsd_handler.c:344:qsd_req_completion()) $$$ DQACQ failed with -37, flags:0x2 qsd:home2-OST0026 qtype:grp id:3303 enforced:1 granted:7364460 pending:0 waiting:0 req:1 usage:7165364 qunit:4194304 qtune:524288 edquot:0
      Mar 25 00:39:51 ddnoss4 kernel: LustreError: 4503:0:(qsd_handler.c:344:qsd_req_completion()) $$$ DQACQ failed with -37, flags:0x2 qsd:home2-OST0026 qtype:grp id:3303 enforced:1 granted:7364460 pending:0 waiting:0 req:1 usage:7165364 qunit:4194304 qtune:524288 edquot:0
      Mar 25 00:49:51 ddnoss4 kernel: LustreError: 4503:0:(qsd_handler.c:344:qsd_req_completion()) $$$ DQACQ failed with -37, flags:0x2 qsd:home2-OST0026 qtype:grp id:3303 enforced:1 granted:7364460 pending:0 waiting:0 req:1 usage:7165364 qunit:4194304 qtune:524288 edquot:0
      Mar 25 00:59:51 ddnoss4 kernel: LustreError: 4503:0:(qsd_handler.c:344:qsd_req_completion()) $$$ DQACQ failed with -37, flags:0x2 qsd:home2-OST0026 qtype:grp id:3303 enforced:1 granted:7364460 pending:0 waiting:0 req:1 usage:7165364 qunit:4194304 qtune:524288 edquot:0
      

      Eventually, OSS crashed. Here is console messages when OSS crashed.

      <4>Pid: 179, comm: kswapd1 Not tainted 2.6.32-358.18.1.el6_lustre.x86_64 #1 Dell Inc. PowerEdge R620/0PXXHP
      <4>RIP: 0010:[<ffffffffa0d45f2c>]  [<ffffffffa0d45f2c>] lqe64_hash_keycmp+0xc/0x20 [lquota]
      <4>RSP: 0018:ffff880821b9d940  EFLAGS: 00010206
      <4>RAX: 0000000000000c7b RBX: ffff88048dc20c80 RCX: 0000000000000000
      <4>RDX: 0000000000000000 RSI: 5a5a5a5a5a5a5a5a RDI: ffff880473a14f28
      <4>RBP: ffff880821b9d940 R08: 0000000000000003 R09: 0000000000000001
      <4>R10: 0000000000000000 R11: 0000000000000000 R12: ffff880821b9d9d0
      <4>R13: ffff880473a14f28 R14: 0000000000000000 R15: 5a5a5a5a5a5a5a5a
      <4>FS:  0000000000000000(0000) GS:ffff88084c400000(0000) knlGS:0000000000000000
      <4>CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
      <4>CR2: 00000034bb673e10 CR3: 000000100a428000 CR4: 00000000000407e0
      <4>DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
      <4>DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
      <4>Process kswapd1 (pid: 179, threadinfo ffff880821b9c000, task ffff880821422040)
      <4>Stack:
      <4> ffff880821b9d990 ffffffffa0577945 0000000300000000 0000000000000000
      <4><d> ffff880821b9d9b0 ffff880821b9d9d0 ffff880473a14f28 ffff88048dc20c80
      <4><d> 0000000000000001 ffffffffa0714e00 ffff880821b9d9c0 ffffffffa0577ac7
      <4>Call Trace:
      <4> [<ffffffffa0577945>] cfs_hash_bd_lookup_intent+0x65/0x130 [libcfs]
      <4> [<ffffffffa0577ac7>] cfs_hash_dual_bd_lookup_locked+0x37/0x70 [libcfs]
      <4> [<ffffffffa0578cf4>] cfs_hash_lookup+0x54/0xa0 [libcfs]
      <4> [<ffffffffa0d464f7>] lqe_locate+0x47/0x850 [lquota]
      <4> [<ffffffffa0d5804b>] qsd_op_adjust+0x2cb/0x580 [lquota]
      <4> [<ffffffffa0db77f1>] osd_object_delete+0x231/0x2f0 [osd_ldiskfs]
      <4> [<ffffffffa0687829>] lu_object_free+0x89/0x1a0 [obdclass]
      <4> [<ffffffffa05774d2>] ? cfs_hash_bd_from_key+0x42/0xd0 [libcfs]
      <4> [<ffffffffa0688b3f>] lu_site_purge+0x2af/0x4a0 [obdclass]
      <4> [<ffffffffa0688e06>] lu_cache_shrink+0xd6/0x280 [obdclass]
      <4> [<ffffffff81131fca>] shrink_slab+0x12a/0x1a0
      <4> [<ffffffff811351ba>] balance_pgdat+0x59a/0x820
      <4> [<ffffffff81135574>] kswapd+0x134/0x3c0
      <4> [<ffffffff81096da0>] ? autoremove_wake_function+0x0/0x40
      <4> [<ffffffff81135440>] ? kswapd+0x0/0x3c0
      <4> [<ffffffff81096a36>] kthread+0x96/0xa0
      <4> [<ffffffff8100c0ca>] child_rip+0xa/0x20
      <4> [<ffffffff810969a0>] ? kthread+0x0/0xa0
      <4> [<ffffffff8100c0c0>] ? child_rip+0x0/0x20
      <4>Code: 29 c8 21 d0 c3 66 2e 0f 1f 84 00 00 00 00 00 55 48 89 e5 0f 1f 44 00 00 c9 48 8d 47 10 c3 90 55 48 89 e5 0f 1f 44 00 00 48 8b 07 <48> 39 46 10 c9 0f 94 c0 0f b6 c0 c3 0f 1f 84 00 00 00 00 00 55 
      <1>RIP  [<ffffffffa0d45f2c>] lqe64_hash_keycmp+0xc/0x20 [lquota]
      <4> RSP <ffff880821b9d940>
      

      It's very similar to LU-4249

      Attachments

        Issue Links

          Activity

            People

              niu Niu Yawei (Inactive)
              ihara Shuichi Ihara (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: