Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-3366

Test failure obdfilter-survey, subtest test_1c: oom-killer

    XMLWordPrintable

Details

    • Bug
    • Resolution: Cannot Reproduce
    • Minor
    • None
    • Lustre 2.4.2, Lustre 2.5.1
    • None
    • 3
    • 8329

    Description

      This issue was created by maloo for James Nunez <james.a.nunez@intel.com>

      This issue relates to the following test suite run: http://maloo.whamcloud.com/test_sets/4ac8e14a-bf36-11e2-88e0-52540035b04c.

      The sub-test test_1c failed with the following error:

      test failed to respond and timed out

      I see the following several times in the OSS console log:

      06:25:50:lctl invoked oom-killer: gfp_mask=0x280da, order=0, oom_adj=0, oom_score_adj=0
      

      OSS stack trace looks like:

      08:48:13: [<ffffffff81195731>] sys_ioctl+0x81/0xa0
      08:48:13: [<ffffffff810dc645>] ? __audit_syscall_exit+0x265/0x290
      08:48:13: [<ffffffff8100b072>] system_call_fastpath+0x16/0x1b
      08:48:13:INFO: task lctl:11345 blocked for more than 120 seconds.
      08:48:13:"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
      08:48:13:lctl          D 0000000000000000     0 11345  11038 0x00000080
      08:48:13: ffff880031179588 0000000000000082 ffff8800ffffffff 000088f8cde20d22
      08:48:13: ffff88007a65b380 ffff880073cded70 0000000001306642 ffffffffae6a97e1
      08:48:13: ffff880053b15ab8 ffff880031179fd8 000000000000fb88 ffff880053b15ab8
      08:48:13:Call Trace:
      08:48:13: [<ffffffff810a1ac9>] ? ktime_get_ts+0xa9/0xe0
      08:48:13: [<ffffffff8150e723>] io_schedule+0x73/0xc0
      08:48:13: [<ffffffff8125ea08>] get_request_wait+0x108/0x1d0
      08:48:13: [<ffffffff81096ca0>] ? autoremove_wake_function+0x0/0x40
      08:48:13: [<ffffffff81255c8b>] ? elv_merge+0x1cb/0x200
      08:48:13: [<ffffffff8125eb4d>] blk_queue_bio+0x7d/0x5a0
      08:48:13: [<ffffffff8125d1fe>] generic_make_request+0x26e/0x550
      08:48:13: [<ffffffff8111c713>] ? mempool_alloc+0x63/0x140
      08:48:13: [<ffffffff8125d56d>] submit_bio+0x8d/0x120
      08:48:13: [<ffffffffa103e39e>] ? lprocfs_oh_tally+0x2e/0x50 [obdclass]
      08:48:13: [<ffffffffa168adac>] osd_submit_bio+0x1c/0x60 [osd_ldiskfs]
      08:48:13: [<ffffffffa168b1cc>] osd_do_bio+0x3dc/0x800 [osd_ldiskfs]
      08:48:13: [<ffffffffa001702c>] ? fsfilt_map_nblocks+0xcc/0xf0 [fsfilt_ldiskfs]
      08:48:13: [<ffffffffa00172d5>] ? fsfilt_ldiskfs_map_inode_pages+0x85/0x90 [fsfilt_ldiskfs]
      08:48:13: [<ffffffffa168d788>] osd_read_prep+0x338/0x3b0 [osd_ldiskfs]
      08:48:13: [<ffffffffa0494a43>] ofd_preprw_read+0x253/0x7f0 [ofd]
      08:48:13: [<ffffffffa049574a>] ofd_preprw+0x76a/0x13c0 [ofd]
      08:48:13: [<ffffffffa05734eb>] echo_client_iocontrol+0x207b/0x3bd0 [obdecho]
      08:48:13: [<ffffffff81143767>] ? handle_pte_fault+0xf7/0xb50
      08:48:13: [<ffffffffa103247f>] class_handle_ioctl+0x12cf/0x1e90 [obdclass]
      08:48:13: [<ffffffffa101a2ab>] obd_class_ioctl+0x4b/0x190 [obdclass]
      08:48:13: [<ffffffff81195012>] vfs_ioctl+0x22/0xa0
      08:48:13: [<ffffffff8103c7b8>] ? pvclock_clocksource_read+0x58/0xd0
      08:48:13: [<ffffffff811951b4>] do_vfs_ioctl+0x84/0x580
      08:48:13: [<ffffffff81195731>] sys_ioctl+0x81/0xa0
      08:48:13: [<ffffffff810dc645>] ? __audit_syscall_exit+0x265/0x290
      08:48:13: [<ffffffff8100b072>] system_call_fastpath+0x16/0x1b
      08:48:13:
      08:48:13:<ConMan> Console [wtm-14vm8] disconnected from <wtm-14:6007> at 05-17 08:47.
      

      Info required for matching: obdfilter-survey 1c

      Attachments

        Issue Links

          Activity

            People

              wc-triage WC Triage
              maloo Maloo
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: