Lustre / LU-1979

SWL - MDS crash after recovery: (osd_iam_lfix.c:190:iam_lfix_init()) Wrong magic in node 81689 (#56): 0x0 != 0x1976 or wrong count

Details

    • Type: Bug
    • Resolution: Duplicate
    • Priority: Blocker
    • Fix Version/s: None
    • Affects Version/s: Lustre 2.3.0
    • Labels: None
    • Environment: LLNL Hyperion
    • Severity: 3
    • Rank (Obsolete): 6318

    Description

      The MDS crashes hard after completing recovery.

      2012-09-19 07:40:19 Lustre: MDS mdd_obd-lustre-MDT0000: lustre-OST000e_UUID now active, resetting orphans
      2012-09-19 07:40:19 Lustre: MDS mdd_obd-lustre-MDT0000: lustre-OST0026_UUID now active, resetting orphans
      2012-09-19 07:40:19 Lustre: Skipped 16 previous similar messages
      2012-09-19 07:40:28 LustreError: 4748:0:(osd_iam_lfix.c:190:iam_lfix_init()) Wrong magic in node 81689 (#56): 0x0 != 0x1976 or wrong count: 0
      Initializing cgroup subsys cpuset
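
      The LustreError line reports that a leaf node header was read with magic 0x0 where 0x1976 was expected, and with a count of 0. The following is a minimal sketch of the kind of sanity check that produces such a message. It is not the Lustre source: the struct layout, the names leaf_head and leaf_check, the max_count limit, and the host-endian fields are all assumptions made for illustration. Only the expected magic value and the wording of the message come from the log above.

```c
#include <stdint.h>
#include <stdio.h>

/* Expected magic, taken from the log message above. */
#define LEAF_MAGIC 0x1976u

/* Hypothetical leaf header: layout and field names are assumptions. */
struct leaf_head {
    uint16_t magic;  /* identifies the block as a leaf node */
    uint16_t count;  /* number of entries stored in the leaf */
};

/*
 * Return 0 if the header looks sane, -1 otherwise.
 * max_count is a hypothetical upper bound on entries per leaf.
 */
static int leaf_check(const struct leaf_head *h, unsigned long long node,
                      unsigned max_count)
{
    if (h->magic == LEAF_MAGIC && h->count <= max_count)
        return 0;

    /* Mirror the shape of the message seen in the console log. */
    fprintf(stderr,
            "Wrong magic in node %llu: %#x != %#x or wrong count: %u\n",
            node, (unsigned)h->magic, LEAF_MAGIC, (unsigned)h->count);
    return -1;
}
```

      Under these assumptions, a zero-filled block (magic 0x0, count 0) fails the check, which matches the values reported for node 81689.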

      Backtrace:

       bt
      PID: 4439   TASK: ffff88032a1ee040  CPU: 1   COMMAND: "mdt02_000"
       #0 [ffff8802cf7756f0] machine_kexec at ffffffff8103281b
       #1 [ffff8802cf775750] crash_kexec at ffffffff810ba792
       #2 [ffff8802cf775820] oops_end at ffffffff81501700
       #3 [ffff8802cf775850] no_context at ffffffff81043bab
       #4 [ffff8802cf7758a0] __bad_area_nosemaphore at ffffffff81043e35
       #5 [ffff8802cf7758f0] bad_area_nosemaphore at ffffffff81043f03
       #6 [ffff8802cf775900] __do_page_fault at ffffffff81044661
       #7 [ffff8802cf775a20] do_page_fault at ffffffff815036de
       #8 [ffff8802cf775a50] page_fault at ffffffff81500a95
          [exception RIP: lu_context_key_get+27]
          RIP: ffffffffa072f00b  RSP: ffff8802cf775b00  RFLAGS: 00010246
          RAX: 0000000000000015  RBX: ffff88014362c8c0  RCX: ffffffffa076546f
          RDX: 0000000000000000  RSI: ffffffffa0ee14e0  RDI: ffff880116f9f4c0
          RBP: ffff8802cf775b00   R8: fffffffffffffffe   R9: 0000000000000000
          R10: 0000000000000000  R11: 0000000000000004  R12: ffff8802cf775b60
          R13: ffff880116f9f4c0  R14: ffffffffa076546f  R15: ffff88012f4436f0
          ORIG_RAX: ffffffffffffffff  CS: 0010  SS: 0018
       #9 [ffff8802cf775b08] osd_xattr_get at ffffffffa0ebaf8f [osd_ldiskfs]
      #10 [ffff8802cf775b58] dt_version_get at ffffffffa07330d4 [obdclass]
      #11 [ffff8802cf775b88] mdt_obj_version_get at ffffffffa0e297cc [mdt]
      #12 [ffff8802cf775bb8] mdt_version_get_check_save at ffffffffa0e29d0f [mdt]
      #13 [ffff8802cf775be8] mdt_md_create at ffffffffa0e2a03d [mdt]
      #14 [ffff8802cf775c68] mdt_reint_create at ffffffffa0e2a6b3 [mdt]
      #15 [ffff8802cf775ca8] mdt_reint_rec at ffffffffa0e28151 [mdt]
      #16 [ffff8802cf775cc8] mdt_reint_internal at ffffffffa0e219aa [mdt]
      #17 [ffff8802cf775d18] mdt_reint at ffffffffa0e21cf4 [mdt]
      #18 [ffff8802cf775d38] mdt_handle_common at ffffffffa0e15802 [mdt]
      #19 [ffff8802cf775d88] mdt_regular_handle at ffffffffa0e166f5 [mdt]
      #20 [ffff8802cf775d98] ptlrpc_server_handle_request at ffffffffa08b199d [ptlrpc]
      #21 [ffff8802cf775e98] ptlrpc_main at ffffffffa08b2f89 [ptlrpc]
      #22 [ffff8802cf775f48] kernel_thread at ffffffff8100c14a
      

          People

            Assignee: yong.fan nasf (Inactive)
            Reporter: cliffw Cliff White (Inactive)