  1. Lustre
  2. LU-7798

ll_prep_inode()) ASSERTION( fid_is_sane(&md.body->mbo_fid1) ) failed:


Details

    • Type: Bug
    • Resolution: Duplicate
    • Priority: Critical
    • None
    • None
    • Environment: lola
      build: 2.8.50-6-gf9ca359; commit f9ca359284357d145819beb08b316e932f7a3060
    • 3
    • 9223372036854775807
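
The build string in the details above looks like `git describe` output (tag, number of commits since the tag, then `g` plus an abbreviated hash). Assuming that reading is correct, the following minimal sketch splits the string and checks that the abbreviated hash agrees with the full commit ID quoted next to it. The helper name `parse_describe` is hypothetical and exists only for this illustration.

```python
import re

# Values copied from the issue details.
build = "2.8.50-6-gf9ca359"
commit = "f9ca359284357d145819beb08b316e932f7a3060"


def parse_describe(version):
    """Split a `git describe`-style string into (tag, commits_since_tag, short_hash).

    Assumption: the build string follows the usual tag-count-g<hash> layout.
    """
    match = re.fullmatch(r"(?P<tag>.+)-(?P<count>\d+)-g(?P<short>[0-9a-f]+)", version)
    if match is None:
        raise ValueError(f"not in tag-count-ghash form: {version!r}")
    return match.group("tag"), int(match.group("count")), match.group("short")


tag, count, short_hash = parse_describe(build)
# The abbreviated hash should be a prefix of the full commit ID.
print(tag, count, short_hash, commit.startswith(short_hash))
```

If the final value printed were `False`, the build label and the commit ID would refer to different revisions and the report would need correcting.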

    Description

      The error occurred during soak testing of build '20160218' (see: https://wiki.hpdd.intel.com/display/Releases/Soak+Testing+on+Lola#SoakTestingonLola-20160218). DNE is enabled.
      The MDTs were formatted using ldiskfs, the OSTs using ZFS.

      Event history:

      • 2016-02-18 16:24:27,115:fsmgmt.fsmgmt:INFO reseting MDS node lola-10
      • 2016-02-18-16:34:04 Lustre client lola-26 crashed with LBUG
      • 2016-02-18-16:34:06 Lustre client lola-29 crashed with LBUG
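
To put the event history in perspective, the short sketch below computes how long after the MDS reset each client hit the LBUG. It assumes the clocks of the node that logged the reset and of the two clients are synchronized, and it drops the milliseconds of the reset timestamp; the helper name `seconds_after_reset` is hypothetical.

```python
from datetime import datetime

# Timestamps copied from the event history (milliseconds of the reset dropped).
mds_reset = datetime(2016, 2, 18, 16, 24, 27)
client_crashes = {
    "lola-26": datetime(2016, 2, 18, 16, 34, 4),
    "lola-29": datetime(2016, 2, 18, 16, 34, 6),
}


def seconds_after_reset(crash_time, reset_time=mds_reset):
    """Return whole seconds between the MDS reset and a client crash."""
    return int((crash_time - reset_time).total_seconds())


for node, crash_time in client_crashes.items():
    delay = seconds_after_reset(crash_time)
    print(f"{node}: LBUG {delay // 60} min {delay % 60} s after the reset of lola-10")
```

Under those assumptions both clients crashed a little under ten minutes after the reset and within two seconds of each other.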

      The error message and stack trace are the same for both events:

      <6>Lustre: soaked-MDT0004-mdc-ffff880852d1a000: Connection restored to 192.168.1.110@o2ib10 (at 192.168.1.110@o2ib10)
      <0>LustreError: 23705:0:(llite_lib.c:2295:ll_prep_inode()) ASSERTION( fid_is_sane(&md.body->mbo_fid1) ) failed:
      <0>LustreError: 23705:0:(llite_lib.c:2295:ll_prep_inode()) LBUG
      <4>Pid: 23705, comm: pct
      <4>
      <4>Call Trace:
      <4> [<ffffffffa050b875>] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
      <4> [<ffffffffa050be77>] lbug_with_loc+0x47/0xb0 [libcfs]
      <4> [<ffffffffa0aa0192>] ll_prep_inode+0x752/0xc40 [lustre]
      <4> [<ffffffffa07f6d60>] ? lustre_swab_mdt_body+0x0/0x130 [ptlrpc]
      <4> [<ffffffffa0ab42b2>] ll_new_node+0x682/0x7f0 [lustre]
      <4> [<ffffffffa0ab6b04>] ll_mkdir+0x104/0x220 [lustre]
      <4> [<ffffffff8122ec0f>] ? security_inode_permission+0x1f/0x30
      <4> [<ffffffff8119d759>] vfs_mkdir+0xd9/0x140
      <4> [<ffffffff811a04e7>] sys_mkdirat+0xc7/0x1b0
      <4> [<ffffffff8100c6f5>] ? math_state_restore+0x45/0x60
      <4> [<ffffffff811a05e8>] sys_mkdir+0x18/0x20
      <4> [<ffffffff8100b0d2>] system_call_fastpath+0x16/0x1b
      <4>
      <0>Kernel panic - not syncing: LBUG
      <4>Pid: 23705, comm: pct Not tainted 2.6.32-504.30.3.el6.x86_64 #1
      <4>Call Trace:
      <4> [<ffffffff815293fc>] ? panic+0xa7/0x16f
      <4> [<ffffffffa050becb>] ? lbug_with_loc+0x9b/0xb0 [libcfs]
      <4> [<ffffffffa0aa0192>] ? ll_prep_inode+0x752/0xc40 [lustre]
      <4> [<ffffffffa07f6d60>] ? lustre_swab_mdt_body+0x0/0x130 [ptlrpc]
      <4> [<ffffffffa0ab42b2>] ? ll_new_node+0x682/0x7f0 [lustre]
      <4> [<ffffffffa0ab6b04>] ? ll_mkdir+0x104/0x220 [lustre]
      <4> [<ffffffff8122ec0f>] ? security_inode_permission+0x1f/0x30
      <4> [<ffffffff8119d759>] ? vfs_mkdir+0xd9/0x140
      <4> [<ffffffff811a04e7>] ? sys_mkdirat+0xc7/0x1b0
      <4> [<ffffffff8100c6f5>] ? math_state_restore+0x45/0x60
      <4> [<ffffffff811a05e8>] ? sys_mkdir+0x18/0x20
      <4> [<ffffffff8100b0d2>] ? system_call_fastpath+0x16/0x1b
      
      • No errors on the Lustre server nodes can be correlated with the events.

      Attached are the console, messages, and vmcore-dmesg.txt files from both clients.
      Crash files are available.
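
When comparing the traces from the two clients, it can help to reduce each one to its reliable frames. In kernel stack dumps a `?` before the symbol marks an address that was found on the stack but is not confirmed to be part of the call chain. The following is a minimal sketch of such a reduction, not a tool used in this report; the names `FRAME_RE`, `parse_frames` and `reliable_path` are hypothetical, and the sample is an excerpt of the first call trace in the description.

```python
import re

# Matches a frame such as
# "<4> [<ffffffffa0aa0192>] ? ll_prep_inode+0x752/0xc40 [lustre]".
FRAME_RE = re.compile(
    r"\[<(?P<addr>[0-9a-f]+)>\]\s+(?P<unreliable>\?\s+)?"
    r"(?P<func>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?:[ \t]+\[(?P<module>\w+)\])?"
)


def parse_frames(log_text):
    """Return (function, module, reliable) tuples for every stack frame found."""
    frames = []
    for match in FRAME_RE.finditer(log_text):
        reliable = match.group("unreliable") is None
        frames.append((match.group("func"), match.group("module"), reliable))
    return frames


def reliable_path(log_text):
    """Return only the function names of frames not marked with '?'."""
    return [func for func, _module, reliable in parse_frames(log_text) if reliable]


sample = """\
<4> [<ffffffffa050be77>] lbug_with_loc+0x47/0xb0 [libcfs]
<4> [<ffffffffa0aa0192>] ll_prep_inode+0x752/0xc40 [lustre]
<4> [<ffffffffa07f6d60>] ? lustre_swab_mdt_body+0x0/0x130 [ptlrpc]
<4> [<ffffffffa0ab42b2>] ll_new_node+0x682/0x7f0 [lustre]
<4> [<ffffffffa0ab6b04>] ll_mkdir+0x104/0x220 [lustre]
<4> [<ffffffff8119d759>] vfs_mkdir+0xd9/0x140
"""

# Innermost frame first, as in the kernel log.
print(" <- ".join(reliable_path(sample)))
```

Running `reliable_path` on the traces from lola-26 and lola-29 and comparing the resulting lists is one way to confirm that both clients failed on the same `mkdir` path.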

      Attachments

        1. messages-lola-29.log.bz2
          248 kB
        2. messages-lola-26.log.bz2
          235 kB
        3. lola-29-vmcore-dmesg.txt.bz2
          27 kB
        4. lola-26-vmcore-dmesg.txt.bz2
          30 kB
        5. console-lola-29.log.bz2
          72 kB
        6. console-lola-26.log.bz2
          63 kB

            People

              Assignee: WC Triage (wc-triage)
              Reporter: Frank Heckes (heckes, Inactive)