Lustre / LU-7406

OSSs crash immediately after running lctl lfsck_start -M fsLustre-MDT0000 -t layout

Details

    • Bug
    • Resolution: Cannot Reproduce
    • Major
    • None
    • Lustre 2.7.0
    • None
    • Two metadata servers (MDS) in HA with pcsd, and eight object storage servers (OSS) with five OSTs each, in active/active mode with pcsd.
    • 3
    • 9223372036854775807

    Description

      <4>-----------[ cut here ]-----------
      <2>kernel BUG at fs/jbd2/transaction.c:1030!
      <4>invalid opcode: 0000 [#1] SMP
      <4>last sysfs file: /sys/devices/pci0000:00/0000:00:02.2/0000:03:00.0/host2/target2:2:0/2:2:0:0/state
      <4>CPU 0
      <4>Modules linked in: osp(U) ofd(U) lfsck(U) ost(U) mgc(U) osd_ldiskfs(U) lquota(U) ldiskfs(U) lustre(U) lov(U) mdc(U) fid(U) lmv(U) fld(U) ptlrpc(U) obdclass(U) ksocklnd(U) ko2iblnd(U) lnet(U) sha512_generic crc32c_intel libcfs(U) autofs4 sunrpc ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm iTCO_wdt iTCO_vendor_support microcode dcdbas sb_edac edac_core shpchp lpc_ich mfd_core ipmi_devintf power_meter acpi_ipmi ipmi_si ipmi_msghandler sg tg3 ptp pps_core ext4 jbd2 mbcache dm_round_robin scsi_dh_rdac sr_mod cdrom sd_mod crc_t10dif ahci wmi mlx4_ib ib_sa ib_mad ib_core ib_addr ipv6 mlx4_core megaraid_sas mpt2sas scsi_transport_sas raid_class dm_multipath dm_mirror dm_region_hash dm_log dm_mod [last unloaded: speedstep_lib]
      <4>
      <4>Pid: 37823, comm: lfsck Not tainted 2.6.32-504.8.1.el6_lustre.x86_64 #1 Dell Inc. PowerEdge R720/0H5J4J
      <4>RIP: 0010:[<ffffffffa01c379d>] [<ffffffffa01c379d>] jbd2_journal_dirty_metadata+0x10d/0x150 [jbd2]
      <4>RSP: 0018:ffff881ff4e67a00 EFLAGS: 00010246
      <4>RAX: ffff880f3aecc6c0 RBX: ffff880f1bf23b58 RCX: ffff881019bf2818
      <4>RDX: 0000000000000000 RSI: ffff881019bf2818 RDI: 0000000000000000
      <4>RBP: ffff881ff4e67a20 R08: ffff881019bf2818 R09: 0000000000000018
      <4>R10: 0000000000480403 R11: ffff880ffa4fa9e8 R12: ffff880f1c1f7d68
      <4>R13: ffff881019bf2818 R14: ffff881013a5e000 R15: 0000000000000000
      <4>FS: 0000000000000000(0000) GS:ffff880062000000(0000) knlGS:0000000000000000
      <4>CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
      <4>CR2: 00007faa2c61b000 CR3: 0000002026577000 CR4: 00000000001407f0
      <4>DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
      <4>DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
      <4>Process lfsck (pid: 37823, threadinfo ffff881ff4e66000, task ffff881ff5512ae0)
      <4>Stack:
      <4> ffff880f1bf23b58 ffffffffa0beb710 ffff881019bf2818 0000000000000000
      <4><d> ffff881ff4e67a60 ffffffffa0baa00b ffff881ff4e67aa0 ffffffffa0be6af3
      <4><d> ffff880ffa4fa900 ffff880ffc784af0 ffff880ffc784a20 ffff881ff4e67b28
      <4>Call Trace:
      <4> [<ffffffffa0baa00b>] __ldiskfs_handle_dirty_metadata+0x7b/0x100 [ldiskfs]
      <4> [<ffffffffa0be6af3>] ? ldiskfs_xattr_set_entry+0x4e3/0x4f0 [ldiskfs]
      <4> [<ffffffffa0bb5d9a>] ldiskfs_mark_iloc_dirty+0x52a/0x630 [ldiskfs]
      <4> [<ffffffffa0be8abc>] ldiskfs_xattr_set_handle+0x33c/0x560 [ldiskfs]
      <4> [<ffffffffa0cc5cd8>] ? osd_iit_iget+0x118/0x330 [osd_ldiskfs]
      <4> [<ffffffffa0be8ddc>] ldiskfs_xattr_set+0xfc/0x1a0 [ldiskfs]
      <4> [<ffffffffa0be900e>] ldiskfs_xattr_trusted_set+0x2e/0x30 [ldiskfs]
      <4> [<ffffffff811b4722>] generic_setxattr+0xa2/0xb0
      <4> [<ffffffffa0c9a90d>] __osd_xattr_set+0x8d/0xe0 [osd_ldiskfs]
      <4> [<ffffffffa0ca2005>] osd_xattr_set+0x3a5/0x4b0 [osd_ldiskfs]
      <4> [<ffffffffa0d5f446>] lfsck_master_oit_engine+0x14c6/0x1ef0 [lfsck]
      <4> [<ffffffffa0d6094e>] lfsck_master_engine+0xade/0x13e0 [lfsck]
      <4> [<ffffffff81064b90>] ? default_wake_function+0x0/0x20
      <4> [<ffffffffa0d5fe70>] ? lfsck_master_engine+0x0/0x13e0 [lfsck]
      <4> [<ffffffff8109e66e>] kthread+0x9e/0xc0
      <4> [<ffffffff8100c20a>] child_rip+0xa/0x20
      <4> [<ffffffff8109e5d0>] ? kthread+0x0/0xc0
      <4> [<ffffffff8100c200>] ? child_rip+0x0/0x20
      <4>Code: c6 9c 03 00 00 4c 89 f7 e8 91 9f 36 e1 48 8b 33 ba 01 00 00 00 4c 89 e7 e8 b1 ec ff ff 4c 89 f0 66 ff 00 66 66 90 e9 73 ff ff ff <0f> 0b eb fe 0f 0b eb fe 0f 0b 66 0f 1f 84 00 00 00 00 00 eb f5
      <1>RIP [<ffffffffa01c379d>] jbd2_journal_dirty_metadata+0x10d/0x150 [jbd2]
      <4> RSP <ffff881ff4e67a00>
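
The call path in the trace above is easier to follow once the speculative frames (the ones marked `?`) are dropped. The following is a minimal sketch of that filtering, not part of the original report: the function name `reliable_frames`, the shortened `SAMPLE` excerpt, and the `kernel` label for frames without a module suffix are all assumptions made for illustration.

```python
import re

# Matches frames such as " [<ffffffffa0baa00b>] ? sym+0x7b/0x100 [module]".
FRAME = re.compile(
    r"\[<[0-9a-f]+>\]\s+(?P<guess>\?\s+)?(?P<sym>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?:\s+\[(?P<mod>\w+)\])?"
)


def reliable_frames(log_text):
    """Return (symbol, module) pairs for frames after 'Call Trace:' not marked '?'."""
    frames = []
    in_trace = False
    for line in log_text.splitlines():
        if "Call Trace:" in line:
            in_trace = True
            continue
        if not in_trace:
            continue
        m = FRAME.search(line)
        if m is None:
            break  # first non-frame line (e.g. "Code:") ends the trace
        if m.group("guess"):
            continue  # '?' frames are stack leftovers the unwinder could not confirm
        # "kernel" is a hypothetical label for frames with no [module] suffix.
        frames.append((m.group("sym"), m.group("mod") or "kernel"))
    return frames


# Shortened excerpt of the trace in this report.
SAMPLE = """\
<4>Call Trace:
<4> [<ffffffffa0baa00b>] __ldiskfs_handle_dirty_metadata+0x7b/0x100 [ldiskfs]
<4> [<ffffffffa0be6af3>] ? ldiskfs_xattr_set_entry+0x4e3/0x4f0 [ldiskfs]
<4> [<ffffffff811b4722>] generic_setxattr+0xa2/0xb0
<4> [<ffffffffa0ca2005>] osd_xattr_set+0x3a5/0x4b0 [osd_ldiskfs]
<4> [<ffffffffa0d5f446>] lfsck_master_oit_engine+0x14c6/0x1ef0 [lfsck]
<4>Code: c6 9c 03 00
"""

if __name__ == "__main__":
    # Print outermost caller first, so the path reads top-down.
    for sym, mod in reversed(reliable_frames(SAMPLE)):
        print(f"{mod}: {sym}")
```

Read from the outermost caller inward, the confirmed frames show the lfsck thread setting an extended attribute through osd_ldiskfs and ldiskfs, which ends in the jbd2_journal_dirty_metadata assertion reported at the top of the log.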

          People

            wc-triage WC Triage
            ap0 Apolinar Martine
            Votes: 0
            Watchers: 2
