LU-1852: compilebench causes OSS to crash when using OFD + ldiskfs

Details

    • Bug
    • Resolution: Cannot Reproduce
    • Minor
    • None
    • Lustre 2.3.0
    • 3
    • 10280

    Description

      11:18:21:Lustre: DEBUG MARKER: == parallel-scale test compilebench: compilebench == 11:18:06 (1346955486)
      11:18:21:Lustre: DEBUG MARKER: /usr/sbin/lctl mark .\/compilebench -D \/mnt\/lustre\/d0.compilebench -i 4 -r 4 --makej
      11:18:21:Lustre: DEBUG MARKER: ./compilebench -D /mnt/lustre/d0.compilebench -i 4 -r 4 --makej
      11:19:02:BUG: unable to handle kernel NULL pointer dereference at 0000000000000008
      11:19:02:IP: [<ffffffffa0d40217>] ldiskfs_journal_commit_callback+0x67/0xc0 [ldiskfs]
      11:19:02:PGD 73822067 PUD 7c59d067 PMD 0
      11:19:02:Oops: 0002 [#1] SMP
      11:19:02:last sysfs file: /sys/devices/system/cpu/possible
      11:19:02:CPU 0
      11:19:02:Modules linked in: osd_ldiskfs(U) fsfilt_ldiskfs(U) ldiskfs(U) lustre(U) ofd(U) ost(U) cmm(U) mdt(U) mdd(U) mds(U) mgs(U) jbd2 obdecho(U) mgc(U) lquota(U) lov(U) osc(U) mdc(U) lmv(U) fid(U) fld(U) ptlrpc(U) obdclass(U) lvfs(U) ksocklnd(U) lnet(U) sha512_generic sha256_generic libcfs(U) nfsd lockd nfs_acl auth_rpcgss exportfs autofs4 sunrpc ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 ib_sa ib_mad ib_core microcode virtio_balloon 8139too 8139cp mii i2c_piix4 i2c_core ext3 jbd mbcache virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: speedstep_lib]
      11:19:02:
      11:19:02:Pid: 6927, comm: jbd2/dm-4-8 Not tainted 2.6.32-279.5.1.el6_lustre.g293c36b.x86_64 #1 Red Hat KVM
      11:19:02:RIP: 0010:[<ffffffffa0d40217>] [<ffffffffa0d40217>] ldiskfs_journal_commit_callback+0x67/0xc0 [ldiskfs]
      11:19:02:RSP: 0018:ffff88004e587cf0 EFLAGS: 00010283
      11:19:02:RAX: ffff88004d27c610 RBX: 0000000000000000 RCX: 0000000000000000
      11:19:02:RDX: ffff88004d27c610 RSI: ffff88005b300a90 RDI: ffff88004d0e92a4
      11:19:02:RBP: ffff88004e587d20 R08: 5a5a5a5a5a5a5a5a R09: 5a5a5a5a5a5a5a5a
      11:19:02:R10: 5a5a5a5a5a5a5a5a R11: 0000000000000000 R12: ffff88004d0e92a4
      11:19:02:R13: ffff880051da7000 R14: ffff88005b300a90 R15: 0000000000000000
      11:19:02:FS: 0000000000000000(0000) GS:ffff880002200000(0000) knlGS:0000000000000000
      11:19:02:CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
      11:19:02:CR2: 0000000000000008 CR3: 000000007c59c000 CR4: 00000000000006f0
      11:19:02:DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
      11:19:02:DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
      11:19:02:Process jbd2/dm-4-8 (pid: 6927, threadinfo ffff88004e586000, task ffff88005247e040)
      11:19:02:Stack:
      11:19:02: ffff88004e587d20 ffff880070287b9c ffff88005b3009c0 ffff880070287800
      11:19:02:<d> 0000000000000000 0000000000000011 ffff88004e587e60 ffffffffa08458af
      11:19:02:<d> ffff88004e587d90 ffffffff810096f0 ffff8800566b6078 ffff88004c24c0b8
      11:19:02:Call Trace:
      11:19:02: [<ffffffffa08458af>] jbd2_journal_commit_transaction+0x110f/0x1530 [jbd2]
      11:19:02: [<ffffffff810096f0>] ? __switch_to+0xd0/0x320
      11:19:02: [<ffffffff8107eabb>] ? try_to_del_timer_sync+0x7b/0xe0
      11:19:02: [<ffffffffa084b128>] kjournald2+0xb8/0x220 [jbd2]
      11:19:02: [<ffffffff810920d0>] ? autoremove_wake_function+0x0/0x40
      11:19:02: [<ffffffffa084b070>] ? kjournald2+0x0/0x220 [jbd2]
      11:19:02: [<ffffffff81091d66>] kthread+0x96/0xa0
      11:19:02: [<ffffffff8100c14a>] child_rip+0xa/0x20
      11:19:02: [<ffffffff81091cd0>] ? kthread+0x0/0xa0
      11:19:02: [<ffffffff8100c140>] ? child_rip+0x0/0x20
      11:19:02:Code: e0 49 8b 86 d0 00 00 00 49 81 c6 d0 00 00 00 4c 39 f0 48 8b 18 48 89 c2 74 4a 48 89 d9 eb 08 0f 1f 44 00 00 48 89 cb 48 8b 70 08 <48> 89 71 08 48 89 0e 48 89 10 48 89 50 08 4c 89 e2 c7 02 00 00
      11:19:02:RIP [<ffffffffa0d40217>] ldiskfs_journal_commit_callback+0x67/0xc0 [ldiskfs]
      11:19:02: RSP <ffff88004e587cf0>
      11:19:02:CR2: 0000000000000008
      11:19:02:Initializing cgroup subsys cpuset
      11:19:02:Initializing cgroup subsys cpu
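      For triage, the key fields of the oops above can be pulled out with a short script. This is only a sketch: `summarize_oops` is a hypothetical helper name, not part of Lustre or its test framework, and the regular expressions assume the RHEL 6 (2.6.32) oops layout shown in this log. The sample text is copied from the console output above.

```python
import re

# A few lines copied verbatim from the console log in this ticket.
OOPS = """\
11:19:02:BUG: unable to handle kernel NULL pointer dereference at 0000000000000008
11:19:02:IP: [<ffffffffa0d40217>] ldiskfs_journal_commit_callback+0x67/0xc0 [ldiskfs]
11:19:02:Pid: 6927, comm: jbd2/dm-4-8 Not tainted 2.6.32-279.5.1.el6_lustre.g293c36b.x86_64 #1 Red Hat KVM
11:19:02:CR2: 0000000000000008
"""


def summarize_oops(text):
    """Extract fault address, faulting symbol/offset/module and task name.

    Hypothetical helper: assumes the 'BUG:', 'IP:' and 'comm:' lines are
    formatted as in the log above.
    """
    addr = re.search(r"NULL pointer dereference at ([0-9a-f]+)", text)
    ip = re.search(
        r"IP: \[<[0-9a-f]+>\] (\w+)\+(0x[0-9a-f]+)/(0x[0-9a-f]+) \[(\w+)\]", text
    )
    comm = re.search(r"comm: (\S+)", text)
    return {
        "fault_address": int(addr.group(1), 16),  # value also reported in CR2
        "symbol": ip.group(1),                    # function containing the faulting IP
        "offset": int(ip.group(2), 16),           # offset of the IP inside that function
        "size": int(ip.group(3), 16),             # total size of that function
        "module": ip.group(4),                    # kernel module owning the symbol
        "comm": comm.group(1),                    # name of the task that faulted
    }


summary = summarize_oops(OOPS)
print(summary)
```

      The extracted values match what the log reports directly: the fault is in `ldiskfs_journal_commit_callback` (module `ldiskfs`), in the `jbd2/dm-4-8` journal thread, at address 0x8, which is a small offset from a NULL pointer.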

          People

            wc-triage WC Triage
            mdiep Minh Diep