Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-4683

panic during racer with migration

    XMLWordPrintable

Details

    • Bug
    • Resolution: Duplicate
    • Critical
    • None
    • Lustre 2.6.0
    • None
    • 3
    • 12871

    Description

      John identify the problem when running racer with migration(with patch_set 44)

      Making it executable and running racer I see:
      BUG: unable to handle kernel paging request at ffff87fd0d0f1a70 IP: [<ffffffff8119c50c>] __d_lookup+0x8c/0x150 PGD 0 Oops: 0000 1 SMP last sysfs file: /sys/devices/system/cpu/possible CPU 2 Modules linked in: lustre(U) ofd(U) osp(U) lod(U) ost(U) mdt(U) mdd(U) mgs(U) nodemap(U) osd_ldiskfs(U) ldiskfs(U) lquota(U) lfsck(U) obdecho(U) mgc(U) lov(U) osc(U) mdc(U) lmv(U) fid(U) fld(U) ptlrpc(U) obdclass(U) ksocklnd(U) lnet(U) libcfs(U) ext2 exportfs jbd sha512_generic sha256_generic autofs4 nfs lockd fscache auth_rpcgss nfs_acl sunrpc ipv6 microcode virtio_balloon virtio_net i2c_piix4 i2c_core ext4 jbd2 mbcache virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: libcfs]
      Pid: 26388, comm: ls Not tainted 2.6.32-358.18.1.el6.lustre.x86_64 #1 Bochs Bochs
      RIP: 0010:[<ffffffff8119c50c>] [<ffffffff8119c50c>] __d_lookup+0x8c/0x150
      RSP: 0018:ffff8801345dfbe8 EFLAGS: 00010282
      RAX: ffff88020d0662c8 RBX: ffff87fd0d0f1a58 RCX: 0000000000000000
      RDX: ffff88012ed23c80 RSI: ffff88020d066362 RDI: ffff880134170015
      RBP: ffff8801345dfc38 R08: 00000000fffffffb R09: 00000000fffffffe
      R10: 0000000000000000 R11: 0000000000000000 R12: ffff87fd0d0f1a40
      R13: ffff8801a32f8540 R14: 0000000000019a9c R15: ffff88020d0662c8
      FS: 00007f6d7e01f7a0(0000) GS:ffff880028300000(0000) knlGS:0000000000000000
      CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      CR2: ffff87fd0d0f1a70 CR3: 000000012d231000 CR4: 00000000000006e0
      DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
      DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
      Process ls (pid: 26388, threadinfo ffff8801345de000, task ffff88018765a040)
      Stack:
      ffff880134170013 000000028118f6e3 0000000000000002 ffff8801345dfd08
      <d> 0000000000000001 ffff8801345dfe28 ffff8801345dfd18 ffff8801345dfd08
      <d> ffff880216eedac0 ffff8801345dfd18 ffff8801345dfc98 ffffffff81191cc6
      Call Trace:
      [<ffffffff81191cc6>] do_lookup+0x36/0x230
      [<ffffffff811925f4>] __link_path_walk+0x734/0x1030
      [<ffffffff8119317a>] path_walk+0x6a/0xe0
      [<ffffffff8119334b>] do_path_lookup+0x5b/0xa0
      [<ffffffff81183e30>] ? get_empty_filp+0xa0/0x180
      [<ffffffff8119428b>] do_filp_open+0xfb/0xdc0
      [<ffffffff810e8415>] ? call_rcu_sched+0x15/0x20
      [<ffffffff81278dc5>] ? _atomic_dec_and_lock+0x55/0x80
      [<ffffffff811a36d0>] ? mntput_no_expire+0x30/0x110
      [<ffffffff811a0f22>] ? alloc_fd+0x92/0x160
      [<ffffffff8117f849>] do_sys_open+0x69/0x140
      [<ffffffff8100c535>] ? math_state_restore+0x45/0x60
      [<ffffffff8117f960>] sys_open+0x20/0x30
      [<ffffffff8100b072>] system_call_fastpath+0x16/0x1b

      I hope patch_set 45 can fix the problem. And racer with migration can pass on my side now. But John needs to confirm that.

      Attachments

        Issue Links

          Activity

            People

              di.wang Di Wang
              di.wang Di Wang
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: