Details
-
Bug
-
Resolution: Duplicate
-
Critical
-
None
-
Lustre 2.6.0
-
None
-
3
-
12871
Description
John identified the problem when running racer with migration (with patch_set 44):
Making it executable and running racer I see:
BUG: unable to handle kernel paging request at ffff87fd0d0f1a70
IP: [<ffffffff8119c50c>] __d_lookup+0x8c/0x150
PGD 0
Oops: 0000 1 SMP
last sysfs file: /sys/devices/system/cpu/possible
CPU 2
Modules linked in: lustre(U) ofd(U) osp(U) lod(U) ost(U) mdt(U) mdd(U) mgs(U) nodemap(U) osd_ldiskfs(U) ldiskfs(U) lquota(U) lfsck(U) obdecho(U) mgc(U) lov(U) osc(U) mdc(U) lmv(U) fid(U) fld(U) ptlrpc(U) obdclass(U) ksocklnd(U) lnet(U) libcfs(U) ext2 exportfs jbd sha512_generic sha256_generic autofs4 nfs lockd fscache auth_rpcgss nfs_acl sunrpc ipv6 microcode virtio_balloon virtio_net i2c_piix4 i2c_core ext4 jbd2 mbcache virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: libcfs]
Pid: 26388, comm: ls Not tainted 2.6.32-358.18.1.el6.lustre.x86_64 #1 Bochs Bochs
RIP: 0010:[<ffffffff8119c50c>] [<ffffffff8119c50c>] __d_lookup+0x8c/0x150
RSP: 0018:ffff8801345dfbe8 EFLAGS: 00010282
RAX: ffff88020d0662c8 RBX: ffff87fd0d0f1a58 RCX: 0000000000000000
RDX: ffff88012ed23c80 RSI: ffff88020d066362 RDI: ffff880134170015
RBP: ffff8801345dfc38 R08: 00000000fffffffb R09: 00000000fffffffe
R10: 0000000000000000 R11: 0000000000000000 R12: ffff87fd0d0f1a40
R13: ffff8801a32f8540 R14: 0000000000019a9c R15: ffff88020d0662c8
FS: 00007f6d7e01f7a0(0000) GS:ffff880028300000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: ffff87fd0d0f1a70 CR3: 000000012d231000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process ls (pid: 26388, threadinfo ffff8801345de000, task ffff88018765a040)
Stack:
ffff880134170013 000000028118f6e3 0000000000000002 ffff8801345dfd08
<d> 0000000000000001 ffff8801345dfe28 ffff8801345dfd18 ffff8801345dfd08
<d> ffff880216eedac0 ffff8801345dfd18 ffff8801345dfc98 ffffffff81191cc6
Call Trace:
[<ffffffff81191cc6>] do_lookup+0x36/0x230
[<ffffffff811925f4>] __link_path_walk+0x734/0x1030
[<ffffffff8119317a>] path_walk+0x6a/0xe0
[<ffffffff8119334b>] do_path_lookup+0x5b/0xa0
[<ffffffff81183e30>] ? get_empty_filp+0xa0/0x180
[<ffffffff8119428b>] do_filp_open+0xfb/0xdc0
[<ffffffff810e8415>] ? call_rcu_sched+0x15/0x20
[<ffffffff81278dc5>] ? _atomic_dec_and_lock+0x55/0x80
[<ffffffff811a36d0>] ? mntput_no_expire+0x30/0x110
[<ffffffff811a0f22>] ? alloc_fd+0x92/0x160
[<ffffffff8117f849>] do_sys_open+0x69/0x140
[<ffffffff8100c535>] ? math_state_restore+0x45/0x60
[<ffffffff8117f960>] sys_open+0x20/0x30
[<ffffffff8100b072>] system_call_fastpath+0x16/0x1b
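A note on reading the register dump: the faulting address in CR2 lies a few bytes above the values in RBX and R12, which is consistent with (though it does not prove) __d_lookup dereferencing a field through a bad pointer held in those registers. The minimal Python sketch below just redoes that arithmetic with values copied from the trace. The `DIRECT_MAP_START` constant is an assumption about the x86_64 kernel address layout and is not stated in this report, and the helper name `offsets_below` is made up for illustration.

```python
# Values copied verbatim from the oops above.
CR2 = 0xffff87fd0d0f1a70  # faulting address
RBX = 0xffff87fd0d0f1a58
R12 = 0xffff87fd0d0f1a40

# ASSUMPTION: start of the kernel direct mapping on x86_64 for kernels of
# this era; this value does not appear anywhere in the report.
DIRECT_MAP_START = 0xffff880000000000


def offsets_below(fault_addr, regs, window=0x1000):
    """Map register name -> (fault_addr - value) for registers whose value
    sits at most `window` bytes at or below the faulting address."""
    return {
        name: fault_addr - value
        for name, value in regs.items()
        if 0 <= fault_addr - value < window
    }


offsets = offsets_below(CR2, {"RBX": RBX, "R12": R12})
below_direct_map = CR2 < DIRECT_MAP_START

for name, off in offsets.items():
    print(f"CR2 = {name} + {off:#x}")
print("CR2 below assumed direct-map start:", below_direct_map)
```

Under that layout assumption, the address falls below the range where the other pointers in the dump (for example RSP and RAX) live, which would point to a corrupted pointer rather than a freed but otherwise valid one. This is only a reading of the dump, not a confirmed diagnosis.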
I hope patch_set 45 fixes the problem. Racer with migration now passes on my side, but John needs to confirm that.