[LU-4143] MDS OOPS sanity-hsm/test_52 NULL Pointer deref: mdt_save_lock Created: 24/Oct/13 Updated: 25/Oct/13 Resolved: 25/Oct/13 |
|
| Status: | Closed |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.6.0 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Critical |
| Reporter: | Maloo | Assignee: | WC Triage |
| Resolution: | Duplicate | Votes: | 0 |
| Labels: | HSM |
| Issue Links: |
| Severity: | 3 |
| Rank (Obsolete): | 11246 |
| Description |
|
This issue was created by maloo for Nathaniel Clark <nathaniel.l.clark@intel.com>

This issue relates to the following test suite run:

The sub-test test_52 failed with the following error:

Info required for matching: sanity-hsm 52

This is happening on both ldiskfs and zfs runs.

MDS Log:
06:28:58:Lustre: DEBUG MARKER: == sanity-hsm test 52: Opened for write file on an evicted client should be set dirty == 06:28:52 (1382362132)
06:28:58:Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200002343:0x13:0x0'.*action='ARCHIVE'/ {print $13}' | cut -f2 -d=
06:28:58:Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200002343:0x13:0x0'.*action='ARCHIVE'/ {print $13}' | cut -f2 -d=
06:28:58:Lustre: DEBUG MARKER: /usr/sbin/lctl set_param -n mdt.lustre-MDT0000.evict_client 769b9351-dd26-cfcd-1780-47c10f7077b5
06:28:58:Lustre: 12734:0:(genops.c:1496:obd_export_evict_by_uuid()) lustre-MDT0000: evicting 769b9351-dd26-cfcd-1780-47c10f7077b5 at adminstrative request
06:28:58:BUG: unable to handle kernel NULL pointer dereference at 00000000000001f0
06:28:58:IP: [<ffffffffa0b892fc>] mdt_save_lock+0x1ec/0x300 [mdt]
06:28:58:PGD 7d1aa067 PUD 7c55e067 PMD 0
06:28:58:Oops: 0000 [#1] SMP
06:28:58:last sysfs file: /sys/devices/system/cpu/possible
06:28:58:CPU 0
06:28:58:Modules linked in: osp(U) mdd(U) lfsck(U) lod(U) mdt(U) mgs(U) mgc(U) fsfilt_ldiskfs(U) osd_ldiskfs(U) lquota(U) lustre(U) lov(U) osc(U) mdc(U) fid(U) fld(U) ksocklnd(U) ptlrpc(U) obdclass(U) lnet(U) lvfs(U) libcfs(U) ldiskfs(U) sha512_generic sha256_generic jbd2 nfsd exportfs autofs4 nfs lockd fscache auth_rpcgss nfs_acl sunrpc ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 ib_sa ib_mad ib_core microcode virtio_balloon 8139too 8139cp mii i2c_piix4 i2c_core ext3 jbd mbcache virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: lnet_selftest]
06:28:58:
06:28:58:Pid: 12734, comm: lctl Not tainted 2.6.32-358.18.1.el6_lustre.gb035853.x86_64 #1 Red Hat KVM
06:28:58:RIP: 0010:[<ffffffffa0b892fc>] [<ffffffffa0b892fc>] mdt_save_lock+0x1ec/0x300 [mdt]
06:28:58:RSP: 0018:ffff880066fd9b78 EFLAGS: 00010202
06:28:58:RAX: 0000000000000000 RBX: ffff880067fee050 RCX: 0000000000000000
06:28:58:RDX: 0000000000000000 RSI: ffffffffa0bdf208 RDI: ffffffffa0c06720
06:28:58:RBP: ffff880066fd9bb8 R08: 0000000000000000 R09: 0000000000000001
06:28:58:R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000002
06:28:58:R13: ffff880067fee000 R14: ffff880068dcd900 R15: 0000000000000000
06:28:58:FS: 00007f3b7a734700(0000) GS:ffff880002200000(0000) knlGS:0000000000000000
06:28:58:CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
06:28:58:CR2: 00000000000001f0 CR3: 000000005a694000 CR4: 00000000000006f0
06:28:58:DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
06:28:58:DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
06:28:58:Process lctl (pid: 12734, threadinfo ffff880066fd8000, task ffff880054bf4080)
06:28:58:Stack:
06:28:58: ffff880067fee178 ffff880059efd000 ffff880067fee000 ffff880067fee048
06:28:58:<d> ffff880067fee000 0000000000000000 ffff880067fee000 ffff880067fee048
06:28:58:<d> ffff880066fd9be8 ffffffffa0b8946c ffff880067fee000 0000000000000000
06:28:58:Call Trace:
06:28:58: [<ffffffffa0b8946c>] mdt_object_unlock+0x5c/0x160 [mdt]
06:28:58: [<ffffffffa0ba66b6>] mdt_add_dirty_flag+0x266/0x2e0 [mdt]
06:28:58: [<ffffffffa0b881bb>] mdt_ctxt_add_dirty_flag+0x12b/0x190 [mdt]
06:28:58: [<ffffffffa0b885f8>] mdt_obd_disconnect+0x3d8/0x500 [mdt]
06:28:58: [<ffffffffa11e591d>] class_fail_export+0x23d/0x540 [obdclass]
06:28:58: [<ffffffffa11e5d62>] obd_export_evict_by_uuid+0x142/0x240 [obdclass]
06:28:58: [<ffffffff8121d23f>] ? security_inode_permission+0x1f/0x30
06:28:58: [<ffffffffa07d9c33>] lprocfs_wr_evict_client+0x2d3/0x3b0 [ptlrpc]
06:28:58: [<ffffffffa0bc31b4>] lprocfs_mdt_wr_evict_client+0x184/0x3a0 [mdt]
06:28:58: [<ffffffffa11ee86b>] lprocfs_fops_write+0x7b/0xa0 [obdclass]
06:28:58: [<ffffffff811e9abe>] proc_reg_write+0x7e/0xc0
06:28:58: [<ffffffff81181368>] vfs_write+0xb8/0x1a0
06:28:58: [<ffffffff81181c61>] sys_write+0x51/0x90
06:28:58: [<ffffffff810dc685>] ? __audit_syscall_exit+0x265/0x290
06:28:58: [<ffffffff8100b072>] system_call_fastpath+0x16/0x1b
06:28:58:Code: 5b d4 07 00 66 09 00 00 48 c7 c6 08 f2 bd a0 48 c7 05 55 d4 07 00 00 00 00 00 c7 05 43 d4 07 00 00 00 08 00 48 c7 c7 20 67 c0 a0 <49> 8b 8f f0 01 00 00 4d 8b 87 00 01 00 00 31 c0 e8 5f c4 ac ff
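
Triage note on the oops above: the fault address in CR2 (00000000000001f0) can be cross-checked against the Code: line. The sketch below is not a general disassembler. It is a minimal decoder, written for this ticket, that handles only the one x86-64 encoding found at the faulting RIP (REX.W 8B /r with a base register plus 32-bit displacement). The helper names (faulting_bytes, decode_mov_load) are hypothetical; the byte string, CR2 and R15 values are copied from the log. It suggests the faulting instruction is a load from offset 0x1f0 of R15, and R15 is 0 in the register dump, which is consistent with a field read through a NULL structure pointer in mdt_save_lock. Which structure and field that is cannot be determined from the log alone.

```python
# Values copied verbatim from the oops in this ticket.
CODE = ("5b d4 07 00 66 09 00 00 48 c7 c6 08 f2 bd a0 48 c7 05 55 d4 07 00 "
        "00 00 00 00 c7 05 43 d4 07 00 00 00 08 00 48 c7 c7 20 67 c0 a0 "
        "<49> 8b 8f f0 01 00 00 4d 8b 87 00 01 00 00 31 c0 e8 5f c4 ac ff")
CR2 = 0x1F0  # faulting virtual address
R15 = 0x0    # R15 from the register dump

REG64 = ["rax", "rcx", "rdx", "rbx", "rsp", "rbp", "rsi", "rdi",
         "r8", "r9", "r10", "r11", "r12", "r13", "r14", "r15"]


def faulting_bytes(code_line):
    """Return the bytes starting at the <xx> marker, i.e. at the faulting RIP."""
    tokens = code_line.split()
    start = next(i for i, t in enumerate(tokens) if t.startswith("<"))
    return bytes(int(t.strip("<>"), 16) for t in tokens[start:])


def decode_mov_load(insn):
    """Decode 'REX.W 8B /r' with mod=10 (base + disp32) and nothing else.

    Returns (destination register, base register, displacement).
    """
    rex, opcode, modrm = insn[0], insn[1], insn[2]
    if (rex & 0xF0) != 0x40 or not (rex & 0x08) or opcode != 0x8B:
        raise ValueError("not a 64-bit MOV r64, r/m64")
    if (modrm >> 6) != 0b10 or (modrm & 0x07) == 0b100:
        raise ValueError("only base+disp32 without SIB is handled")
    reg = ((modrm >> 3) & 0x07) | ((rex & 0x04) << 1)  # REX.R extends reg
    base = (modrm & 0x07) | ((rex & 0x01) << 3)        # REX.B extends r/m
    disp = int.from_bytes(insn[3:7], "little", signed=True)
    return REG64[reg], REG64[base], disp


insn = faulting_bytes(CODE)
dest, base, disp = decode_mov_load(insn)
print(f"mov {disp:#x}(%{base}),%{dest}")
print("matches CR2:", R15 + disp == CR2)
```

The seven bytes that follow (4d 8b 87 00 01 00 00) decode with the same helper, via decode_mov_load(insn[7:]), as a second load through R15 at offset 0x100.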
|
| Comments |
| Comment by Nathaniel Clark [ 24/Oct/13 ] |
|
This may be a duplicate or at least fixed by |