[LU-8800] sanity-lfsck tests failed: [ 1082.515725] Oops: 0000 [#1] SMP BUG: unable to handle kernel NULL pointer dereference at 0000000000000004 lfsck_namespace_assistant_handler_p1+0xf45/0x1e80 [lfsck] Created: 04/Nov/16  Updated: 04/Nov/16  Resolved: 04/Nov/16

Status: Closed
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: nasf (Inactive) Assignee: nasf (Inactive)
Resolution: Won't Fix Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

It is reported sanity-lfsck test_31e hit for the following failure:

[  228.686472] Lustre: DEBUG MARKER: == sanity-lfsck test 31e: Re-generate the lost slave LMV EA for striped directory (1) ================ 16:16:20 (1476807380)
[  228.713183] Lustre: ctl-lustre-MDT0000: super-sequence allocation rc = 0 [0x0000000280000400-0x00000002c0000400):0:mdt
[  228.853833] Lustre: cli-ctl-lustre-MDT0001: Allocated super-sequence [0x00000002c0000400-0x0000000300000400):1:mdt]
[  228.861530] Lustre: *** cfs_fail_loc=162a, val=0***
[  229.125119] BUG: unable to handle kernel NULL pointer dereference at 0000000000000004
[  229.127271] IP: [<ffffffffa0da5d2d>] lfsck_namespace_striped_dir_rescan+0x4cd/0xea0 [lfsck]
[  229.127271] PGD 0 
[  229.127271] Oops: 0000 [#1] SMP 
[  229.127271] Modules linked in: osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) ldiskfs(OE) lustre(OE) lmv(OE) mdc(OE) lov(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) sha512_generic crypto_null libcfs(OE) rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache ppdev parport_pc pcspkr virtio_balloon parport i2c_piix4 nfsd auth_rpcgss nfs_acl lockd grace sunrpc ip_tables ext4 mbcache jbd2 ata_generic pata_acpi virtio_blk virtio_net cirrus syscopyarea sysfillrect sysimgblt drm_kms_helper ttm serio_raw drm virtio_pci virtio_ring virtio i2c_core ata_piix libata floppy
[  229.127271] CPU: 1 PID: 13715 Comm: lfsck_namespace Tainted: G           OE  ------------   3.10.0-327.13.1.x3.0.86.x86_64 #1
[  229.127271] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2007
[  229.127271] task: ffff880058d17300 ti: ffff880058f90000 task.ti: ffff880058f90000
[  229.127271] RIP: 0010:[<ffffffffa0da5d2d>]  [<ffffffffa0da5d2d>] lfsck_namespace_striped_dir_rescan+0x4cd/0xea0 [lfsck]
[  229.127271] RSP: 0018:ffff880058f93b20  EFLAGS: 00010246
[  229.127271] RAX: 0000000000000000 RBX: ffff880058c7a7a0 RCX: 000000000000ba19
[  229.127271] RDX: 000000000000ba18 RSI: 0000000000000002 RDI: ffff88007d001a00
[  229.127271] RBP: ffff880058f93c68 R08: 0000000000017540 R09: ffff88007fd17540
[  229.127271] R10: ffffea0001638480 R11: ffffffffa0833da5 R12: 0000000000000000
[  229.127271] R13: ffff880058e124e0 R14: 0000000000000001 R15: ffff880035e61800
[  229.127271] FS:  0000000000000000(0000) GS:ffff88007fd00000(0000) knlGS:0000000000000000
[  229.127271] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[  229.127271] CR2: 0000000000000004 CR3: 0000000058eb5000 CR4: 00000000000006e0
[  229.127271] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  229.127271] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[  229.127271] Stack:
[  229.127271]  ffff880000000001 0000000280000400 ffff880000000002 ffff880000000000
[  229.127271]  ffffffffa0dafa80 ffffffffa0dafa80 ffffffffa0dafa80 ffffffffa0dafa80
[  229.127271]  ffffffffa0dafa80 ffffffffa0dafa80 ffffffffa0dafa7c ffffffffa0dafa7c
[  229.127271] Call Trace:
[  229.127271]  [<ffffffffa0d7e432>] lfsck_namespace_assistant_handler_p1+0x1642/0x1e80 [lfsck]
[  229.127271]  [<ffffffff816307a8>] ? __slab_free+0x10e/0x277
[  229.127271]  [<ffffffffa0d629a7>] lfsck_assistant_engine+0x3c7/0x2140 [lfsck]
[  229.127271]  [<ffffffff810c1986>] ? dequeue_entity+0x106/0x510
[  229.127271]  [<ffffffff810c219e>] ? dequeue_task_fair+0x40e/0x620
[  229.127271]  [<ffffffff81639208>] ? __schedule+0x2d8/0x900
[  229.127271]  [<ffffffff810b8c10>] ? wake_up_state+0x20/0x20
[  229.127271]  [<ffffffffa0d625e0>] ? lfsck_master_engine+0x13e0/0x13e0 [lfsck]
[  229.127271]  [<ffffffff810a5acf>] kthread+0xcf/0xe0
[  229.127271]  [<ffffffff810a5a00>] ? kthread_create_on_node+0x140/0x140
[  229.127271]  [<ffffffff81644818>] ret_from_fork+0x58/0x90
[  229.127271]  [<ffffffff810a5a00>] ? kthread_create_on_node+0x140/0x140
[  229.127271] Code: 0f 84 ae 02 00 00 83 f8 c3 0f 84 2f 04 00 00 83 f8 ea 0f 84 26 04 00 00 85 c0 0f 85 1e 07 00 00 48 8b 45 b8 c6 85 79 ff ff ff 01 <83> 78 04 01 0f 84 ec 07 00 00 c6 85 78 ff ff ff 00 c6 85 3f ff 
[  229.127271] RIP  [<ffffffffa0da5d2d>] lfsck_namespace_striped_dir_rescan+0x4cd/0xea0 [lfsck]


 Comments   
Comment by nasf (Inactive) [ 04/Nov/16 ]

The issue on master has already been fixed via the patch:
http://review.whamcloud.com/19877

Generated at Sat Feb 10 02:20:39 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.