[LU-7782] sanity-scrub test_2: NULL pointer dereference at 0x10 in lu_context_key_get() on mds2 - Whamcloud Community JIRA

Details

Type: Bug
Resolution: Fixed
Priority: Critical
Fix Version/s: Lustre 2.9.0
Affects Version/s: None
Labels:
None

Severity:
3
Rank (Obsolete):
9223372036854775807

Description

This issue was created by maloo for Andreas Dilger <andreas.dilger@intel.com>

This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/f0270640-d4f4-11e5-9e3f-5254006e85c2.

This is testing patch http://review.whamcloud.com/18442 which is changing sanity-scrub.sh scrub_prep() to use test_mkdir -i instead of mkdir and lfs mkdir explicitly for testing, so that it works with the upstream kernel (which doesn't have DNE support).

The sub-test test_2 failed with the following error on MDS2:

08:17:18:LustreError: 29698:0:(client.c:1133:ptlrpc_import_delay_req()) Skipped 40 previous similar messages
08:17:18:BUG: unable to handle kernel NULL pointer dereference at 0000000000000010
08:17:18:IP: [<ffffffffa057bb57>] lu_context_key_get+0x1
08:17:18:CPU 0 
08:17:18:Pid: 29699, comm: osp_up2-1 Not tainted 2.6.32-573.12.1.el6_lustre.g93f956d.x86_64 #1 Red Hat KVM
08:17:18:Call Trace:
08:17:18: [<ffffffffa09ee83f>] fld_local_lookup+0x4f/0x290 [fld]
08:17:18: [<ffffffffa09eec83>] fld_server_lookup+0x53/0x330 [fld]
08:17:18: [<ffffffffa0e6e38f>] lod_fld_lookup+0x34f/0x520 [lod]
08:17:18: [<ffffffffa0e84243>] lod_object_init+0x103/0x3c0 [lod]
08:17:18: [<ffffffffa057f198>] lu_object_alloc+0xd8/0x320 [obdclass]
08:17:18: [<ffffffffa0580581>] lu_object_find_try+0x151/0x260 [obdclass]
08:17:18: [<ffffffffa0580741>] lu_object_find_at+0xb1/0xe0 [obdclass]
08:17:18: [<ffffffffa05807af>] lu_object_find_slice+0x1f/0x80 [obdclass]
08:17:18: [<ffffffffa0f79a4e>] osp_trans_stop_cb+0x1be/0x2d0 [osp]
08:17:18: [<ffffffffa0f7b2be>] osp_update_interpret+0x21e/0x4a0 [osp]
08:17:18: [<ffffffffa07900b5>] ptlrpc_check_set+0x615/0x1da0 [ptlrpc]
08:17:18: [<ffffffffa0791b9a>] ptlrpc_set_wait+0x35a/0x960 [ptlrpc]
08:17:18: [<ffffffffa0792221>] ptlrpc_queue_wait+0x81/0x220 [ptlrpc]
08:17:18: [<ffffffffa0f7b9c6>] osp_send_update_req+0x256/0x850 [osp]
08:17:18: [<ffffffffa0f7c63f>] osp_send_update_thread+0x20f/0x7ac [osp]
08:17:18: [<ffffffff810a0fce>] kthread+0x9e/0xc0

Please provide additional information about the failure here.

It shouldn't be possible to cause the node to crash, no matter how the test directories are being created.

Info required for matching: sanity-scrub 2

Attachments

Issue Links

is related to

LU-7935 MDS crash with NULL pointer dereference at 0000000000000010

Resolved

LU-8399 MDT hung at lu_object_find_at during umount

Resolved

LU-8416 sanity-scrub test_4c: Auto trigger full scrub unexpectedly

Resolved

is related to

LU-7746 skip test of new functionality on upstream client

Resolved

Activity

People

Assignee:: nasf (Inactive)

Reporter:: Maloo

Votes:: 0 Vote for this issue

Watchers:: 11 Start watching this issue

Dates

Created:: 17/Feb/16 1:10 AM

Updated:: 10/Sep/16 4:08 AM

Resolved:: 10/Sep/16 4:08 AM