[LU-17093] sanity test_103b: RIP: 0010:lod_sub_get_thandle+0xba/0x460 [lod] Created: 06/Sep/23  Updated: 06/Sep/23

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.16.0
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None
Environment:

OST_INDEX_LIST=[0,10,20,40,55,60,80]


Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for jianyu <yujian@whamcloud.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/58579a38-95b2-4cda-b091-35f54aaec55c

test_103b failed with the following error:

[ 6538.390094] Lustre: DEBUG MARKER: == sanity test 103b: umask lfs setstripe ================= 20:58:48 (1693429128)
[ 6556.172057] BUG: unable to handle kernel NULL pointer dereference at 000000000000001c
[ 6556.175351] PGD 0 P4D 0 
[ 6556.175905] Oops: 0000 [#1] SMP PTI
[ 6556.176622] CPU: 1 PID: 10843 Comm: mdt00_001 Kdump: loaded Tainted: G           OE    --------- -  - 4.18.0-477.15.1.el8_lustre.x86_64 #1
[ 6556.178888] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
[ 6556.179980] RIP: 0010:lod_sub_get_thandle+0xba/0x460 [lod]
[ 6556.181169] Code: c6 04 24 00 48 83 7b 08 00 0f 84 19 01 00 00 48 8b 4b 48 0f b6 53 44 0f b6 41 44 83 e2 10 83 e0 ef 09 d0 88 41 44 49 8b 55 00 <f6> 42 1c 02 0f 85 d7 00 00 00 48 85 d2 74 2f 48 b9 ff ff ff ff 01
[ 6556.184569] RSP: 0018:ffffaa3ec11e3a60 EFLAGS: 00010206
[ 6556.185570] RAX: 0000000000000014 RBX: ffff8f1003dfbd80 RCX: ffff8f100280a000
[ 6556.186908] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff8f0fc57de080
[ 6556.188257] RBP: ffff8f10041a6000 R08: 00000000000001ab R09: 0000000000000000
[ 6556.189595] R10: ffffaa3ec11e3a60 R11: ffff8f1009842076 R12: ffffaa3ec11e3aa7
[ 6556.190929] R13: ffff8f0fff75ee68 R14: ffff8f0ff174ed40 R15: ffff8f0fc500b800
[ 6556.192256] FS:  0000000000000000(0000) GS:ffff8f107fd00000(0000) knlGS:0000000000000000
[ 6556.193754] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 6556.194844] CR2: 000000000000001c CR3: 0000000009610003 CR4: 00000000000606e0
[ 6556.196165] Call Trace:
[ 6556.196717]  lod_sub_declare_destroy+0x4d/0x320 [lod]
[ 6556.197721]  lod_obj_for_each_stripe+0x118/0x2e0 [lod]
[ 6556.198739]  lod_declare_destroy+0x52a/0x580 [lod]
[ 6556.199693]  ? lod_index_try+0x310/0x310 [lod]
[ 6556.200584]  mdd_declare_finish_unlink+0xad/0x250 [mdd]
[ 6556.201686]  mdd_unlink+0x4cf/0xd90 [mdd]
[ 6556.202508]  mdt_reint_unlink+0xbc3/0x1100 [mdt]
[ 6556.203616]  mdt_reint_rec+0x11f/0x270 [mdt]
[ 6556.204485]  mdt_reint_internal+0x4d3/0x7f0 [mdt]
[ 6556.205437]  mdt_reint+0x5d/0x110 [mdt]
[ 6556.206236]  tgt_request_handle+0xd20/0x19c0 [ptlrpc]
[ 6556.207731]  ptlrpc_server_handle_request+0x31d/0xbc0 [ptlrpc]
[ 6556.208943]  ? lprocfs_counter_add+0x12a/0x1a0 [obdclass]
[ 6556.210234]  ptlrpc_main+0xc91/0x15a0 [ptlrpc]
[ 6556.211186]  ? ptlrpc_wait_event+0x590/0x590 [ptlrpc]
[ 6556.212243]  kthread+0x134/0x150
[ 6556.212953]  ? set_kthread_struct+0x50/0x50
[ 6556.213799]  ret_from_fork+0x35/0x40
[ 6556.214555] Modules linked in: obdecho(OE) ptlrpc_gss(OE) osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_flakey rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache intel_rapl_msr intel_rapl_common crct10dif_pclmul crc32_pclmul ghash_clmulni_intel joydev pcspkr virtio_balloon i2c_piix4 sunrpc dm_mod ext4 mbcache jbd2 ata_generic ata_piix libata virtio_net crc32c_intel serio_raw net_failover virtio_blk failover [last unloaded: llog_test]
[ 6556.224421] CR2: 000000000000001c

Test session details:
clients: https://build.whamcloud.com/job/lustre-reviews/97423 - 4.18.0-477.15.1.el8_8.x86_64
servers: https://build.whamcloud.com/job/lustre-reviews/97423 - 4.18.0-477.15.1.el8_lustre.x86_64

OST_INDEX_LIST=[0,10,20,40,55,60,80]

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
sanity test_103b - onyx-35vm4 crashed during sanity test_103b


Generated at Sat Feb 10 03:32:34 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.