[LU-12767] review-ldiskfs-arm crashed during sanity test_103b Created: 16/Sep/19  Updated: 20/Jan/22  Resolved: 20/Jan/22

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug
Priority: Minor
Reporter: Maloo
Assignee: WC Triage
Resolution: Duplicate
Votes: 0
Labels: None

Issue Links:
Duplicate
duplicates LU-11878 sanity test 103b: OOM because of too ... Resolved
Related
is related to LU-11879 sanity: trevis-79 crash on RAM parity... Open
Severity: 3

 Description   

This issue was created by maloo for Chris Horn <hornc@cray.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/3027cfc6-d691-11e9-a25b-52540065bddc

[ 6233.106512] Synchronous External Abort: synchronous parity or ECC error (0x96000018) at 0x000000000e235e18
[ 6233.112370] Internal error: : 96000018 [#1] SMP
[ 6233.115017] Modules linked in: loop lustre(OE) obdecho(OE) mgc(OE) lov(OE) mdc(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc_gss(OE) ptlrpc(OE) obdclass(OE) ksocklnd(OE) lnet(OE) libcfs(OE) rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache rpcrdma ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm sunrpc ib_core vfat fat crc32_ce ghash_ce sha2_ce sha256_arm64 sha1_ce virtio_balloon ip_tables ext4 mbcache jbd2 virtio_blk virtio_net virtio_pci virtio_mmio virtio_ring virtio
[ 6233.148306] CPU: 0 PID: 27053 Comm: bash Kdump: loaded Tainted: G           OE  ------------   4.14.0-115.2.2.el7a.aarch64 #1
[ 6233.155233] Hardware name: QEMU KVM Virtual Machine, BIOS 0.0.0 02/06/2015
[ 6233.159457] task: ffff80002a0f6600 task.stack: ffff00000e220000
[ 6233.163199] PC is at filldir64+0xd0/0x2b8
[ 6233.166134] LR is at ll_dir_read+0x17c/0x378 [lustre]
[ 6233.169231] pc : [<ffff0000082c7ebc>] lr : [<ffff000002d181ec>] pstate: 40000005
[ 6233.173789] sp : ffff00000e22fc40
[ 6233.175867] x29: ffff00000e22fc40 x28: 191d0400098d6ba6
[ 6233.179137] x27: 0000000000000000 x26: ffff80004fbf2ed0
[ 6233.182419] x25: 0000000000000001 x24: ffff80003f733208
[ 6233.185726] x23: ffff7fe00013efc0 x22: 0000000000000020
[ 6233.188998] x21: 000000000000000c x20: ffff00000e22fe90
[ 6233.192284] x19: ffff00000e22fe90 x18: 00000000ffffff9d
[ 6233.195568] x17: 0000000000000000 x16: 0000000000000000
[ 6233.198825] x15: ffff000002a557c0 x14: 0000000200000007
[ 6233.202113] x13: 0000000000000002 x12: 0000000000000008
[ 6233.205400] x11: 0101010101010101 x10: 7f7f7f7f7f7f7f7f
[ 6233.208665] x9 : 646260715e726562 x8 : 0000ffffffffffff
[ 6233.212018] x7 : 0000000000000000 x6 : 0000ffffffffffff
[ 6233.215285] x5 : 0000000000000004 x4 : 02000013a1001216
[ 6233.218569] x3 : 191d0400098d6ba6 x2 : 000000000e22fea0
[ 6233.221840] x1 : ffff80004fbf2ef0 x0 : 000000000e22fea0
[ 6233.225127] Process bash (pid: 27053, stack limit = 0xffff00000e220000)
[ 6233.229234] Call trace:
[ 6233.230772] Exception stack(0xffff00000e22fb00 to 0xffff00000e22fc40)
[ 6233.234720] fb00: 000000000e22fea0 ffff80004fbf2ef0 000000000e22fea0 191d0400098d6ba6
[ 6233.239572] fb20: 02000013a1001216 0000000000000004 0000ffffffffffff 0000000000000000
[ 6233.244414] fb40: 0000ffffffffffff 646260715e726562 7f7f7f7f7f7f7f7f 0101010101010101
[ 6233.249275] fb60: 0000000000000008 0000000000000002 0000000200000007 ffff000002a557c0
[ 6233.254159] fb80: 0000000000000000 0000000000000000 00000000ffffff9d ffff00000e22fe90
[ 6233.258993] fba0: ffff00000e22fe90 000000000000000c 0000000000000020 ffff7fe00013efc0
[ 6233.263827] fbc0: ffff80003f733208 0000000000000001 ffff80004fbf2ed0 0000000000000000
[ 6233.268634] fbe0: 191d0400098d6ba6 ffff00000e22fc40 ffff000002d181ec ffff00000e22fc40
[ 6233.273477] fc00: ffff0000082c7ebc 0000000040000005 0000000002cbcc1e ffff80003f733208
[ 6233.278286] fc20: 0000ffffffffffff ffff800056588600 ffff00000e22fc40 ffff0000082c7ebc
[ 6233.283142] [<ffff0000082c7ebc>] filldir64+0xd0/0x2b8
[ 6233.286625] [<ffff000002d181ec>] ll_dir_read+0x17c/0x378 [lustre]
[ 6233.290600] [<ffff000002d18530>] ll_iterate+0x148/0x688 [lustre]
[ 6233.294309] [<ffff0000082c7a3c>] iterate_dir+0x88/0x1b8
[ 6233.297541] [<ffff0000082c82ac>] SyS_getdents64+0x98/0x170
[ 6233.300936] Exception stack(0xffff00000e22fec0 to 0xffff00000e230000)
[ 6233.304902] fec0: 0000000000000003 000000000e22de10 0000000000008000 00000000004fb000
[ 6233.309744] fee0: 0000000000000001 0000000000000100 000000000000007c 00000000ffffffff
[ 6233.314570] ff00: 000000000000003d 0000ffffcaae3ad0 000000000e029ea0 00000000004fd000
[ 6233.319386] ff20: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
[ 6233.324209] ff40: 00000000004f0300 0000ffff98c85e2c 0000ffffcaae4940 0000000000000030
[ 6233.329019] ff60: 000000000e22dde0 000000000e22de10 000000000e22dde4 0000000000000020
[ 6233.333873] ff80: 0000ffff98e540a0 0000000000000000 00000000004fb274 000000000e167210
[ 6233.338768] ffa0: 0000000000000000 0000ffffcaae3920 0000ffff98c85ed0 0000ffffcaae3920
[ 6233.343615] ffc0: 0000ffff98c8633c 0000000060000000 0000000000000003 000000000000003d
[ 6233.348424] ffe0: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
[ 6233.353243] [<ffff00000808392c>] __sys_trace_return+0x0/0x4
[ 6233.356696] Code: d503201f f9000003 d503201f 35fffe87 (f9400a74)
[ 6233.360566] SMP: stopping secondary CPUs
[ 6233.366242] Starting crashdump kernel...
[ 6233.368499] Bye!
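
The fault code on the first line of the log can be cross-checked by hand. As a minimal sketch, the snippet below splits 0x96000018 into the ESR_ELx fields defined by the Arm architecture: EC in bits [31:26], IL in bit 25, ISS in bits [24:0], with the data fault status code (DFSC) in ISS[5:0] and WnR in ISS[6]. The helper name decode_esr and the two lookup tables are hypothetical and deliberately incomplete; this is not kernel or Lustre code.

```python
# Hypothetical helper (not kernel or Lustre code): split an AArch64
# ESR_ELx value into its architectural fields.

# Only the codes seen in this log are listed; both tables are incomplete.
EC_NAMES = {
    0x25: "Data Abort taken without a change in Exception level",
}
DFSC_NAMES = {
    0x18: "synchronous parity or ECC error on memory access, "
          "not on translation table walk",
}


def decode_esr(esr: int) -> dict:
    """Return the EC, IL, ISS, DFSC and WnR fields of an ESR value."""
    ec = (esr >> 26) & 0x3F      # exception class, bits [31:26]
    il = (esr >> 25) & 0x1       # instruction length bit, bit 25
    iss = esr & 0x1FFFFFF        # instruction specific syndrome, bits [24:0]
    dfsc = iss & 0x3F            # data fault status code (data aborts only)
    wnr = (iss >> 6) & 0x1       # 1 = write, 0 = read (data aborts only)
    return {
        "ec": ec,
        "ec_name": EC_NAMES.get(ec, "not in this sketch's table"),
        "il": il,
        "iss": iss,
        "dfsc": dfsc,
        "dfsc_name": DFSC_NAMES.get(dfsc, "not in this sketch's table"),
        "wnr": wnr,
    }


if __name__ == "__main__":
    # Value taken from the "Synchronous External Abort" line of the log.
    for field, value in decode_esr(0x96000018).items():
        print(f"{field}: {value}")
```

For 0x96000018 this yields EC 0x25 (a data abort taken at the current exception level) and DFSC 0x18 with WnR clear, which agrees with the "synchronous parity or ECC error" text the kernel printed for a read in filldir64. The decode alone does not say whether the error originates in the guest or in the QEMU/KVM host.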


 Comments   
Comment by Chris Horn [ 22/Oct/19 ]

Looks like +1 on master https://testing.whamcloud.com/test_sets/39461f70-f4ac-11e9-be86-52540065bddc

Different test case but same crash signature

Comment by James A Simmons [ 25/Nov/20 ]

Is this still true?
