[LU-15950] conf-sanity test_101: Timeout occurred after 974 minutes
Created: 15/Jun/22  Updated: 05/Apr/23

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.12.9
Fix Version/s: None

Type: Bug
Priority: Minor
Reporter: Maloo
Assignee: WC Triage
Resolution: Unresolved
Votes: 0
Labels: None

Issue Links:
Related
is related to LU-15920 Interop parallel-scale-nfsv4: BUG: un... Open
Severity: 3
Rank (Obsolete): 9223372036854775807

Description

This issue was created by maloo for sarah <sarah@whamcloud.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/3cf7c3c2-9493-4de6-8591-7b5171d3c76e

test_101 failed with the following error:

Timeout occurred after 974 minutes, last suite running was conf-sanity

The client 2 and MDS consoles show the following messages:

[58460.882901] Showing busy workqueues and worker pools:
[58460.886371] workqueue events: flags=0x0
[58460.888927]   pwq 0: cpus=0 node=0 flags=0x0 nice=0 active=1/256 refcnt=2
[58460.893468]     pending: kfree_rcu_monitor
[58487.142217] sysrq: SysRq : Trigger a crash
[58487.150880] Unable to handle kernel NULL pointer dereference at virtual address 0000000000000000
[58487.156889] Mem abort info:
[58487.158770]   ESR = 0x96000045
[58487.160817]   Exception class = DABT (current EL), IL = 32 bits
[58487.164797]   SET = 0, FnV = 0
[58487.166803]   EA = 0, S1PTW = 0
[58487.168874] Data abort info:
[58487.170765]   ISV = 0, ISS = 0x00000045
[58487.173290]   CM = 0, WnR = 1
[58487.175417] user pgtable: 64k pages, 48-bit VAs, pgdp = 000000008e5c7285
[58487.179858] [0000000000000000] pgd=0000000000000000, pud=0000000000000000
[58487.184405] Internal error: Oops: 96000045 [#1] SMP
[58487.187634] Modules linked in: lustre(OE) obdecho(OE) mgc(OE) lov(OE) mdc(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc_gss(OE) ptlrpc(OE) obdclass(OE) ksocklnd(OE) lnet(OE) libcfs(OE) ib_core rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache crct10dif_ce ghash_ce sunrpc sha2_ce sha256_arm64 sha1_ce virtio_balloon vfat fat ext4 mbcache jbd2 virtio_net net_failover failover virtio_mmio virtio_blk [last unloaded: libcfs]
[58487.214295] CPU: 1 PID: 1202794 Comm: bash Kdump: loaded Tainted: G           OE    --------- -  - 4.18.0-348.7.1.el8_5.aarch64 #1
[58487.222505] Hardware name: QEMU KVM Virtual Machine, BIOS 0.0.0 02/06/2015
[58487.227413] pstate: 60000005 (nZCv daif -PAN -UAO)
[58487.230747] pc : sysrq_handle_crash+0x28/0x38
[58487.233900] lr : __handle_sysrq+0x9c/0x190
[58487.236832] sp : ffff00001758fcf0
[58487.239190] x29: ffff00001758fcf0 x28: ffff800044c43300 
[58487.242866] x27: 0000aaaabc3f81ec x26: 0000aaaabc3aa000 
[58487.246582] x25: ffff000010bb5308 x24: 0000000000000000 
[58487.250315] x23: 0000000000000000 x22: 0000000000000008 
[58487.254061] x21: 0000000000000063 x20: ffff000011b01000 
[58487.257814] x19: ffff000011b34000 x18: 0000000000000010 
[58487.261568] x17: 0000000000000000 x16: 0000000000000000 
[58487.265290] x15: ffffffffffffffff x14: ffff000011af8788 
[58487.268983] x13: ffff00009758fa47 x12: ffff00001758fa4f 
[58487.272689] x11: ffff000011b33000 x10: ffff00001758f9d0 
[58487.276369] x9 : 00000000ffffffd0 x8 : ffff00001066b318 
[58487.280035] x7 : 000000000000238d x6 : ffff8000bfe82498 
[58487.283742] x5 : ffff8000bfe82498 x4 : 0000000000000000 
[58487.287513] x3 : ffff8000bff0a448 x2 : 0000000000000000 
[58487.291245] x1 : 0000000000000000 x0 : 0000000000000001 
[58487.294988] Process bash (pid: 1202794, stack limit = 0x00000000d80e5397)
[58487.299800] Call trace:
[58487.301536]  sysrq_handle_crash+0x28/0x38
[58487.304326]  __handle_sysrq+0x9c/0x190
[58487.306966]  write_sysrq_trigger+0x7c/0x98
[58487.309892]  proc_reg_write+0x84/0xd8
[58487.312534]  __vfs_write+0x4c/0x90
[58487.314975]  vfs_write+0xb4/0x1c0
[58487.317352]  ksys_write+0x70/0xd8
[58487.319682]  __arm64_sys_write+0x28/0x38
[58487.322473]  el0_svc_handler+0xb4/0x188
[58487.325177]  el0_svc+0x8/0xc
[58487.327297] Code: 52800020 b907c820 d5033e9f d2800001 (39000020) 
[58487.331621] SMP: stopping secondary CPUs
[58487.337278] Starting crashdump kernel...
[58487.339933] Bye!
[    0.000000] Booting Linux on physical CPU 0x0000000001 [0x431f0a11]
[    0.000000] Linux version 4.18.0-348.7.1.el8_5.aarch64 (mockbuild@aarch64-01.mbox.centos.org) (gcc version 8.5.0 20210514 (Red Hat 8.5.0-4) (GCC)) #1 SMP Wed Dec 22 13:24:11 UTC 2021
[    0.000000] efi: Getting EFI parameters from FDT:
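
The Oops in the log above follows a "SysRq : Trigger a crash" message, and its call trace runs from write_sysrq_trigger to sysrq_handle_crash in a bash process. That pattern is what a deliberate crash through /proc/sysrq-trigger produces, so the NULL pointer dereference appears to be a forced crash dump taken after the timeout rather than the cause of the hang. The following is a minimal sketch, not part of the original report, of how such a console log could be classified automatically. The function name classify_oops and its return labels are hypothetical; the matched strings are taken from the log above and may differ on other kernel versions.

```python
import re

# Matches the leading "[58487.142217] " style kernel timestamp.
TIMESTAMP = re.compile(r"^\[\s*\d+\.\d+\]\s?")


def classify_oops(console_text):
    """Classify the first Oops in a console log.

    Returns "sysrq-induced" if the Oops was preceded by a SysRq crash
    request and the faulting pc is in sysrq_handle_crash, "genuine" for
    any other Oops, and "none" if no Oops is found.
    (Hypothetical helper; labels are illustrative only.)
    """
    sysrq_seen = False
    oops_seen = False
    for raw in console_text.splitlines():
        line = TIMESTAMP.sub("", raw)
        if "Trigger a crash" in line:
            sysrq_seen = True
        elif "Internal error: Oops" in line:
            oops_seen = True
        elif oops_seen and line.startswith("pc :"):
            # The pc line names the function that faulted.
            in_handler = "sysrq_handle_crash" in line
            return "sysrq-induced" if (sysrq_seen and in_handler) else "genuine"
    return "genuine" if oops_seen else "none"
```

Applied to the console output in this issue, a check of this kind would point triage toward the earlier part of the log (the busy workqueue report and whatever precedes it) instead of the Oops itself.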

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
conf-sanity test_101 - Timeout occurred after 974 minutes, last suite running was conf-sanity

