Details
Type: Bug
Resolution: Unresolved
Priority: Minor
Description
This issue was created by maloo for Chris Horn <chris.horn@hpe.com>
This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/a136572a-6cf3-4145-a30f-3b6ed48c275a
test_133f failed with the following error:
trevis-54vm2 crashed during sanity test_133f
Test session details:
clients: https://build.whamcloud.com/job/lustre-reviews/110114 - 4.18.0-553.27.1.el8_10.x86_64
servers: https://build.whamcloud.com/job/lustre-reviews/110114 - 4.18.0-553.27.1.el8_lustre.x86_64
[ 8363.209844] Lustre: DEBUG MARKER: == sanity test 133f: Check reads/writes of client lustre proc files with bad area io ========================================================== 20:30:40 (1736281840)
[ 8369.948851] Lustre: DEBUG MARKER: running=$(grep -c /mnt/lustre' ' /proc/mounts); if [ $running -ne 0 ] ; then echo Stopping client $(hostname) /mnt/lustre opts:; lsof /mnt/lustre || need_kill=no; if [ x != x -a x$need_kill != xno ]; then pids=$(lsof -t /mnt/lustre | sort -u); if
[ 8370.434577] general protection fault, probably for non-canonical address 0x50615aceca50de3b: 0000 [#1] SMP PTI
[ 8370.436881] CPU: 1 PID: 207345 Comm: kworker/1:2 Kdump: loaded Tainted: G OE -------- - - 4.18.0-553.27.1.el8_10.x86_64 #1
[ 8370.439315] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
[ 8370.440508] Workqueue: events sec_gc_main [ptlrpc]
[ 8370.441838] RIP: 0010:lprocfs_counter_add+0xb8/0x180 [obdclass]
[ 8370.443309] Code: 04 74 0b 4c 89 e6 49 0f af f4 48 01 70 20 4c 39 60 08 7e 04 4c 89 60 08 48 8d 04 9b 48 8d 04 c1 4c 39 60 10 7c 7a 48 8b 45 30 <4c> 8b 6c 10 18 4d 85 ed 74 3d 4d 85 e4 0f 84 a0 00 00 00 41 83 ec
[ 8370.446940] RSP: 0018:ffffc2ab00be3ce8 EFLAGS: 00010202
[ 8370.448016] RAX: 50615aceca50ddfb RBX: 0000000000000002 RCX: ffffa099d9ba4000
[ 8370.449427] RDX: 0000000000000040 RSI: 0000000000000000 RDI: ffffa099d28889c0
[ 8370.450901] RBP: ffffa099d28889c0 R08: 0000000000000eeb R09: 0000000000000000
[ 8370.452366] R10: ffffa099d2f4c000 R11: ffffa099d2f4bee9 R12: 0000000000000000
[ 8370.453793] R13: ffffa099d7ddda00 R14: 0000000000000000 R15: 0000000000000000
[ 8370.455230] FS: 0000000000000000(0000) GS:ffffa09a7fd00000(0000) knlGS:0000000000000000
[ 8370.456869] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 8370.458062] CR2: 000055a09b6a1a80 CR3: 000000000f610005 CR4: 00000000000606e0
[ 8370.459497] Call Trace:
[ 8370.460109] ? __die_body+0x1a/0x60
[ 8370.460907] ? die_addr+0x38/0x51
[ 8370.461613] ? do_general_protection+0x135/0x280
[ 8370.462574] ? general_protection+0x1e/0x30
[ 8370.463488] ? lprocfs_counter_add+0xb8/0x180 [obdclass]
[ 8370.464646] ? lprocfs_counter_add+0x5a/0x180 [obdclass]
[ 8370.465768] ptl_send_rpc+0x88b/0x13c0 [ptlrpc]
[ 8370.466794] ? ptlrpc_request_bufs_pack+0x216/0x6f0 [ptlrpc]
[ 8370.468015] gss_do_ctx_fini_rpc+0x226/0x4e0 [ptlrpc_gss]
[ 8370.469240] gss_cli_ctx_fini_common+0x56/0x2c0 [ptlrpc_gss]
[ 8370.470459] ctx_destroy_kr+0x71/0x2a0 [ptlrpc_gss]
[ 8370.471490] sec_process_ctx_list+0x118/0x1c0 [ptlrpc]
[ 8370.472616] sec_gc_main+0x1e/0x2b0 [ptlrpc]
[ 8370.473582] process_one_work+0x1d3/0x390
[ 8370.474483] ? process_one_work+0x390/0x390
[ 8370.475384] worker_thread+0x30/0x390
[ 8370.476168] ? process_one_work+0x390/0x390
[ 8370.477071] kthread+0x134/0x150
[ 8370.477808] ? set_kthread_struct+0x50/0x50
[ 8370.478693] ret_from_fork+0x35/0x40
[ 8370.479489] Modules linked in: mgc(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc_gss(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) libcfs(OE) rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache intel_rapl_msr intel_rapl_common crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcspkr joydev virtio_balloon i2c_piix4 sunrpc ext4 mbcache jbd2 ata_generic ata_piix libata virtio_net net_failover crc32c_intel serio_raw virtio_blk failover
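For anyone triaging, here is a rough sketch of how the faulting address in the oops above can be sanity-checked against the register dump. It assumes 4-level paging (48-bit virtual addresses), and the helper name is_canonical_x86_64 is made up for illustration. It only restates values already in the log and is not a root-cause analysis.

```python
def is_canonical_x86_64(addr: int, va_bits: int = 48) -> bool:
    """Return True if addr is a canonical x86-64 virtual address.

    Hypothetical helper; va_bits=48 assumes 4-level paging.
    Canonical means bits 63..(va_bits-1) are all zeros or all ones.
    """
    top = addr >> (va_bits - 1)                    # bits 63..47 for va_bits=48
    all_ones = (1 << (64 - va_bits + 1)) - 1       # 17 one-bits for va_bits=48
    return top == 0 or top == all_ones


# Values copied from the oops in this ticket.
fault_addr = 0x50615ACECA50DE3B   # "probably for non-canonical address"
rax = 0x50615ACECA50DDFB
rdx = 0x0000000000000040
rcx = 0xFFFFA099D9BA4000          # looks like an ordinary kernel pointer

print(is_canonical_x86_64(fault_addr))   # reported faulting address
print(is_canonical_x86_64(rcx))          # kernel pointer for comparison
print(hex(rax + rdx))                    # compare against fault_addr
```

The reported address equals RAX + RDX, so the value in RAX at lprocfs_counter_add+0xb8 was not a valid kernel pointer. Where that value came from is not something this check can tell.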
VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
sanity test_133f - trevis-54vm2 crashed during sanity test_133f