[LU-1806] Test failure on test suite sanity, subtest test_118a Created: 30/Aug/12  Updated: 19/Dec/17  Resolved: 17/Apr/17

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Cannot Reproduce Votes: 0
Labels: None

Issue Links:
Related
is related to LU-10299 sanity test_118a: client crash Resolved
Severity: 3
Rank (Obsolete): 4285

 Description   

This issue was created by maloo for Ian <ian@whamcloud.com>

This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/3c011700-f2a8-11e1-807d-52540035b04c.

The sub-test test_118a failed with the following error:

test failed to respond and timed out

Info required for matching: sanity 118a



 Comments   
Comment by Nathaniel Clark [ 24/Oct/16 ]

Just happened on master (review-dne-part-1):
https://testing.hpdd.intel.com/test_sets/91441948-9712-11e6-a763-5254006e85c2

Client paniced:

13:44:14:[ 4065.688226] general protection fault: 0000 [#1] SMP 
13:44:14:[ 4065.689005] Modules linked in: loop lustre(OE) obdecho(OE) mgc(OE) lov(OE) osc(OE) mdc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc_gss(OE) ptlrpc(OE) obdclass(OE) ksocklnd(OE) lnet(OE) sha512_generic crypto_null libcfs(OE) rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache xprtrdma ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod crc_t10dif crct10dif_generic crct10dif_common ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core ib_addr ppdev parport_pc parport i2c_piix4 virtio_balloon pcspkr nfsd nfs_acl lockd auth_rpcgss grace sunrpc ip_tables ext4 mbcache jbd2 ata_generic pata_acpi virtio_blk cirrus syscopyarea sysfillrect sysimgblt drm_kms_helper ttm 8139too ata_piix drm serio_raw i2c_core 8139cp virtio_pci virtio_ring mii virtio libata floppy
13:44:14:[ 4065.706054] CPU: 1 PID: 459 Comm: lctl Tainted: G           OE  ------------   3.10.0-327.36.1.el7.x86_64 #1
13:44:14:[ 4065.706054] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2007
13:44:14:[ 4065.706054] task: ffff880078c29700 ti: ffff880041b8c000 task.ti: ffff880041b8c000
13:44:14:[ 4065.706054] RIP: 0010:[<ffffffffa0c0e4e7>]  [<ffffffffa0c0e4e7>] vvp_pgcache_obj_get+0x37/0x90 [lustre]
13:44:14:[ 4065.706054] RSP: 0018:ffff880041b8fd40  EFLAGS: 00010246
13:44:14:[ 4065.706054] RAX: fffeffffffffffe0 RBX: ffff880041b8fdf0 RCX: 0000000000000000
13:44:14:[ 4065.706054] RDX: 00000000ffffffff RSI: ffff880041b8fd78 RDI: ffff000000000000
13:44:14:[ 4065.706054] RBP: ffff880041b8fd68 R08: 0000000000000007 R09: 00000000000004ce
13:44:14:[ 4065.706054] R10: 0000000000000000 R11: 000000000000000b R12: 0000000000000000
13:44:14:[ 4065.706054] R13: ffff880041039300 R14: ffff000000000000 R15: 0000000000000000
13:44:14:[ 4065.706054] FS:  00007fce1cee3740(0000) GS:ffff88007fd00000(0000) knlGS:0000000000000000
13:44:14:[ 4065.706054] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
13:44:14:[ 4065.706054] CR2: 0000000001f5b0f8 CR3: 0000000078dbe000 CR4: 00000000000006e0
13:44:14:[ 4065.706054] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
13:44:14:[ 4065.706054] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
13:44:14:[ 4065.706054] Stack:
13:44:14:[ 4065.706054]  ffff880041b8fd50 ffff000000000000 ffff880041039300 ffffffffa0c0e4b0
13:44:14:[ 4065.706054]  ffff880041b8fdf0 ffff880041b8fdb0 ffffffffa05e0ed0 ffffc90004bcb000
13:44:14:[ 4065.706054]  00000000000004ff 000000008a11c0bd ffff880041b8fdf0 ffff880079a03360
13:44:14:[ 4065.706054] Call Trace:
13:44:14:[ 4065.706054]  [<ffffffffa0c0e4b0>] ? vvp_device_fini+0x10/0x10 [lustre]
13:44:14:[ 4065.706054]  [<ffffffffa05e0ed0>] cfs_hash_hlist_for_each+0xf0/0x110 [libcfs]
13:44:14:[ 4065.706054]  [<ffffffffa0c0e670>] vvp_pgcache_obj+0x50/0x190 [lustre]
13:44:14:[ 4065.706054]  [<ffffffffa0c0e87b>] vvp_pgcache_find+0xcb/0x160 [lustre]
13:44:14:[ 4065.706054]  [<ffffffffa0c0ea42>] vvp_pgcache_start+0x92/0xb0 [lustre]
13:44:14:[ 4065.706054]  [<ffffffff81202aa7>] seq_read+0x177/0x3a0
13:44:14:[ 4065.706054]  [<ffffffff812493dd>] proc_reg_read+0x3d/0x80
13:44:14:[ 4065.706054]  [<ffffffff811debac>] vfs_read+0x9c/0x170
13:44:14:[ 4065.706054]  [<ffffffff811df6ff>] SyS_read+0x7f/0xe0
13:44:14:[ 4065.706054]  [<ffffffff81646a09>] system_call_fastpath+0x16/0x1b
13:44:14:[ 4065.706054] Code: 89 d6 41 55 49 89 fd 41 54 45 31 e4 53 48 89 cb 48 83 ec 08 48 8b 47 08 48 89 d7 ff 50 20 8b 4b 0c 8d 51 ff 85 c9 89 53 0c 75 0c <48> 8b 50 10 41 b4 01 83 e2 01 74 15 48 83 c4 08 44 89 e0 5b 41 
13:44:14:[ 4065.706054] RIP  [<ffffffffa0c0e4e7>] vvp_pgcache_obj_get+0x37/0x90 [lustre]
13:44:14:[ 4065.706054]  RSP <ffff880041b8fd40>
Comment by Andreas Dilger [ 17/Apr/17 ]

Close old issue.

Comment by Alexander Boyko [ 09/Jun/17 ]

Got the same fault
https://testing.hpdd.intel.com/test_sets/b5e51fa4-4c66-11e7-b558-5254006e85c2

07:49:04:[ 4226.666580] Lustre: DEBUG MARKER: == sanity test 118a: verify O_SYNC works ============================================================= 07:48:59 (1496908139)
07:49:04:
07:49:04:[ 4226.821956] general protection fault: 0000 [#1] SMP 
07:49:04:[ 4226.822005] Modules linked in: loop lustre(OE) obdecho(OE) mgc(OE) lov(OE) osc(OE) mdc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc_gss(OE) ptlrpc(OE) obdclass(OE) ksocklnd(OE) lnet(OE) libcfs(OE) rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache rpcrdma ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod crc_t10dif crct10dif_generic ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_core iosf_mbi crc32_pclmul ghash_clmulni_intel ppdev aesni_intel lrw gf128mul glue_helper ablk_helper cryptd pcspkr virtio_balloon i2c_piix4 parport_pc parport nfsd nfs_acl lockd auth_rpcgss grace sunrpc ip_tables ext4 mbcache jbd2 ata_generic pata_acpi cirrus virtio_blk drm_kms_helper syscopyarea sysfillrect sysimgblt crct10dif_pclmul fb_sys_fops crct10dif_common 8139too ttm ata_piix crc32c_intel drm virtio_pci virtio_ring virtio serio_raw i2c_core libata 8139cp mii floppy
07:49:04:[ 4226.822005] CPU: 1 PID: 22195 Comm: lctl Tainted: G           OE  ------------   3.10.0-514.21.1.el7.x86_64 #1
07:49:04:[ 4226.822005] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2007
07:49:04:[ 4226.822005] task: ffff88007a009f60 ti: ffff8800787bc000 task.ti: ffff8800787bc000
07:49:04:[ 4226.822005] RIP: 0010:[<ffffffffa0c73ab7>]  [<ffffffffa0c73ab7>] vvp_pgcache_obj_get+0x37/0x90 [lustre]
07:49:04:[ 4226.822005] RSP: 0018:ffff8800787bfd38  EFLAGS: 00010246
07:49:04:[ 4226.822005] RAX: fffeffffffffffe0 RBX: ffff8800787bfde8 RCX: 0000000000000000
07:49:04:[ 4226.822005] RDX: 00000000ffffffff RSI: ffff8800787bfd70 RDI: ffff000000000000
07:49:04:[ 4226.822005] RBP: ffff8800787bfd60 R08: 00000000000000f9 R09: 0000000000000a9f
07:49:04:[ 4226.822005] R10: 0000000000000079 R11: 000000000007ffff R12: 0000000000000000
07:49:04:[ 4226.822005] R13: ffff880069e8dcc0 R14: ffff000000000000 R15: 0000000000000000
07:49:04:[ 4226.822005] FS:  00007f620e32a740(0000) GS:ffff88007fd00000(0000) knlGS:0000000000000000
07:49:04:[ 4226.822005] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
07:49:04:[ 4226.822005] CR2: 0000000000ba40f8 CR3: 0000000075e08000 CR4: 00000000000406e0
07:49:04:[ 4226.822005] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
07:49:04:[ 4226.822005] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
07:49:04:[ 4226.822005] Stack:
07:49:04:[ 4226.822005]  ffff88005a70ee18 ffff000000000000 ffff880069e8dcc0 ffffffffa0c73a80
07:49:04:[ 4226.822005]  ffff8800787bfde8 ffff8800787bfda8 ffffffffa0647350 ffffc90005640000
07:49:04:[ 4226.822005]  ffffc900000007ff 000000001ea25231 ffff8800787bfde8 ffff8800782b3c60
07:49:04:[ 4226.822005] Call Trace:
07:49:04:[ 4226.822005]  [<ffffffffa0c73a80>] ? vvp_device_fini+0x10/0x10 [lustre]
07:49:04:[ 4226.822005]  [<ffffffffa0647350>] cfs_hash_hlist_for_each+0xf0/0x110 [libcfs]
07:49:04:[ 4226.822005]  [<ffffffffa0c73c40>] vvp_pgcache_obj+0x50/0x190 [lustre]
07:49:04:[ 4226.822005]  [<ffffffffa0c73e4b>] vvp_pgcache_find+0xcb/0x160 [lustre]
07:49:04:[ 4226.822005]  [<ffffffffa07bd876>] ? lu_env_refill+0x36/0x50 [obdclass]
07:49:04:[ 4226.822005]  [<ffffffffa0c73f35>] vvp_pgcache_next+0x55/0xa0 [lustre]
07:49:04:[ 4226.822005]  [<ffffffff81222c23>] seq_read+0x233/0x3b0
07:49:04:[ 4226.822005]  [<ffffffff8126bd0d>] proc_reg_read+0x3d/0x80
07:49:04:[ 4226.822005]  [<ffffffff811fe69e>] vfs_read+0x9e/0x170
07:49:04:[ 4226.822005]  [<ffffffff811ff26f>] SyS_read+0x7f/0xe0
07:49:04:[ 4226.822005]  [<ffffffff816975c9>] system_call_fastpath+0x16/0x1b
Generated at Sat Feb 10 01:19:50 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.