[LU-9046] sanity-hsm: timeout after tests done Created: 25/Jan/17  Updated: 05/Aug/20  Resolved: 05/Aug/20

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Cannot Reproduce Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for Bob Glossman <bob.glossman@intel.com>

A panic was seen on MDS1 after all tests had completed:

01:35:55:[18368.433528] BUG: unable to handle kernel NULL pointer dereference at           (null)
01:35:55:[18368.434751] IP: [<ffffffff81333279>] __list_del_entry+0x29/0xd0
01:35:55:[18368.434751] PGD 0 
01:35:55:[18368.434751] Oops: 0000 [#1] SMP 
01:35:55:[18368.434751] Modules linked in: osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) sha512_ssse3 sha512_generic crypto_null libcfs(OE) ldiskfs(OE) dm_mod rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache rpcrdma ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod crc_t10dif crct10dif_generic ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_core iosf_mbi crc32_pclmul ghash_clmulni_intel ppdev aesni_intel lrw gf128mul glue_helper ablk_helper cryptd pcspkr virtio_balloon i2c_piix4 parport_pc parport nfsd nfs_acl lockd grace auth_rpcgss sunrpc ip_tables ext4 mbcache jbd2 ata_generic pata_acpi virtio_blk cirrus drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops crct10dif_pclmul crct10dif_common ttm crc32c_intel 8139too serio_raw drm virtio_pci virtio_ring virtio ata_piix 8139cp mii libata i2c_core floppy
01:35:55:[18368.434751] CPU: 0 PID: 7115 Comm: obd_zombid Tainted: G           OE  ------------   3.10.0-514.6.1.el7_lustre.x86_64 #1
01:35:55:[18368.434751] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2007
01:35:55:[18368.434751] task: ffff88007768edd0 ti: ffff88007afc0000 task.ti: ffff88007afc0000
01:35:55:[18368.434751] RIP: 0010:[<ffffffff81333279>]  [<ffffffff81333279>] __list_del_entry+0x29/0xd0
01:35:55:[18368.434751] RSP: 0018:ffff88007afc3e20  EFLAGS: 00010207
01:35:55:[18368.434751] RAX: 0000000000000000 RBX: ffff88003e395a20 RCX: dead000000000200
01:35:55:[18368.434751] RDX: 0001020002000100 RSI: 0000000000000000 RDI: ffff88003e395a20
01:35:55:[18368.434751] RBP: ffff88007afc3e20 R08: 000000000000ffff R09: 000000000000ffff
01:35:55:[18368.434751] R10: ffff88001ed78690 R11: 000000000000000f R12: ffff88007ab69800
01:35:55:[18368.434751] R13: ffff88007ab69aa0 R14: 0000000000000000 R15: 0000000000000000
01:35:55:[18368.434751] FS:  0000000000000000(0000) GS:ffff88007fc00000(0000) knlGS:0000000000000000
01:35:55:[18368.434751] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
01:35:55:[18368.434751] CR2: 0000000000000000 CR3: 00000000019ba000 CR4: 00000000000406f0
01:35:55:[18368.434751] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
01:35:55:[18368.434751] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
01:35:55:[18368.434751] Stack:
01:35:55:[18368.434751]  ffff88007afc3e50 ffffffffa083b690 ffff88007768edd0 ffff88007768edd0
01:35:55:[18368.434751]  ffff88007768edd0 0000000000000000 ffff88007afc3ec0 ffffffffa083be6d
01:35:55:[18368.434751]  0000000000000000 0000000000000000 ffff880000000000 ffff88007768edd0
01:35:55:[18368.434751] Call Trace:
01:35:55:[18368.434751]  [<ffffffffa083b690>] obd_zombie_impexp_cull+0x1c0/0x930 [obdclass]
01:35:55:[18368.434751]  [<ffffffffa083be6d>] obd_zombie_impexp_thread+0x6d/0x1c0 [obdclass]
01:35:55:[18368.434751]  [<ffffffff810c4fd0>] ? wake_up_state+0x20/0x20
01:35:55:[18368.434751]  [<ffffffffa083be00>] ? obd_zombie_impexp_cull+0x930/0x930 [obdclass]
01:35:55:[18368.434751]  [<ffffffff810b064f>] kthread+0xcf/0xe0
01:35:55:[18368.434751]  [<ffffffff810b0580>] ? kthread_create_on_node+0x140/0x140
01:35:55:[18368.434751]  [<ffffffff81696958>] ret_from_fork+0x58/0x90
01:35:55:[18368.434751]  [<ffffffff810b0580>] ? kthread_create_on_node+0x140/0x140
01:35:55:[18368.434751] Code: 00 00 55 48 8b 17 48 b9 00 01 00 00 00 00 ad de 48 8b 47 08 48 89 e5 48 39 ca 74 29 48 b9 00 02 00 00 00 00 ad de 48 39 c8 74 7a <4c> 8b 00 4c 39 c7 75 53 4c 8b 42 08 4c 39 c7 75 2b 48 89 42 08 
01:35:55:[18368.434751] RIP  [<ffffffff81333279>] __list_del_entry+0x29/0xd0
01:35:55:[18368.434751]  RSP <ffff88007afc3e20>
01:35:55:[18368.434751] CR2: 0000000000000000
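
Reading the dump (an interpretation, not part of the original report): the Code: bytes appear to decode to a load of entry->next into RDX and entry->prev into RAX, a compare of each against the list poison constants, and then a load through RAX, which is the instruction marked as faulting. With RAX = 0 and CR2 = 0, that is consistent with obd_zombie_impexp_cull() passing __list_del_entry() an entry whose prev pointer was NULL. The sketch below only models that order of checks in Python so the register values can be plugged in; the function name classify_list_del is hypothetical, and nothing here says how the entry came to be in that state.

```python
# Illustrative model only: mirrors the order of checks that the Code: bytes
# in the oops appear to decode to. It is not the kernel source.
LIST_POISON1 = 0xDEAD000000000100  # constant loaded into RCX before the first compare
LIST_POISON2 = 0xDEAD000000000200  # constant left in RCX at the time of the fault


def classify_list_del(next_ptr: int, prev_ptr: int) -> str:
    """Return which step of the list-delete checks would trip first."""
    if next_ptr == LIST_POISON1:
        return "next is LIST_POISON1 (entry already deleted)"
    if prev_ptr == LIST_POISON2:
        return "prev is LIST_POISON2 (entry already deleted)"
    if prev_ptr == 0:
        # The modeled code has no NULL check here: it loads prev->next and faults.
        return "NULL dereference reading prev->next"
    return "passes the poison checks"


# Register values copied from the oops above.
rdx_next = 0x0001020002000100  # RDX, loaded from entry->next
rax_prev = 0x0000000000000000  # RAX, loaded from entry->prev

print(classify_list_del(rdx_next, rax_prev))
```

Neither pointer matches a poison value, so the entry does not look like one that had already been removed with list_del(); the next value in RDX does not look like a kernel address either, which may point at an uninitialized or overwritten list head rather than a double delete.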

This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/b6074006-e2c2-11e6-ac3d-5254006e85c2.



 Comments   
Comment by Andreas Dilger [ 05/Aug/20 ]

Closing old issue that has not been seen in a long time.
