Details
-
Bug
-
Resolution: Duplicate
-
Minor
-
None
-
Lustre 2.14.0
-
None
-
RHEL8.2 DNE
-
3
-
9223372036854775807
Description
sanity-lfsck test_35 crashes for DNE el8.2. In the crash at https://testing.whamcloud.com/test_sets/908eafa6-3710-496a-a54c-f236012912ba we see that MDS1,3 (vm8) crashes with:
[17140.684707] kernel BUG at /tmp/rpmbuild-lustre-jenkins-f2uYZr9W/BUILD/lustre-2.13.56_45_g8ad7404/ldiskfs/htree_lock.c:892!
[17140.686567] invalid opcode: 0000 [#1] SMP PTI
[17140.687292] CPU: 1 PID: 988042 Comm: lfsck_namespace Kdump: loaded Tainted: G OE --------- - - 4.18.0-193.6.3.el8_lustre.x86_64 #1
[17140.689355] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
[17140.690417] RIP: 0010:htree_lock_free+0x13/0x20 [ldiskfs]
[17140.691295] Code: c2 74 ec 0f 0b e9 8d c3 34 d9 0f 0b 90 66 2e 0f 1f 84 00 00 00 00 00 66 66 66 66 90 81 7f 1c 0c d1 ea 0d 75 05 e9 6d c3 34 d9 <0f> 0b 90 66 2e 0f 1f 84 00 00 00 00 00 66 66 66 66 90 41 54 55 89
[17140.694283] RSP: 0018:ffffc2f3032b7d90 EFLAGS: 00010297
[17140.695132] RAX: 0000000000000000 RBX: ffffa0b671338c00 RCX: 0000000000000055
[17140.696278] RDX: ffffa0b671de0000 RSI: ffffffffc11b0100 RDI: ffffa0b64ae15800
[17140.697418] RBP: ffffffffc11b0100 R08: 0000000000000001 R09: 0000000000000000
[17140.698562] R10: ffffa0b64ae15c00 R11: ffffa0b64a07ac01 R12: 0000000000000000
[17140.699710] R13: 0000000000000000 R14: ffffa0b671de0000 R15: ffffa0b64f6e9400
[17140.700851] FS: 0000000000000000(0000) GS:ffffa0b67fd00000(0000) knlGS:0000000000000000
[17140.702162] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[17140.703088] CR2: 000055895dbc0648 CR3: 000000000ee0a006 CR4: 00000000000606e0
[17140.704235] Call Trace:
[17140.704777] osd_key_fini+0x1cc/0x770 [osd_ldiskfs]
[17140.705841] key_fini+0x4e/0x150 [obdclass]
[17140.706565] lu_context_fini+0x42/0x1c0 [obdclass]
[17140.707374] lu_env_fini+0x16/0x20 [obdclass]
[17140.708198] lfsck_thread_args_fini+0x30/0x140 [lfsck]
[17140.709061] lfsck_assistant_engine+0x274/0x1b80 [lfsck]
[17140.709964] ? __switch_to_asm+0x41/0x70
[17140.710626] ? finish_wait+0x80/0x80
[17140.711224] ? lfsck_master_engine+0xc90/0xc90 [lfsck]
[17140.712083] kthread+0x112/0x130
[17140.712630] ? kthread_flush_work_fn+0x10/0x10
[17140.713356] ret_from_fork+0x35/0x40
[17140.713957] Modules linked in: dm_flakey osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache rpcrdma ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp ib_ipoib rdma_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_core intel_rapl_msr intel_rapl_common crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcspkr virtio_balloon joydev i2c_piix4 sunrpc dm_mod ip_tables ext4 mbcache jbd2 ata_generic ata_piix libata 8139too 8139cp crc32c_intel serio_raw mii virtio_blk [last unloaded: dm_flakey]
[ 0.000000] Linux version 4.18.0-193.6.3.el8_lustre.x86_64 (jenkins@trevis-305-el8-x8664-2.trevis.whamcloud.com) (gcc version 8.3.1 20190507 (Red Hat 8.3.1-4) (GCC)) #1 SMP Fri Sep 25 21:03:21 UTC 2020
We’ve seen this once before at https://testing.whamcloud.com/test_sets/db725bb2-cb6c-4af0-a16e-d09bd517aa75