Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-15932

Crash in sanity test 160s osd_attr_get

    XMLWordPrintable

Details

    • Bug
    • Resolution: Unresolved
    • Minor
    • None
    • Lustre 2.16.0
    • None
    • 3
    • 9223372036854775807

    Description

      Hit this crash on master-next, for a recently landed patch

      [67331.365407] Lustre: DEBUG MARKER: == sanity test 160s: changelog garbage collect on idle records acceptance-small.sh acl aiocp.c auster badarea_io badarea_io-badarea_io.o badarea_io.c cfg check_fallocate check_fallocate.c check_fallocate.o check_fhandle_syscalls check_fhandle_syscalls.c c
      [67334.927458] Lustre: 31259:0:(mdd_dir.c:895:mdd_changelog_store()) lustre-MDD0000: starting changelog garbage collection
      [67334.933112] Lustre: 13419:0:(mdd_trans.c:160:mdd_chlg_garbage_collect()) lustre-MDD0000: force deregister of changelog user cl24 idle for 864003s with 500000004 unprocessed records
      [67334.950027] general protection fault: 0000 [#1] SMP DEBUG_PAGEALLOC
      [67334.950643] Modules linked in: lustre(OE) ofd(OE) osp(OE) lod(OE) ost(OE) mdt(OE) mdd(OE) mgs(OE) osd_zfs(OE) lquota(OE) lfsck(OE) obdecho(OE) mgc(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc_gss(OE) ptlrpc(OE) obdclass(OE) ksocklnd(OE) lnet(OE) libcfs(OE) nfsd ext4 mbcache jbd2 loop zfs(PO) zunicode(PO) zlua(PO) zcommon(PO) znvpair(PO) zavl(PO) icp(PO) spl(O) crc32_generic crc_t10dif crct10dif_generic crct10dif_common virtio_balloon pcspkr i2c_piix4 virtio_console ip_tables rpcsec_gss_krb5 drm_kms_helper ata_generic ttm pata_acpi drm drm_panel_orientation_quirks virtio_blk ata_piix serio_raw floppy i2c_core libata [last unloaded: libcfs]
      [67334.951005] CPU: 11 PID: 31259 Comm: mdt05_003 Kdump: loaded Tainted: P        W  OE  ------------   3.10.0-7.9-debug #2
      [67334.951005] Hardware name: Red Hat KVM, BIOS 1.15.0-1.module_el8.6.0+1087+b42c8331 04/01/2014
      [67334.951005] task: ffff8800b95449d0 ti: ffff8801f31e4000 task.ti: ffff8801f31e4000
      [67334.951005] RIP: 0010:[<ffffffffa1164bac>]  [<ffffffffa1164bac>] osd_attr_get+0x6c/0x4d0 [osd_zfs]
      [67334.951005] RSP: 0018:ffff8801f31e76e0  EFLAGS: 00010282
      [67334.951005] RAX: 6b6b6b6b6b6b6b6b RBX: ffff8801e3349738 RCX: ffff8801f31e7fd8
      [67334.951005] RDX: 0000000000000000 RSI: 0000000000000015 RDI: ffffffff81a98ea7
      [67334.951005] RBP: ffff8801f31e7730 R08: 00000000ffffffff R09: ffff8802f6b33fc8
      [67334.951005] R10: ffff88028bbcea08 R11: ffff88023db9b548 R12: ffff8802f86f5648
      [67334.951005] R13: ffff880187c08000 R14: ffff8801e3349738 R15: ffff8802f86f56e8
      [67334.951005] FS:  0000000000000000(0000) GS:ffff880331cc0000(0000) knlGS:0000000000000000
      [67334.951005] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
      [67334.951005] CR2: 00007f3d1b48d000 CR3: 0000000288b26000 CR4: 00000000000007e0
      [67334.951005] Call Trace:
      [67334.951005]  [<ffffffffa0374680>] llog_osd_write_rec+0x330/0x1b60 [obdclass]
      [67334.951005]  [<ffffffffa13cb995>] ? lod_xattr_set_internal+0xa5/0x2b0 [lod]
      [67334.951005]  [<ffffffffa11e9a26>] mdd_changelog_write_rec+0xc6/0x120 [mdd]
      [67334.951005]  [<ffffffffa1167ca9>] ? osd_attr_set+0x4b9/0xf50 [osd_zfs]
      [67334.951005]  [<ffffffffa0363760>] llog_write_rec+0x290/0x590 [obdclass]
      [67334.951005]  [<ffffffffa0369691>] llog_cat_add_rec+0x1e1/0x990 [obdclass]
      [67334.951005]  [<ffffffffa03605ff>] llog_add+0x17f/0x1f0 [obdclass]
      [67334.951005]  [<ffffffffa11e9c8e>] mdd_changelog_store+0x17e/0x580 [mdd]
      [67334.951005]  [<ffffffffa11ea755>] mdd_changelog_ns_store+0x3d5/0x910 [mdd]
      [67334.951005]  [<ffffffffa11fdc18>] ? mdd_attr_set_internal+0xc8/0x2e0 [mdd]
      [67334.951005]  [<ffffffffa11fdea5>] ? mdd_update_time+0x75/0x1d0 [mdd]
      [67334.951005]  [<ffffffffa11ec7e4>] mdd_create+0x1554/0x1ce0 [mdd]
      [67334.951005]  [<ffffffffa1299ab1>] mdt_create+0xbf1/0x11f0 [mdt]
      [67334.951005]  [<ffffffffa129a450>] mdt_reint_create+0x3a0/0x460 [mdt]
      [67334.951005]  [<ffffffffa129e157>] mdt_reint_rec+0x87/0x240 [mdt]
      [67334.951005]  [<ffffffffa1273cec>] mdt_reint_internal+0x76c/0xb50 [mdt]
      [67334.951005]  [<ffffffffa127e4d7>] mdt_reint+0x67/0x150 [mdt]
      [67334.951005]  [<ffffffffa06ba02a>] tgt_request_handle+0x93a/0x19c0 [ptlrpc]
      [67334.951005]  [<ffffffffa02c7a5e>] ? libcfs_nid2str_r+0xfe/0x130 [lnet]
      [67334.951005]  [<ffffffffa06693a0>] ptlrpc_server_handle_request+0x250/0xc30 [ptlrpc]
      [67334.951005]  [<ffffffffa066b059>] ptlrpc_main+0xbd9/0x15f0 [ptlrpc]
      [67334.951005]  [<ffffffff814119f9>] ? do_raw_spin_unlock+0x49/0x90
      [67334.951005]  [<ffffffffa066a480>] ? ptlrpc_wait_event+0x620/0x620 [ptlrpc]
      [67334.951005]  [<ffffffff810ba114>] kthread+0xe4/0xf0
      [67334.951005]  [<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
      [67334.951005]  [<ffffffff817ede5d>] ret_from_fork_nospec_begin+0x7/0x21
      [67334.951005]  [<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
      [67334.951005] Code: 48 3d 20 c5 18 a1 0f 84 73 02 00 00 e8 69 66 02 00 66 0f 1f 44 00 00 4d 8d bc 24 a0 00 00 00 4c 89 ff e8 28 b2 67 e0 49 8b 04 24 <f6> 40 1c 01 0f 84 d2 03 00 00 41 f6 84 24 74 01 00 00 01 0f 85 

      Crashdump and such in http://testing.linuxhacker.ru/lustre-reports/external/crashes/boilpot-bigmem-18-2022-06-08-17:49:52/

      Attachments

        Activity

          People

            wc-triage WC Triage
            green Oleg Drokin
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: