Lustre / LU-19756

WARNING: CPU: 2 PID: 9729 at mm/page-writeback.c:2796 __folio_mark_dirty+0x208/0x260

Details

    • Bug
    • Resolution: Unresolved
    • Medium

    Description

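      sanity test 17n triggers a kernel WARNING in __folio_mark_dirty() while unmounting the MDT (umount of /mnt/lustre-mds), followed by JBD2 "Spotted dirty metadata buffer" messages: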

      [  742.828828] Lustre: DEBUG MARKER: == sanity test 17n: run e2fsck against master/slave MDT which contains remote dir ========================================================== 11:19:22 (1767673162)
      [  745.769018] Lustre: DEBUG MARKER: grep -c /mnt/lustre-mds' ' /proc/mounts || true
      [  747.165319] Lustre: DEBUG MARKER: umount -d /mnt/lustre-mds
      [  747.912097] Lustre: Failing over lustre-MDT0000
      [  748.289834] Lustre: lustre-MDT0000: Not available for connect from 192.168.122.51@tcp (stopping)
      [  748.301755] ------------[ cut here ]------------
      [  748.303743] WARNING: CPU: 2 PID: 9729 at mm/page-writeback.c:2796 __folio_mark_dirty+0x208/0x260
      [  748.305181] Modules linked in: osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) rfkill(E) snd_hda_codec_generic(E) snd_hda_intel(E) snd_intel_dspcfg(E) snd_hda_codec(E) snd_hda_core(E) snd_hwdep(E) snd_seq(E) snd_seq_device(E) intel_rapl_msr(E) intel_rapl_common(E) snd_pcm(E) kvm_amd(E) iTCO_wdt(E) intel_pmc_bxt(E) ccp(E) iTCO_vendor_support(E) sunrpc(E) snd_timer(E) virtio_gpu(E) kvm(E) virtio_dma_buf(E) i2c_i801(E) snd(E) i2c_mux(E) pcspkr(E) drm_shmem_helper(E) i2c_smbus(E) lpc_ich(E) soundcore(E) virtio_balloon(E) drm_kms_helper(E) joydev(E) drm(E) fuse(E) ext4(E) mbcache(E) jbd2(E) sr_mod(E) cdrom(E) sg(E) crct10dif_pclmul(E) crc32_pclmul(E) ahci(E) crc32c_intel(E) libahci(E) polyval_clmulni(E) polyval_generic(E) virtio_net(E) ghash_clmulni_intel(E) libata(E) virtio_blk(E) virtio_scsi(E) virtio_console(E) net_failover(E) failover(E) serio_raw(E)
      [  748.313593] CPU: 2 UID: 0 PID: 9729 Comm: umount Kdump: loaded Tainted: G           OE      6.12.6-1.ldiskfs.gcov.el9.x86_64 #1
      [  748.314749] Tainted: [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
      [  748.315895] Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.16.3-debian-1.16.3-2 04/01/2014
      [  748.317052] RIP: 0010:__folio_mark_dirty+0x208/0x260
      [  748.318207] Code: 48 83 05 02 8b 94 07 01 e9 7b fe ff ff 48 83 05 dd 8a 94 07 01 e9 01 ff ff ff 48 83 05 e0 8a 94 07 01 48 83 05 e8 8a 94 07 01 <0f> 0b 48 83 05 e6 8a 94 07 01 e9 4f fe ff ff 48 83 05 59 65 94 07
      [  748.320662] RSP: 0018:ffffc9000bc7f128 EFLAGS: 00010002
      [  748.321831] RAX: 0017ffffc0004035 RBX: 0000000000000001 RCX: fffffffface85dbb
      [  748.322973] RDX: 1ffffd4000c02758 RSI: 0000000000000008 RDI: ffffea0006013ac0
      [  748.324069] RBP: ffffea0006013ac0 R08: 0000000000000001 R09: fffff94000c02758
      [  748.325153] R10: ffffea0006013ac7 R11: fffff94000c02759 R12: ffff888161d2d2f0
      [  748.326459] R13: 0000000000000206 R14: ffff888161d2d2f8 R15: ffff888161d2d3a8
      [  748.327537] FS:  00007f3ffa699580(0000) GS:ffff888235b00000(0000) knlGS:0000000000000000
      [  748.328601] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      [  748.329630] CR2: 000055ee3835d038 CR3: 000000012ec6e000 CR4: 0000000000750ef0
      [  748.330644] PKRU: 55555554
      [  748.331630] Call Trace:
      [  748.332588]  <TASK>
      [  748.333548]  ? show_regs+0x87/0xa0
      [  748.334512]  ? __folio_mark_dirty+0x208/0x260
      [  748.335433]  ? __warn+0x109/0x260
      [  748.336862]  ? __folio_mark_dirty+0x208/0x260
      [  748.338500]  ? __report_bug+0x232/0x270
      [  748.339361]  ? __folio_mark_dirty+0x208/0x260
      [  748.340195]  ? report_bug+0xa6/0x210
      [  748.341018]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.341838]  ? handle_bug+0x5b/0xa0
      [  748.342644]  ? exc_invalid_op+0x14/0x50
      [  748.343434]  ? asm_exc_invalid_op+0x16/0x20
      [  748.344233]  ? __folio_mark_dirty+0x19b/0x260
      [  748.345004]  ? __folio_mark_dirty+0x208/0x260
      [  748.345748]  ? __folio_mark_dirty+0x19b/0x260
      [  748.346480]  block_dirty_folio+0x157/0x1e0
      [  748.347184]  ldiskfs_dirty_folio+0x7f/0x130 [ldiskfs]
      [  748.347947]  folio_mark_dirty+0xbe/0x150
      [  748.348628]  ldiskfs_dirty_journalled_data+0x3a/0x80 [ldiskfs]
      [  748.349320]  __ldiskfs_block_zero_page_range+0x4fe/0x720 [ldiskfs]
      [  748.350033]  ldiskfs_block_zero_page_range+0xe7/0x1b0 [ldiskfs]
      [  748.350743]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.351380]  ? __ldiskfs_journal_start_sb+0x215/0x280 [ldiskfs]
      [  748.352087]  ldiskfs_block_truncate_page.isra.0+0xd5/0x160 [ldiskfs]
      [  748.352789]  ldiskfs_truncate+0x980/0xdc0 [ldiskfs]
      [  748.353474]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.354074]  ? __might_sleep+0xb2/0x1b0
      [  748.354677]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.355243]  ? __pfx_ldiskfs_truncate+0x10/0x10 [ldiskfs]
      [  748.355863]  ? __pfx_down_write+0x10/0x10
      [  748.356410]  osd_drop_preallocated_space+0x87/0x760 [osd_ldiskfs]
      [  748.356970]  osd_object_delete+0x656/0x970 [osd_ldiskfs]
      [  748.357519]  lu_object_free.isra.0+0x27d/0x810 [obdclass]
      [  748.358089]  ? __pfx_lu_object_free.isra.0+0x10/0x10 [obdclass]
      [  748.358665]  lu_site_purge_objects+0x89f/0xf20 [obdclass]
      [  748.359210]  ? __pfx_lu_site_purge_objects+0x10/0x10 [obdclass]
      [  748.359765]  ? __pfx___call_rcu_common.constprop.0+0x10/0x10
      [  748.360225]  ? __pfx_lu_fid_hash+0x10/0x10 [obdclass]
      [  748.360738]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.361163]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.361598]  ? do_raw_spin_unlock+0x17c/0x280
      [  748.362013]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.362434]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.362841]  mdt_stack_fini+0x1b5/0x1170 [mdt]
      [  748.363288]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.363715]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.364105]  ? kfree+0x15a/0x610
      [  748.364512]  ? __pfx_mdt_stack_fini+0x10/0x10 [mdt]
      [  748.364927]  ? mdt_fld_fini.isra.0+0x249/0x460 [mdt]
      [  748.365347]  mdt_fini+0x70f/0xbf0 [mdt]
      [  748.365789]  ? __pfx_mdt_fini+0x10/0x10 [mdt]
      [  748.366214]  mdt_device_fini+0x9f/0x200 [mdt]
      [  748.366645]  ldto_device_fini.isra.0+0xa1/0x200 [obdclass]
      [  748.367104]  obd_precleanup.isra.0+0x140/0x510 [obdclass]
      [  748.367583]  ? __pfx_obd_precleanup.isra.0+0x10/0x10 [obdclass]
      [  748.368045]  ? class_disconnect_exports+0x50e/0x970 [obdclass]
      [  748.368558]  ? __pfx_class_disconnect_exports+0x10/0x10 [obdclass]
      [  748.369028]  ? do_raw_spin_unlock+0x17c/0x280
      [  748.369450]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.369857]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.370420]  class_cleanup+0x846/0x10a0 [obdclass]
      [  748.371089]  ? stack_depot_save_flags+0x224/0x470
      [  748.371535]  ? __pfx_class_cleanup+0x10/0x10 [obdclass]
      [  748.372030]  ? do_raw_spin_unlock+0x17c/0x280
      [  748.372468]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.372895]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.373304]  ? class_name2obd+0x1e6/0x260 [obdclass]
      [  748.373786]  ? __pfx_class_name2obd+0x10/0x10 [obdclass]
      [  748.374269]  ? server_put_super+0x1319/0x19a0 [ptlrpc]
      [  748.374859]  class_process_config+0x2117/0x3490 [obdclass]
      [  748.375369]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.375780]  ? local_clock_noinstr+0x9/0xc0
      [  748.376227]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.376633]  ? __pfx_class_process_config+0x10/0x10 [obdclass]
      [  748.377103]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.377511]  ? kasan_save_alloc_info+0x43/0x70
      [  748.377918]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.378315]  ? __kasan_kmalloc+0xdc/0xf0
      [  748.378718]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.379121]  ? __asan_memcpy+0x73/0xb0
      [  748.379523]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.379923]  ? lustre_cfg_init+0x196/0x330 [obdclass]
      [  748.380384]  class_manual_cleanup+0x4af/0x1180 [obdclass]
      [  748.380866]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.381313]  ? __pfx_class_manual_cleanup+0x10/0x10 [obdclass]
      [  748.381774]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.382169]  ? class_name2obd+0x1e6/0x260 [obdclass]
      [  748.382633]  ? __pfx_class_name2obd+0x10/0x10 [obdclass]
      [  748.383093]  ? get_mapping_status+0x1896/0x1a50
      [  748.383505]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.383910]  server_put_super+0x1319/0x19a0 [ptlrpc]
      [  748.384440]  ? __pfx_server_put_super+0x10/0x10 [ptlrpc]
      [  748.384966]  ? __pfx_fsnotify_sb_delete+0x10/0x10
      [  748.385378]  generic_shutdown_super+0x182/0x3f0
      [  748.385794]  kill_anon_super+0x3e/0x80
      [  748.386209]  lustre_kill_super+0x66/0xb0 [lustre]
      [  748.386675]  deactivate_locked_super+0xc4/0x170
      [  748.387093]  deactivate_super+0xba/0xe0
      [  748.387523]  cleanup_mnt+0x207/0x350
      [  748.387946]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.388352]  __cleanup_mnt+0x16/0x30
      [  748.388766]  task_work_run+0x16a/0x2b0
      [  748.389176]  ? __pfx_task_work_run+0x10/0x10
      [  748.389580]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.389991]  ? __audit_syscall_exit+0x156/0x1c0
      [  748.390392]  ? srso_alias_return_thunk+0x5/0xfbef5
      [  748.390791]  syscall_exit_to_user_mode+0x2f5/0x330
      [  748.391183]  do_syscall_64+0x6c/0x180
      [  748.391582]  entry_SYSCALL_64_after_hwframe+0x76/0x7e
      [  748.391981] RIP: 0033:0x7f3ffa50e56b
      [  748.392379] Code: b3 a8 0e 00 f7 d8 64 89 01 48 83 c8 ff c3 90 f3 0f 1e fa 31 f6 e9 05 00 00 00 0f 1f 44 00 00 f3 0f 1e fa b8 a6 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 05 c3 0f 1f 40 00 48 8b 15 79 a8 0e 00 f7 d8
      [  748.393269] RSP: 002b:00007fffb62bc248 EFLAGS: 00000246 ORIG_RAX: 00000000000000a6
      [  748.393753] RAX: 0000000000000000 RBX: 0000565194d972f0 RCX: 00007f3ffa50e56b
      [  748.394212] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000565194d9aa10
      [  748.394705] RBP: 0000565194d970c0 R08: 0000000000000000 R09: 0000000000000000
      [  748.395168] R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
      [  748.395672] R13: 0000565194d9aa10 R14: 0000565194d971d0 R15: 0000565194d970c0
      [  748.396148]  </TASK>
      [  748.396630] ---[ end trace 0000000000000000 ]---
      [  748.397815] JBD2: Spotted dirty metadata buffer (dev = vdb, blocknr = 57581). There's a risk of filesystem corruption in case of system crash.
      [  748.400665] JBD2: Spotted dirty metadata buffer (dev = vdb, blocknr = 57567). There's a risk of filesystem corruption in case of system crash.
      [  748.407730] JBD2: Spotted dirty metadata buffer (dev = vdb, blocknr = 57569). There's a risk of filesystem corruption in case of system crash.
      [  748.410505] JBD2: Spotted dirty metadata buffer (dev = vdb, blocknr = 58455). There's a risk of filesystem corruption in case of system crash.
      [  748.413746] JBD2: Spotted dirty metadata buffer (dev = vdb, blocknr = 57577). There's a risk of filesystem corruption in case of system crash.
      [  748.417467] JBD2: Spotted dirty metadata buffer (dev = vdb, blocknr = 57573). There's a risk of filesystem corruption in case of system crash.
      [  748.419489] JBD2: Spotted dirty metadata buffer (dev = vdb, blocknr = 57564). There's a risk of filesystem corruption in case of system crash.
      [  748.443354] JBD2: Spotted dirty metadata buffer (dev = vdb, blocknr = 58058). There's a risk of filesystem corruption in case of system crash.
      [  748.451488] JBD2: Spotted dirty metadata buffer (dev = vdb, blocknr = 57587). There's a risk of filesystem corruption in case of system crash.
      [  748.457673] LustreError: 9729:0:(obd_class.h:478:obd_check_dev()) Device 12 not setup
      [  748.645052] LDISKFS-fs (vdb): unmounting filesystem 2b270575-8955-4605-9b2e-77ec2806c940.
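
To list the affected blocks from a console log like the one above, the JBD2 lines can be picked out mechanically. The following is a minimal sketch, assuming the message text matches the lines shown here exactly; `parse_jbd2_dirty_line()` is a hypothetical helper written for this note, not part of Lustre, ldiskfs, or jbd2.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdio.h>
#include <string.h>

/*
 * Hypothetical helper: extract the device name and block number from one
 * console line of the form
 *   "... JBD2: Spotted dirty metadata buffer (dev = vdb, blocknr = 57581). ..."
 * Returns true and fills dev/blocknr on a match, false otherwise.
 */
static bool parse_jbd2_dirty_line(const char *line, char dev[32],
				  unsigned long long *blocknr)
{
	static const char tag[] = "JBD2: Spotted dirty metadata buffer (dev = ";
	const char *p;

	assert(line != NULL && dev != NULL && blocknr != NULL);

	/* Skip the timestamp prefix by locating the fixed message text. */
	p = strstr(line, tag);
	if (p == NULL)
		return false;
	p += sizeof(tag) - 1;

	/* %31[^,] reads the device name up to the comma, bounded to 31 chars. */
	return sscanf(p, "%31[^,], blocknr = %llu", dev, blocknr) == 2;
}
```

Feeding each log line to the helper and printing the matches gives the list of block numbers to compare against the e2fsck output of the test.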
      

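      The warning location (mm/page-writeback.c:2796) appears to correspond to the WARN_ON_ONCE() in __folio_mark_dirty(), which fires when a folio that is not uptodate is marked dirty (source as of v6.12.63):

      https://elixir.bootlin.com/linux/v6.12.63/source/mm/page-writeback.c#L2789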

      void __folio_mark_dirty(struct folio *folio, struct address_space *mapping,
      			     int warn)
      {
      	unsigned long flags;
      
      	xa_lock_irqsave(&mapping->i_pages, flags);
      	if (folio->mapping) {	/* Race with truncate? */
      		WARN_ON_ONCE(warn && !folio_test_uptodate(folio));
      		folio_account_dirtied(folio, mapping);
      		__xa_set_mark(&mapping->i_pages, folio_index(folio),
      				PAGECACHE_TAG_DIRTY);
      	}
      	xa_unlock_irqrestore(&mapping->i_pages, flags);
      }
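
The condition guarding the warning can be read directly from the function above: the folio must still have a mapping, the caller must pass a non-zero `warn`, and the folio must not be uptodate. Since the warning fired in the trace (reached via block_dirty_folio() from the ldiskfs truncate path), all three held for that folio. The sketch below restates that predicate in user space; `struct model_folio` and `model_would_warn()` are hypothetical stand-ins for illustration, not kernel structures or APIs.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical user-space stand-in; this is NOT the kernel's struct folio. */
struct model_folio {
	bool has_mapping;	/* models folio->mapping != NULL */
	bool uptodate;		/* models folio_test_uptodate(folio) */
};

/*
 * Returns true when the modeled WARN_ON_ONCE() condition holds:
 * the folio is still attached to a mapping, the caller asked for the
 * check (warn != 0), and the folio is not uptodate.
 */
static bool model_would_warn(const struct model_folio *folio, int warn)
{
	assert(folio != NULL);

	if (!folio->has_mapping)	/* raced with truncate: check is skipped */
		return false;
	return warn && !folio->uptodate;
}
```

Under this model, the only state that warns is a mapped, non-uptodate folio dirtied with `warn` set, which suggests looking at why the folio zeroed by the truncate path was not marked uptodate before being dirtied.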
      

          People

            wc-triage WC Triage
            stancheff Shaun Tancheff