Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-11622

BUG: scheduling while atomic: lfsck

    XMLWordPrintable

Details

    • Bug
    • Resolution: Duplicate
    • Minor
    • None
    • Lustre 2.10.5
    • lustre 2.10.5_2.chaos (OST)
      kernel 3.10.0-862.14.4.1chaos.ch6.x86_64
    • 3
    • 9223372036854775807

    Description

      Console log reported the following on all or most OSTs when lfsck was run:

       BUG: scheduling while atomic: lfsck/152563/0x00000002
      Modules linked in: osp(OE) ofd(OE) lfsck(OE) ost(OE) mgc(OE) osd_zfs(OE) lquota(OE) fid(OE) fld(OE) ptlrpc(OE) obdclass(OE) ko2iblnd(OE) lnet(OE) libcfs(OE) sb_edac intel_powerclamp coretemp intel_rapl iosf_mbi rpcrdma kvm ib_iser iTCO_wdt irqbypass iTCO_vendor_support sg lpc_ich i2c_i801 joydev dm_round_robin pcspkr ipmi_si ioatdma acpi_cpufreq shpchp sch_fq_codel ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm zfs(POE) iw_cxgb4 iw_cxgb3 zunicode(POE) zavl(POE) icp(POE) zcommon(POE) znvpair(POE) spl(OE) binfmt_misc msr_safe(OE) ip_tables nfsv3 nfs_acl rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache overlay(T) ext4 mbcache jbd2 dm_service_time sd_mod crc_t10dif crct10dif_generic mlx4_ib mlx4_en ib_core be2iscsi bnx2i cnic uio cxgb4i cxgb4 8021q garp mrp stp llc cxgb3i cxgb3 mdio libcxgbi libcxgb mgag200 qla4xxx drm_kms_helper crct10dif_pclmul syscopyarea crct10dif_common sysfillrect crc32_pclmul iscsi_boot_sysfs igb crc32c_intel sysimgblt fb_sys_fops ghash_clmulni_intel isci ahci ttm aesni_intel libsas dca lrw libahci scsi_transport_sas gf128mul glue_helper dm_multipath mlx4_core ablk_helper drm ptp cryptd libata pps_core i2c_algo_bit i2c_core devlink wmi ipmi_devintf ipmi_msghandler sunrpc dm_mirror dm_region_hash dm_log dm_mod iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi
      CPU: 9 PID: 152563 Comm: lfsck Kdump: loaded Tainted: P           OE  ------------ T 3.10.0-862.14.4.1chaos.ch6.x86_64 #1
      Hardware name: CRAY CRAY-GB512X-CN/S2600JF, BIOS SE5C600.86B.02.03.0003.041920141333 04/19/2014
      Call Trace:
       [<ffffffff9f334f01>] dump_stack+0x19/0x1b
       [<ffffffff9f32f5af>] __schedule_bug+0x64/0x72
       [<ffffffff9f33a88c>] __schedule+0x8ec/0x910
       [<ffffffffc0b22e12>] ? zio_execute+0xa2/0x100 [zfs]
       [<ffffffff9f33a8d9>] schedule+0x29/0x70
       [<ffffffff9f338159>] schedule_timeout+0x289/0x310
       [<ffffffffc0b24553>] ? zio_vdev_io_start+0x243/0x370 [zfs]
       [<ffffffffc07bb428>] ? taskq_member+0x18/0x30 [spl]
       [<ffffffff9ecffc92>] ? ktime_get_ts64+0x52/0xf0
       [<ffffffff9f339dad>] io_schedule_timeout+0xad/0x130
       [<ffffffff9ecc20a6>] ? prepare_to_wait_exclusive+0x56/0x90
       [<ffffffff9f339e48>] io_schedule+0x18/0x20
       [<ffffffffc07c06e2>] cv_wait_common+0xc2/0x160 [spl]
       [<ffffffff9ecc24f0>] ? wake_up_atomic_t+0x30/0x30
       [<ffffffffc07c07b8>] __cv_wait_io+0x18/0x20 [spl]
       [<ffffffffc0b267f3>] zio_wait+0x113/0x1d0 [zfs]
       [<ffffffffc0a639fe>] dbuf_read+0x69e/0xa30 [zfs]
       [<ffffffffc0a65e2a>] __dbuf_hold_impl+0x33a/0x5f0 [zfs]
       [<ffffffffc0a66182>] dbuf_hold_impl+0xa2/0xd0 [zfs]
       [<ffffffffc0a661e5>] dbuf_hold_level+0x35/0x60 [zfs]
       [<ffffffffc0a672e6>] dbuf_hold+0x16/0x20 [zfs]
       [<ffffffffc0a6f8dc>] dmu_buf_hold_noread_by_dnode+0x3c/0xc0 [zfs]
       [<ffffffffc0a6fb1f>] dmu_buf_hold_by_dnode+0x2f/0x80 [zfs]
       [<ffffffffc0aea4a9>] zap_lockdir_by_dnode.constprop.12+0x49/0xc0 [zfs]
       [<ffffffffc0aec8b7>] zap_lookup_norm_by_dnode+0x57/0xc0 [zfs]
       [<ffffffffc158c555>] ? osd_get_name_n_idx+0xb5/0xd00 [osd_zfs]
       [<ffffffffc0aec94e>] zap_lookup_by_dnode+0x2e/0x30 [zfs]
       [<ffffffffc158d27c>] osd_fid_lookup+0xdc/0x3a0 [osd_zfs]
       [<ffffffffc1586c05>] osd_object_init+0xf5/0x850 [osd_zfs]
       [<ffffffffc117391d>] ? lu_object_add+0x2d/0x40 [obdclass]
       [<ffffffffc11764f5>] lu_object_alloc+0xe5/0x320 [obdclass]
       [<ffffffffc1176910>] lu_object_find_at+0x180/0x2b0 [obdclass]
       [<ffffffffc1177a98>] dt_locate_at+0x18/0xb0 [obdclass]
       [<ffffffffc15fadb2>] lfsck_layout_slave_prep+0x392/0x5b0 [lfsck]
       [<ffffffffc15d1fe6>] lfsck_master_engine+0x196/0x1450 [lfsck]
       [<ffffffffc15d1e50>] ? lfsck_master_oit_engine+0x11a0/0x11a0 [lfsck]
       [<ffffffff9ecc12d1>] kthread+0xd1/0xe0
       [<ffffffff9ecc1200>] ? insert_kthread_work+0x40/0x40
       [<ffffffff9f347837>] ret_from_fork_nospec_begin+0x21/0x21
       [<ffffffff9ecc1200>] ? insert_kthread_work+0x40/0x40

      Attachments

        Activity

          People

            wc-triage WC Triage
            ofaaland Olaf Faaland
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: