  1. Lustre
  2. LU-13508

crash in sanity test 160j


Details

    • Type: Bug
    • Resolution: Fixed
    • Priority: Critical
    • Fix Version/s: Lustre 2.14.0
    • Affects Version/s: Lustre 2.14.0
    • Labels: None
    • Severity: 3

    Description

      This seems to only trigger when running in "one unmount/mount per subtest" mode like this:

      FLAKEY=false REFORMAT=yes ONLY="160-999" MDSCOUNT=2 SETUP=setup CLEANUP=cleanup sh sanity.sh
      

      The crash is 100% reproducible in single-node testing and looks like this:

      [421223.630585] Lustre: server umount lustre-OST0001 complete
      [421223.879781] general protection fault: 0000 [#1] SMP DEBUG_PAGEALLOC
      [421223.881119] Modules linked in: ofd(OE) osp(OE) lod(OE) ost(OE) mdt(OE) mdd(OE) mgs(OE) osd_ldiskfs(OE) ldiskfs(OE) lquota(OE) lfsck(OE) obdecho(OE) mgc(OE) lov(OE) mdc(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc_gss(OE) ptlrpc(OE) obdclass(OE) ksocklnd(OE) lnet(OE) libcfs(OE) pcc_cpufreq loop zfs(PO) zunicode(PO) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) jbd2 mbcache crc32_generic crc_t10dif crct10dif_generic crct10dif_common virtio_balloon pcspkr virtio_console i2c_piix4 binfmt_misc ip_tables rpcsec_gss_krb5 ata_generic pata_acpi drm_kms_helper ttm drm ata_piix drm_panel_orientation_quirks serio_raw libata virtio_blk i2c_core floppy [last unloaded: lustre]
      [421223.890017] CPU: 1 PID: 18476 Comm: sh Kdump: loaded Tainted: P           OE  ------------   3.10.0-7.7-debug #1
      [421223.890922] Hardware name: Bochs Bochs, BIOS Bochs 01/01/2011
      [421223.891370] task: ffff8803b185a340 ti: ffff8800ca418000 task.ti: ffff8800ca418000
      [421223.892207] RIP: 0010:[<ffffffff8110a016>]  [<ffffffff8110a016>] module_put+0x26/0xc0
      [421223.893060] RSP: 0018:ffff8800ca41bda8  EFLAGS: 00010202
      [421223.893504] RAX: ffff8800ca41bfd8 RBX: 6b6b6b6b6b6b6b6b RCX: ffff8800ca41bce0
      [421223.894355] RDX: 000000000000000b RSI: 800000038530b060 RDI: 6b6b6b6b6b6b6b6b
      [421223.895185] RBP: ffff8800ca41bdc0 R08: 0000000000000001 R09: 0000000000000067
      [421223.896016] R10: ffff880000000148 R11: 0000000000000000 R12: 6b6b6b6b6b6b6b6b
      [421223.896871] R13: ffff8803dd8d6d30 R14: ffff8803ca1ff168 R15: ffff8803d43ece60
      [421223.897696] FS:  00007ffa9ed7f740(0000) GS:ffff8803f4640000(0000) knlGS:0000000000000000
      [421223.898540] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      [421223.898991] CR2: 00007f119516c8d0 CR3: 0000000001c10000 CR4: 00000000000006e0
      [421223.899845] Call Trace:
      [421223.900249]  [<ffffffff812448e0>] cdev_put+0x20/0x30
      [421223.900684]  [<ffffffff812415c6>] __fput+0x266/0x300
      [421223.901129]  [<ffffffff8124173e>] ____fput+0xe/0x10
      [421223.901565]  [<ffffffff810b4af5>] task_work_run+0xb5/0xf0
      [421223.902010]  [<ffffffff81092395>] do_exit+0x2f5/0xb30
      [421223.902450]  [<ffffffff817e0ee1>] ? system_call_after_swapgs+0xae/0x146
      [421223.902941]  [<ffffffff81092c4f>] do_group_exit+0x3f/0xa0
      [421223.904158]  [<ffffffff81092cc4>] SyS_exit_group+0x14/0x20
      [421223.904605]  [<ffffffff817e0f98>] system_call_fastpath+0x1f/0x24
      [421223.905071]  [<ffffffff817e0ee1>] ? system_call_after_swapgs+0xae/0x146
      [421223.905550] Code: 00 00 00 00 00 0f 1f 44 00 00 55 48 85 ff 48 89 e5 41 55 41 54 49 89 fc 53 74 3c 65 48 8b 04 25 f8 0e 01 00 83 80 44 c0 ff ff 01 <48> 8b 87 30 02 00 00 65 48 ff 40 08 4c 8b 6d 08 0f 1f 44 00 00 
      [421223.907311] RIP  [<ffffffff8110a016>] module_put+0x26/0xc0
      [421223.907761]  RSP <ffff8800ca41bda8>
      

      I think it must be due to some recent module changes made for upstreaming/cleanup reasons?
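
One detail in the register dump is worth calling out: RBX, RDI and R12 all hold 6b6b6b6b6b6b6b6b. The byte 0x6b is the kernel's slab free-poison value (POISON_FREE in include/linux/poison.h), so module_put() was most likely handed a pointer read out of memory that had already been freed. The trace alone does not say which object that was. Below is a minimal sketch for spotting such registers in an oops. The function name, the regular expression and the pattern table are illustrative assumptions, not part of any Lustre or kernel tool.

```python
import re

# Well-known kernel poison bytes (include/linux/poison.h).
POISON_BYTES = {
    0x6b: "POISON_FREE (slab object already freed)",
    0x5a: "POISON_INUSE (slab object allocated but uninitialized)",
}

# Matches general-purpose register dumps such as "RBX: 6b6b6b6b6b6b6b6b".
# Segment-prefixed values like "RSP: 0018:ffff..." are deliberately skipped.
REG_RE = re.compile(r"\b(R[A-Z0-9]{1,3}):\s*([0-9a-f]{16})\b")


def find_poisoned_registers(oops_text):
    """Return {register: description} for registers filled with one poison byte."""
    hits = {}
    for name, value in REG_RE.findall(oops_text):
        raw = bytes.fromhex(value)
        # A poisoned pointer is the same byte repeated eight times.
        if len(set(raw)) == 1 and raw[0] in POISON_BYTES:
            hits[name] = POISON_BYTES[raw[0]]
    return hits


# Register lines copied from the crash log in this report.
sample = """\
RAX: ffff8800ca41bfd8 RBX: 6b6b6b6b6b6b6b6b RCX: ffff8800ca41bce0
RDX: 000000000000000b RSI: 800000038530b060 RDI: 6b6b6b6b6b6b6b6b
RBP: ffff8800ca41bdc0 R08: 0000000000000001 R09: 0000000000000067
R10: ffff880000000148 R11: 0000000000000000 R12: 6b6b6b6b6b6b6b6b
"""

for reg, why in sorted(find_poisoned_registers(sample).items()):
    print(f"{reg}: {why}")
```

Run against the register lines from this report, the sketch flags RBX, RDI and R12, which is consistent with a use-after-free on the path from cdev_put() into module_put().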

            People

              hongchao.zhang Hongchao Zhang
              green Oleg Drokin
