Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-2844

NULL pointer deref on unmount

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Blocker
    • Lustre 2.4.0
    • Lustre 2.4.0
    • 2.6.32-279.9.1.1chaos.ch5.1.x86_64
    • 3
    • 6884

    Description

      After reproducing LU-2843, mounting and unmounting the MDT leads to a NULL pointer dereference.

      padlock: VIA PadLock Hash Engine not detected.
      Lustre: Lustre: Build Version: 2.3.58-g6125dec-CHANGED-2.6.32-279.9.1.1chaos.ch5.1.x86_64
      LNet: Added LNI 192.168.122.43@tcp [8/256/0/180]
      LNet: Accept secure, port 988
      Lustre: Echo OBD driver; http://www.lustre.org/
      LDISKFS-fs (sda): recovery complete
      LDISKFS-fs (sda): mounted filesystem with ordered data mode. quota=on. Opts: 
      Lustre: lustre-MDT0000: used disk, loading
      Lustre: 29186:0:(mdt_lproc.c:380:lprocfs_wr_identity_upcall()) lustre-MDT0000: identity upcall set to /home/bass6/lustre-release/lustre/tests/../utils/l_getidentity
      LustreError: 11-0: lustre-MDT0000-lwp-MDT0000: Communicating with 0@lo, operation mds_connect failed with -11.
      LustreError: 28964:0:(llog_osd.c:712:llog_osd_prev_block()) lustre-MDT0000-osd: invalid llog tail at log id 35/0 offset 5033984
      LustreError: 28964:0:(mdd_device.c:323:mdd_changelog_llog_init()) lustre-MDD0000: changelog init failed: rc = -22
      LustreError: 28964:0:(mdd_device.c:398:mdd_changelog_init()) lustre-MDD0000: changelog setup during init failed: rc = -22
      LustreError: 11-0: lustre-MDT0000-lwp-MDT0000: Communicating with 0@lo, operation mds_connect failed with -11.
      Lustre: 28668:0:(client.c:1866:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1361403145/real 1361403145]  req@ffff880015c35000 x1427534649491496/t0(0) o8->lustre-OST0001-osc-MDT0000@0@lo:28/4 lens 400/544 e 0 to 1 dl 1361403150 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
      LustreError: 11-0: lustre-MDT0000-lwp-MDT0000: Communicating with 0@lo, operation mds_connect failed with -11.
      Lustre: Failing over lustre-MDT0000
      BUG: unable to handle kernel NULL pointer dereference at (null)
      IP: [<ffffffff81283626>] __list_add+0x26/0xa0
      PGD 0 
      Oops: 0000 [#1] SMP 
      last sysfs file: /sys/devices/pci0000:00/0000:00:01.1/host0/target0:0:0/0:0:0:0/block/sda/queue/max_sectors_kb
      CPU 1 
      Modules linked in: lustre(U) ofd(U) osp(U) lod(U) ost(U) mdt(U) osd_ldiskfs(U) fsfilt_ldiskfs(U) ldiskfs(U) exportfs mdd(U) mgs(U) lquota(U) jbd obdecho(U) mgc(U) lov(U) osc(U) mdc(U) lmv(U) fid(U) fld(U) ptlrpc(U) obdclass(U) lvfs(U) ksocklnd(U) lnet(U) sha512_generic sha256_generic libcfs(U) ebtable_nat ebtables fuse autofs4 sunrpc ipt_REJECT nf_conntrack_ipv4 nf_defrag_ipv4 iptable_filter ip_tables ip6t_REJECT nf_conntrack_ipv6 nf_defrag_ipv6 xt_state nf_conntrack ip6table_filter ip6_tables ipv6 zfs(P)(U) zcommon(P)(U) znvpair(P)(U) zavl(P)(U) zunicode(P)(U) spl(U) zlib_deflate vhost_net macvtap macvlan tun kvm virtio_balloon virtio_net snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device snd_pcm snd_timer snd soundcore snd_page_alloc i2c_piix4 i2c_core sg ext4 mbcache jbd2 virtio_blk sd_mod crc_t10dif virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: speedstep_lib]
      
      Pid: 29211, comm: umount Tainted: P           ---------------    2.6.32-279.9.1.1chaos.ch5.1.x86_64 #1 Bochs Bochs
      
      crash> bt
      PID: 29211  TASK: ffff88007b6f0040  CPU: 1   COMMAND: "umount"
       #0 [ffff8800182bd5c0] machine_kexec at ffffffff8103283b
       #1 [ffff8800182bd620] crash_kexec at ffffffff810ba492
       #2 [ffff8800182bd6f0] oops_end at ffffffff81501e60
       #3 [ffff8800182bd720] no_context at ffffffff81043bfb
       #4 [ffff8800182bd770] __bad_area_nosemaphore at ffffffff81043e85
       #5 [ffff8800182bd7c0] bad_area at ffffffff81043fae
       #6 [ffff8800182bd7f0] __do_page_fault at ffffffff81044760
       #7 [ffff8800182bd910] do_page_fault at ffffffff81503e3e
       #8 [ffff8800182bd940] page_fault at ffffffff815011f5
          [exception RIP: __list_add+38]
          RIP: ffffffff81283626  RSP: ffff8800182bd9f8  RFLAGS: 00010246
          RAX: 0000000000000000  RBX: ffff8800182bda38  RCX: 00000000ffffffff
          RDX: ffff8800182ee150  RSI: 0000000000000000  RDI: ffff8800182bda38
          RBP: ffff8800182bda18   R8: 0000000000000000   R9: ffff88001636ff80
          R10: 000000000000000f  R11: 0000000000000000  R12: ffff8800182ee150
          R13: 0000000000000000  R14: ffff8800182ee150  R15: ffffffffffffffff
          ORIG_RAX: ffffffffffffffff  CS: 0010  SS: 0018
       #9 [ffff8800182bda20] __mutex_lock_slowpath at ffffffff814ff9ef
      #10 [ffff8800182bda90] mutex_lock at ffffffff814ff8fb
      #11 [ffff8800182bdab0] mdd_lfsck_stop at ffffffffa0c86eec [mdd]
      #12 [ffff8800182bdb30] mdd_iocontrol at ffffffffa0c827bc [mdd]
      #13 [ffff8800182bdb90] mdt_device_fini at ffffffffa0dca7fa [mdt]
      #14 [ffff8800182bdbf0] class_cleanup at ffffffffa079bcb7 [obdclass]
      #15 [ffff8800182bdc70] class_process_config at ffffffffa079d59c [obdclass]
      #16 [ffff8800182bdd00] class_manual_cleanup at ffffffffa079e2d9 [obdclass]
      #17 [ffff8800182bddc0] server_put_super at ffffffffa07aae7c [obdclass]
      #18 [ffff8800182bde30] generic_shutdown_super at ffffffff8117d10b
      #19 [ffff8800182bde50] kill_anon_super at ffffffff8117d1f6
      #20 [ffff8800182bde70] lustre_kill_super at ffffffffa07a0136 [obdclass]
      #21 [ffff8800182bde90] deactivate_super at ffffffff8117e270
      #22 [ffff8800182bdeb0] mntput_no_expire at ffffffff8119a2ef
      #23 [ffff8800182bdee0] sys_umount at ffffffff8119ad8b
      #24 [ffff8800182bdf80] system_call_fastpath at ffffffff8100b0f2
          RIP: 00007f405efc7d67  RSP: 00007fff44ccb308  RFLAGS: 00010202
          RAX: 00000000000000a6  RBX: ffffffff8100b0f2  RCX: 000000000000ffc0
          RDX: 0000000000000000  RSI: 0000000000000000  RDI: 00007f405fe84ad0
          RBP: 00007f405fe84ab0   R8: 0000000000000006   R9: 0000000000000000
          R10: 00007fff44ccb130  R11: 0000000000000246  R12: 0000000000000000
          R13: 0000000000000000  R14: 0000000000000000  R15: 00007f405fe84b30
          ORIG_RAX: 00000000000000a6  CS: 0033  SS: 002b
      

      Attachments

        Activity

          People

            laisiyao Lai Siyao
            nedbass Ned Bass (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: