Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-9360

lustre-initialization-1 failed, MDS crash

    XMLWordPrintable

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Duplicate
    • Affects Version/s: Lustre 2.10.0
    • Fix Version/s: None
    • Labels:
      None
    • Environment:
      master build #3558, el6.8 server/client
    • Severity:
      3
    • Rank (Obsolete):
      9223372036854775807

      Description

      ldiskfs: https://testing.hpdd.intel.com/test_sessions/3158fa00-d534-4820-84a0-ab9e2f179306
      zfs: https://testing.hpdd.intel.com/test_sessions/be2a47a0-e2ef-485e-952e-d11206fe309c

      The latest master build #3558, el6.8 server/client, MDS crash during the initialization for ldiskfs and zfs

      MDS console

      01:48:26:Lustre: DEBUG MARKER: /usr/sbin/lctl conf_param lustre.sys.jobid_var='procname_uid'
      01:48:26:Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n osd-ldiskfs.lustre-MDT0000.quota_slave.enabled
      01:48:26:Lustre: DEBUG MARKER: /usr/sbin/lctl conf_param lustre.quota.mdt=ug3
      01:48:26:Lustre: DEBUG MARKER: /usr/sbin/lctl conf_param lustre.quota.ost=ug3
      01:48:26:BUG: unable to handle kernel NULL pointer dereference at (null)
      01:48:26:IP: [<ffffffffa041fca9>] cfs_hash_lookup+0x29/0xa0 [libcfs]
      01:48:26:PGD 0 
      01:48:26:Oops: 0000 [#1] SMP 
      01:48:26:last sysfs file: /sys/devices/system/cpu/online
      01:48:26:CPU 0 
      01:48:26:Modules linked in: osp(U) mdd(U) lod(U) mdt(U) lfsck(U) mgs(U) mgc(U) osd_ldiskfs(U) lquota(U) lustre(U) lov(U) mdc(U) fid(U) lmv(U) fld(U) ksocklnd(U) ptlrpc(U) obdclass(U) lnet(U) ldiskfs(U) jbd2 crc32c_intel libcfs(U) nfsd exportfs autofs4 nfs lockd fscache auth_rpcgss nfs_acl sunrpc ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core ib_addr ipv6 microcode virtio_balloon 8139too 8139cp mii i2c_piix4 i2c_core ext3 jbd mbcache virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: speedstep_lib]
      01:48:26:
      01:48:26:Pid: 4252, comm: mdt00_002 Not tainted 2.6.32-642.13.1.el6_lustre.x86_64 #1 Red Hat KVM
      01:48:26:RIP: 0010:[<ffffffffa041fca9>]  [<ffffffffa041fca9>] cfs_hash_lookup+0x29/0xa0 [libcfs]
      01:48:26:RSP: 0018:ffff880059fa7b00  EFLAGS: 00010246
      01:48:26:RAX: 0000000000000000 RBX: ffff880041ffe490 RCX: 0000000000000000
      01:48:26:RDX: ffff880066263e20 RSI: 0000000000000000 RDI: ffff880041ffe490
      01:48:26:RBP: ffff880059fa7b40 R08: 0000000000000003 R09: ffff880066263e20
      01:48:26:R10: ffff880045058000 R11: 0000000000000400 R12: ffff880059fa7b00
      01:48:26:R13: ffff880066263e20 R14: ffff880066263e20 R15: ffff880066270bc0
      01:48:26:FS:  0000000000000000(0000) GS:ffff880002200000(0000) knlGS:0000000000000000
      01:48:26:CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
      01:48:26:CR2: 0000000000000000 CR3: 0000000037d9f000 CR4: 00000000000406f0
      01:48:26:DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
      01:48:26:DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
      01:48:26:Process mdt00_002 (pid: 4252, threadinfo ffff880059fa4000, task ffff880059fc6ab0)
      01:48:26:Stack:
      01:48:26: ffff880065eb1b40 0000000a00000000 0000000000000000 0000000000000000
      01:48:26:<d> ffff880059fa7b60 ffff880065f3cb00 ffff880066263e20 ffff880067fdfdc0
      01:48:26:<d> ffff880059fa7b80 ffffffffa0c8dc18 0000000000000150 ffff880065f3cb00
      01:48:26:Call Trace:
      01:48:26: [<ffffffffa0c8dc18>] lqe_locate+0x48/0x7b0 [lquota]
      01:48:26: [<ffffffffa0caf07b>] qmt_pool_lqe_lookup+0x1ab/0x260 [lquota]
      01:48:26: [<ffffffffa0ca5747>] qmt_set.clone.0+0x67/0x700 [lquota]
      01:48:26: [<ffffffffa080e06b>] ? lustre_pack_reply_v2+0x1eb/0x280 [ptlrpc]
      01:48:26: [<ffffffffa080ce15>] ? lustre_msg_buf+0x55/0x60 [ptlrpc]
      01:48:26: [<ffffffffa0834732>] ? __req_capsule_get+0x162/0x6e0 [ptlrpc]
      01:48:26: [<ffffffffa0ca6270>] qmt_quotactl+0x490/0x5b0 [lquota]
      01:48:26: [<ffffffffa0eeb9b1>] mdt_quotactl+0x611/0x780 [mdt]
      01:48:26: [<ffffffffa0874a3c>] tgt_request_handle+0x8ec/0x1440 [ptlrpc]
      01:48:26: [<ffffffffa081d83b>] ptlrpc_server_handle_request+0x2eb/0xbd0 [ptlrpc]
      01:48:26: [<ffffffffa0818639>] ? ptlrpc_wait_event+0xa9/0x2e0 [ptlrpc]
      01:48:26: [<ffffffffa081ebe1>] ptlrpc_main+0xac1/0x18d0 [ptlrpc]
      01:48:26: [<ffffffffa081e120>] ? ptlrpc_main+0x0/0x18d0 [ptlrpc]
      01:48:26: [<ffffffff810a640e>] kthread+0x9e/0xc0
      01:48:26: [<ffffffff8100c28a>] child_rip+0xa/0x20
      01:48:26: [<ffffffff810a6370>] ? kthread+0x0/0xc0
      01:48:26: [<ffffffff8100c280>] ? child_rip+0x0/0x20
      01:48:26:Code: 00 00 55 48 89 e5 48 83 ec 40 48 89 5d e8 4c 89 65 f0 4c 89 6d f8 0f 1f 44 00 00 48 8b 47 10 4c 8d 65 c0 48 89 fb 49 89 f5 31 f6 <ff> 10 4c 89 ee 4c 89 e2 48 89 df e8 77 e8 ff ff 31 d2 4c 89 e6 
      01:48:26:RIP  [<ffffffffa041fca9>] cfs_hash_lookup+0x29/0xa0 [libcfs]
      01:48:26: RSP <ffff880059fa7b00>
      01:48:26:CR2: 0000000000000000
      01:48:26:Initializing cgroup subsys cpuset
      01:48:26:Initializing cgroup subsys cpu
      01:48:26:Linux version 2.6.32-642.13.1.el6_lustre.x86_64 (jenkins@trevis-306-el6-x8664-1.trevis.hpdd.intel.com) (gcc version 4.4.7 20120313 (Red Hat 4.4.7-17) (GCC) ) #1 SMP Wed Apr 5 06:19:32 UTC 2017
      01:48:26:Command line: ro root=UUID=48d1ab93-ef14-4cf3-888d-bcd55104d5e8 rd_NO_LUKS rd_NO_LVM LANG=en_US.UTF-8 rd_NO_MD console=tty0 SYSFONT=latarcyrheb-sun16 KEYBOARDTYPE=pc KEYTABLE=us rd_NO_DM console=ttyS0,115200 irqpoll nr_cpus=1 reset_devices cgroup_disable=memory mce=off acpi_no_memhotplug disable_cpu_apicid=0 memmap=exactmap memmap=627K@4K memmap=131449K@49779K elfcorehdr=181228K memmap=4K$0K memmap=9K$631K memmap=64K$960K memmap=12K$2097140K memmap=272K$4194032K
      01:48:26:KERNEL supported cpus:
      01:48:26:  Intel GenuineIntel
      

        Attachments

          Issue Links

            Activity

              People

              Assignee:
              wc-triage WC Triage
              Reporter:
              sarah Sarah Liu
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

                Dates

                Created:
                Updated:
                Resolved: