Details
-
Bug
-
Resolution: Unresolved
-
Blocker
-
None
-
Lustre 2.8.0
-
None
-
Hyperion/SWL 2.7.61 tag lustre-reviews build 35536
-
3
-
9223372036854775807
Description
Hard crash of the MDS while running SWL:
Nov 5 23:08:09 iws10 kernel: ------------[ cut here ]------------
Nov 5 23:08:09 iws10 kernel: kernel BUG at mm/slab.c:3069!
Nov 5 23:08:09 iws10 kernel: invalid opcode: 0000 [#1] SMP
Nov 5 23:08:09 iws10 kernel: last sysfs file: /sys/devices/pci0000:00/0000:00:02.0/0000:02:00.0/host7/scsi_host/host7/local_ib_port
Nov 5 23:08:09 iws10 kernel: CPU 3
Nov 5 23:08:09 iws10 kernel: Modules linked in: osp(U) mdd(U) lod(U) mdt(U) lfsck(U) mgs(U) mgc(U) osd_zfs(U) lquota(U) lustre(U) lov(U) mdc(U) fid(U) lmv(U) fld(U) ptlrpc(U) obdclass(U) zfs(P)(U) dm_round_robin zcommon(P)(U) znvpair(P)(U) spl(U) zlib_deflate zavl(P)(U) zunicode(P)(U) scsi_dh_rdac sg sd_mod crc_t10dif ko2iblnd(U) lnet(U) sha512_generic crc32c_intel libcfs(U) ib_srp scsi_transport_srp scsi_tgt ipmi_devintf ipmi_si ipmi_msghandler cpufreq_ondemand acpi_cpufreq freq_table mperf ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm mlx4_ib ib_sa ib_mad ib_core ib_addr dm_mirror dm_region_hash dm_log dm_multipath dm_mod vhost_net macvtap macvlan tun kvm uinput microcode iTCO_wdt iTCO_vendor_support sb_edac edac_core joydev ahci isci libsas scsi_transport_sas wmi lpc_ich mfd_core i2c_i801 ioatdma ipv6 nfs lockd fscache auth_rpcgss nfs_acl sunrpc igb dca i2c_algo_bit i2c_core mlx4_en ptp pps_core mlx4_core [last unloaded: scsi_wait_scan]
Nov 5 23:08:09 iws10 kernel:
Nov 5 23:08:09 iws10 kernel: Pid: 25574, comm: mdt00_029 Tainted: P -- ------------ 2.6.32-573.7.1.el6_lustre.g36c9bfe.x86_64 #1 appro 512x/S2600JF
Nov 5 23:08:09 iws10 kernel: RIP: 0010:[<ffffffff81177b24>] [<ffffffff81177b24>] cache_alloc_refill+0x1e4/0x240
Nov 5 23:08:09 iws10 kernel: RSP: 0018:ffff880ff5e67840 EFLAGS: 00010082
Nov 5 23:08:09 iws10 kernel: RAX: 0000000000000004 RBX: ffff88083fd705c0 RCX: 00000000ffffffff
Nov 5 23:08:09 iws10 kernel: RDX: 000000000000623c RSI: 0000000000000000 RDI: ffff88083fc21b00
Nov 5 23:08:09 iws10 kernel: RBP: ffff880ff5e678a0 R08: 0000000000000246 R09: 00000000fffffffe
Nov 5 23:08:09 iws10 kernel: R10: 0000000000000001 R11: 0000000000000000 R12: ffff88083424ad40
Nov 5 23:08:09 iws10 kernel: R13: ffff88083fc21ac0 R14: 0000000000000004 R15: ffff88068392d540
Nov 5 23:08:09 iws10 kernel: FS: 0000000000000000(0000) GS:ffff880048660000(0000) knlGS:0000000000000000
Nov 5 23:08:09 iws10 kernel: CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
Nov 5 23:08:09 iws10 kernel: CR2: 000000000065e7f8 CR3: 0000000001a8d000 CR4: 00000000000407e0
Nov 5 23:08:09 iws10 kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Nov 5 23:08:09 iws10 kernel: DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Nov 5 23:08:09 iws10 kernel: Process mdt00_029 (pid: 25574, threadinfo ffff880ff5e64000, task ffff880ff43c2ab0)
Nov 5 23:08:09 iws10 kernel: Stack:
Nov 5 23:08:09 iws10 kernel: ffff880f00000010 00000000f5e678a0 ffff88083fc21b00 00049250fffffffe
Nov 5 23:08:09 iws10 kernel: <d> ffff88083fc21ae0 ffff88083fc21ad0 ffff8807fefd49c0 0000000000008000
Nov 5 23:08:09 iws10 kernel: <d> 0000000000000010 0000000000008250 ffff88083fd705c0 ffffffffa093dad2
Nov 5 23:08:09 iws10 kernel: Call Trace:
Nov 5 23:08:09 iws10 kernel: [<ffffffffa093dad2>] ? llog_init_handle+0x72/0xb10 [obdclass]
Nov 5 23:08:09 iws10 kernel: [<ffffffff81178889>] __kmalloc+0x1b9/0x230
Nov 5 23:08:09 iws10 kernel: [<ffffffffa093dad2>] llog_init_handle+0x72/0xb10 [obdclass]
Nov 5 23:08:09 iws10 kernel: [<ffffffffa0944531>] llog_cat_new_log+0x3e1/0xe50 [obdclass]
Nov 5 23:08:09 iws10 kernel: [<ffffffffa09453c6>] llog_cat_declare_add_rec+0x426/0x430 [obdclass]
Nov 5 23:08:09 iws10 kernel: [<ffffffffa093c06f>] llog_declare_add+0x7f/0x1b0 [obdclass]
Nov 5 23:08:09 iws10 kernel: [<ffffffffa0c3614c>] top_trans_start+0x17c/0x9b0 [ptlrpc]
Nov 5 23:08:09 iws10 kernel: [<ffffffffa12cf7a1>] lod_trans_start+0x61/0x70 [lod]
Nov 5 23:08:09 iws10 kernel: [<ffffffffa1377b64>] mdd_trans_start+0x14/0x20 [mdd]
Nov 5 23:08:09 iws10 kernel: [<ffffffffa135e0df>] mdd_unlink+0x65f/0xee0 [mdd]
Nov 5 23:08:09 iws10 kernel: [<ffffffffa04b3b61>] ? libcfs_debug_msg+0x41/0x50 [libcfs]
Nov 5 23:08:09 iws10 kernel: [<ffffffffa12168e8>] mdo_unlink+0x18/0x50 [mdt]
Nov 5 23:08:09 iws10 kernel: [<ffffffffa121f90c>] mdt_reint_unlink+0xb2c/0xff0 [mdt]
Nov 5 23:08:09 iws10 kernel: [<ffffffffa121697d>] mdt_reint_rec+0x5d/0x200 [mdt]
Nov 5 23:08:09 iws10 kernel: [<ffffffffa120277b>] mdt_reint_internal+0x62b/0xb80 [mdt]
Nov 5 23:08:09 iws10 kernel: [<ffffffffa120316b>] mdt_reint+0x6b/0x120 [mdt]
Nov 5 23:08:09 iws10 kernel: [<ffffffffa0c200ec>] tgt_request_handle+0x8bc/0x12e0 [ptlrpc]
Nov 5 23:08:09 iws10 kernel: [<ffffffffa0bc79e1>] ptlrpc_main+0xe41/0x1910 [ptlrpc]
Nov 5 23:08:09 iws10 kernel: [<ffffffffa0bc6ba0>] ? ptlrpc_main+0x0/0x1910 [ptlrpc]
Nov 5 23:08:09 iws10 kernel: [<ffffffff810a0fce>] kthread+0x9e/0xc0
Nov 5 23:08:09 iws10 kernel: [<ffffffff8100c28a>] child_rip+0xa/0x20
Nov 5 23:08:09 iws10 kernel: [<ffffffff810a0f30>] ? kthread+0x0/0xc0
Nov 5 23:08:09 iws10 kernel: [<ffffffff8100c280>] ? child_rip+0x0/0x20
Nov 5 23:08:09 iws10 kernel: Code: 89 ff e8 f0 c7 12 00 eb 99 66 0f 1f 44 00 00 41 c7 45 60 01 00 00 00 4d 8b 7d 20 4c 39 7d c0 0f 85 f2 fe ff ff eb 84 0f 0b eb fe <0f> 0b 66 2e 0f 1f 84 00 00 00 00 00 eb f4 8b 55 ac 8b 75 bc 31
Nov 5 23:08:09 iws10 kernel: RIP [<ffffffff81177b24>] cache_alloc_refill+0x1e4/0x240
Nov 5 23:08:09 iws10 kernel: RSP <ffff880ff5e67840>
Nov 5 23:08:09 iws10 kernel: ---[ end trace 5e2fb88ba6d38398 ]---