[LU-1529] kernel BUG at mm/slab.c:28333 Created: 15/Jun/12  Updated: 18/Jun/12  Resolved: 15/Jun/12

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.3.0
Fix Version/s: None

Type: Bug Priority: Major
Reporter: Cliff White (Inactive) Assignee: Zhenyu Xu
Resolution: Duplicate Votes: 0
Labels: None
Environment:

Hyperion - RHEL6 servers and clients


Issue Links:
Duplicate
duplicates LU-1513 Test failure on test lustre-initializ... Closed
Severity: 3
Rank (Obsolete): 6381

 Description   

When attempting to mount, system crashes hard, very repeatable:

2012-06-15 07:36:39 ------------[ cut here ]------------
2012-06-15 07:36:39 kernel BUG at mm/slab.c:2833!
2012-06-15 07:36:39 invalid opcode: 0000 [#1] SMP
2012-06-15 07:36:39 last sysfs file: /sys/module/lquota/initstate
2012-06-15 07:36:39 CPU 13
2012-06-15 07:36:39 Modules linked in: obdfilter(U) fsfilt_ldiskfs(U) exportfs ost(U) mgc(U) ldiskfs(U) mbcache jbd2 lustre(U) lquota(U) lov(U) osc(U) mdc(U) fid(U) fld(U) ko2iblnd(U) ptlrpc(U) obdclass(U) lnet(U) lvfs(U) libcfs(U) ib_srp scsi_transport_srp cpufreq_ondemand acpi_cpufreq freq_table mperf ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ib_sa mlx4_ib ib_mad ib_core dm_mirror dm_region_hash dm_log dm_mod vhost_net macvtap macvlan tun kvm_intel kvm sg sd_mod crc_t10dif serio_raw i2c_i801 i2c_core ata_generic pata_acpi ata_piix iTCO_wdt iTCO_vendor_support ioatdma i7core_edac edac_core qla2xxx scsi_transport_fc scsi_tgt ipv6 nfs lockd fscache nfs_acl auth_rpcgss sunrpc igb dca mlx4_en mlx4_core [last unloaded: scsi_wait_scan]
2012-06-15 07:36:39
2012-06-15 07:36:39 Pid: 12562, comm: llog_process_th Not tainted 2.6.32-220.17.1.el6_lustre.x86_64 #1 Supermicro X8DTG-D/X8DTG-D
2012-06-15 07:36:39 RIP: 0010:[<ffffffff8115e7b3>]  [<ffffffff8115e7b3>] cache_grow+0x313/0x320
2012-06-15 07:36:39 RSP: 0018:ffff8802ed00b990  EFLAGS: 00010002
2012-06-15 07:36:39 RAX: ffff88063fc00800 RBX: ffff88033fd10440 RCX: 0000000000000000
2012-06-15 07:36:39 RDX: 0000000000000001 RSI: 0000000000041252 RDI: ffff88033fd10440
2012-06-15 07:36:39 RBP: ffff8802ed00b9f0 R08: 0000000000000246 R09: 00000000fffffffe
2012-06-15 07:36:39 R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000041252
2012-06-15 07:36:39 R13: ffff88063fc007c0 R14: 000000000000000c R15: 0000000000000000
2012-06-15 07:36:39 FS:  00002aaaab05db20(0000) GS:ffff88034aca0000(0000) knlGS:0000000000000000
2012-06-15 07:36:39 CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
2012-06-15 07:36:39 CR2: 00007ffff7af8450 CR3: 0000000001a85000 CR4: 00000000000006e0
2012-06-15 07:36:39 DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
2012-06-15 07:36:39 DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
2012-06-15 07:36:39 Process llog_process_th (pid: 12562, threadinfo ffff8802ed00a000, task ffff8802ed018b40)
2012-06-15 07:36:39 Stack:
2012-06-15 07:36:39  000000000000008e 0100000400000020 4fdb48770000000d 000000000001dce8
2012-06-15 07:36:39 <0> 0000311200000000 000000af00000000 ffff880340021b48 ffff88033fd10440
2012-06-15 07:36:39 <0> ffff880637c1b1c0 ffff88063fc007c0 000000000000000c ffff88063fc007e0
2012-06-15 07:36:39 Call Trace:
2012-06-15 07:36:39  [<ffffffff8115e9c2>] cache_alloc_refill+0x202/0x240
2012-06-15 07:36:39  [<ffffffffa03e1be0>] ? cfs_alloc+0x30/0x60 [libcfs]
2012-06-15 07:36:39  [<ffffffff8115f6e9>] __kmalloc+0x1a9/0x220
2012-06-15 07:36:39  [<ffffffffa03e1be0>] cfs_alloc+0x30/0x60 [libcfs]
2012-06-15 07:36:39  [<ffffffffa0cbb89e>] filter_common_setup+0xde/0x13f0 [obdfilter]
2012-06-15 07:36:39  [<ffffffffa0cbd1d0>] filter_setup+0x620/0xa20 [obdfilter]
2012-06-15 07:36:39  [<ffffffffa05248f4>] obd_setup+0x1b4/0x2f0 [obdclass]
2012-06-15 07:36:39  [<ffffffffa051031b>] ? class_new_export+0x73b/0x970 [obdclass]
2012-06-15 07:36:39  [<ffffffffa0524c38>] class_setup+0x208/0x890 [obdclass]
2012-06-15 07:36:39  [<ffffffffa052bd9c>] class_process_config+0xbec/0x1c20 [obdclass]
2012-06-15 07:36:39  [<ffffffffa03e1be0>] ? cfs_alloc+0x30/0x60 [libcfs]
2012-06-15 07:36:39  [<ffffffffa05265a3>] ? lustre_cfg_new+0x353/0x7e0 [obdclass]
2012-06-15 07:36:39  [<ffffffffa052de7b>] class_config_llog_handler+0x9bb/0x1610 [obdclass]
2012-06-15 07:36:39  [<ffffffffa04fd5b0>] ? llog_lvfs_next_block+0x2d0/0x650 [obdclass]
2012-06-15 07:36:39  [<ffffffffa04f7940>] ? llog_process_thread+0x0/0xd00 [obdclass]
2012-06-15 07:36:39  [<ffffffffa04f81c8>] llog_process_thread+0x888/0xd00 [obdclass]
2012-06-15 07:36:39  [<ffffffffa04f7940>] ? llog_process_thread+0x0/0xd00 [obdclass]
2012-06-15 07:36:39  [<ffffffff8100c14a>] child_rip+0xa/0x20
2012-06-15 07:36:39  [<ffffffffa04f7940>] ? llog_process_thread+0x0/0xd00 [obdclass]
2012-06-15 07:36:39  [<ffffffffa04f7940>] ? llog_process_thread+0x0/0xd00 [obdclass]
2012-06-15 07:36:39  [<ffffffff8100c140>] ? child_rip+0x0/0x20
2012-06-15 07:36:39 Code: 0f 1f 84 00 00 00 00 00 49 8d 54 24 30 48 c7 c0 fc ff ff ff 48 89 55 c8 e9 e1 fe ff ff 0f 0b eb fe ba 01 00 00 00 e9 2a fe ff ff <0f> 0b eb fe 66 0f 1f 84 00 00 00 00 00 55 48 89 e5 41 57 41 56
2012-06-15 07:36:39 RIP  [<ffffffff8115e7b3>] cache_grow+0x313/0x320
2012-06-15 07:36:39  RSP <ffff8802ed00b990>
2012-06-15 07:36:39 Initializing cgroup subsys cpuset
~                                                      


 Comments   
Comment by Andreas Dilger [ 15/Jun/12 ]

Unfortunately, we just became aware of this issue last night.

Comment by Peter Jones [ 18/Jun/12 ]

Bobijam

Could you please look into this one?

Thanks

Peter

Generated at Sat Feb 10 01:17:27 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.