[LU-616] Oops: RIP: libcfs:cfs_tracefile_dump_all_pages+0x1d2/0x2f0 Created: 22/Aug/11  Updated: 28/May/17  Resolved: 28/May/17

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.1.0
Fix Version/s: None

Type: Bug Priority: Critical
Reporter: Jian Yu Assignee: WC Triage
Resolution: Cannot Reproduce Votes: 0
Labels: None
Environment:

Lustre Tag: v2_1_0_0_RC0
Lustre Build: http://newbuild.whamcloud.com/job/lustre-master/267/
Distro/Arch: RHEL5/x86_64 (kernel version: 2.6.18-238.19.1.el5)


Severity: 3
Rank (Obsolete): 4017

 Description   

While running conf-sanity test 33a, one client (client-12-ib) crashed as follows:

Unable to handle kernel NULL pointer dereference at 0000000000000000 RIP: 
 [<ffffffff8881d4b2>] :libcfs:cfs_tracefile_dump_all_pages+0x1d2/0x2f0
PGD 321320067 PUD 324636067 PMD 0 
Oops: 0000 [1] SMP 
last sysfs file: /devices/pci0000:00/0000:00:01.0/0000:01:00.1/irq
CPU 2 
Modules linked in: libcfs(U) nfs fscache nfs_acl autofs4 hidp rfcomm l2cap bluetooth lockd sunrpc cpufreq_ondemand acpi_cpufreq freq_table mperf be2iscsi iscsi_tcp bnx2i cnic uio cxgb3i iw_cxgb3 cxgb3 libiscsi_tcp ib_iser libiscsi2 scsi_transport_iscsi2 scsi_transport_iscsi ib_srp rds ib_sdp ib_ipoib ipoib_helper ipv6 xfrm_nalgo crypto_api rdma_ucm rdma_cm ib_ucm ib_uverbs ib_umad ib_cm iw_cm ib_addr ib_sa loop dm_mirror dm_multipath scsi_dh video backlight sbs power_meter hwmon i2c_ec dell_wmi wmi button battery asus_acpi acpi_memhotplug ac parport_pc lp parport mlx4_ib ib_mad ib_core mlx4_en joydev sg igb 8021q shpchp i2c_i801 serio_raw pcspkr i2c_core mlx4_core dca i7core_edac edac_mc tpm_tis tpm tpm_bios dm_raid45 dm_message dm_region_hash dm_log dm_mod dm_mem_cache ahci libata sd_mod scsi_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd
Pid: 21134, comm: lctl Tainted: G      2.6.18-238.19.1.el5 #1
RIP: 0010:[<ffffffff8881d4b2>]  [<ffffffff8881d4b2>] :libcfs:cfs_tracefile_dump_all_pages+0x1d2/0x2f0
RSP: 0018:ffff81030e385de8  EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff810321e71500 RCX: 0000000000001fff
RDX: 0000000000000fbb RSI: 0000000000000000 RDI: ffff81010aaf8b00
RBP: ffff810321e717e0 R08: ffffffff8001831d R09: ffff81033eb3cc00
R10: 0000000000000000 R11: 0000000000000000 R12: ffff810321e71508
R13: ffff81033e1ce480 R14: ffff81030e385df8 R15: ffff81033e1ce4b8
FS:  00002b85d34e4540(0000) GS:ffff81010b763ec0(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000000000000 CR3: 0000000313ecd000 CR4: 00000000000006e0
Process lctl (pid: 21134, threadinfo ffff81030e384000, task ffff810323d79080)
Stack:  00000000ffffffd8 ffff810000000000 ffff810321e71508 ffff810321e715c8
 0000000100000001 00000f9c0000001c 0000000000000020 00000000ffffffea
 000000000000001f 0000000000000020 00007fff58de05d0 0000000000000001
Call Trace:
 [<ffffffff8881d630>] :libcfs:cfs_trace_dump_debug_buffer_usrstr+0x60/0xa0
 [<ffffffff8001ebea>] __dentry_open+0x101/0x1dc
 [<ffffffff88816854>] :libcfs:proc_call_handler+0x24/0x60
 [<ffffffff8009745d>] do_rw_proc+0xcb/0x126
 [<ffffffff80016b71>] vfs_write+0xce/0x174
 [<ffffffff8001743e>] sys_write+0x45/0x6e
 [<ffffffff8005d28d>] tracesys+0xd5/0xe0


Code: 48 8b 06 48 be 00 00 00 00 00 81 ff ff 89 d2 4c 89 f9 48 83 
RIP  [<ffffffff8881d4b2>] :libcfs:cfs_tracefile_dump_all_pages+0x1d2/0x2f0
 RSP <ffff81030e385de8>
CR2: 0000000000000000
 <0>Kernel panic - not syncing: Fatal exception

Maloo report: https://maloo.whamcloud.com/test_sets/20461fce-ccbb-11e0-8d02-52540025f9af



 Comments   
Comment by Jian Yu [ 23/Aug/11 ]

Lustre Tag: v2_1_0_0_RC0
Lustre Build: http://newbuild.whamcloud.com/job/lustre-master/267/
Distro/Arch: RHEL6/x86_64(server), SLES11/x86_64(client)

The same failure occurred: https://maloo.whamcloud.com/test_sets/e835a9be-cd38-11e0-8d02-52540025f9af

Comment by Andreas Dilger [ 28/May/17 ]

Close old issue.

Generated at Sat Feb 10 01:08:47 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.