[LU-1928] parallel-scale-nfsv3 subtest test_compilebench: Oops: RIP: put_page+0x9/0x40 Created: 13/Sep/12  Updated: 28/Sep/12  Resolved: 14/Sep/12

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.3.0
Fix Version/s: None

Type: Bug Priority: Blocker
Reporter: Maloo Assignee: WC Triage
Resolution: Duplicate Votes: 0
Labels: None

Issue Links:
Duplicate
duplicates LU-1881 sanity test 116 soft lockup Resolved
Severity: 3
Rank (Obsolete): 4304

 Description   

This issue was created by maloo for yujian <yujian@whamcloud.com>

This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/024da508-fd90-11e1-afe5-52540035b04c.

The sub-test test_compilebench failed with the following error:

./compilebench -D /mnt/lustre/d0.compilebench -i 2         -r 2 --makej
using working directory /mnt/lustre/d0.compilebench, 2 intial dirs 2 runs
native unpatched native-0 222MB in 181.16 seconds (1.23 MB/s)
Traceback (most recent call last):
  File "./compilebench", line 567, in <module>
    dset = dataset(options.sources, rnd)
  File "./compilebench", line 320, in __init__
    self.patched = native_order(self.patched, "patched")
  File "./compilebench", line 97, in native_order
    run_directory(tmplist, dirname, "native %s" % tag)
  File "./compilebench", line 225, in run_directory
    fp = file(fname, 'a+')
IOError: [Errno 13] Permission denied: '/mnt/lustre/d0.compilebench/native-0/arch/arm/configs/ns9xxx_defconfig'
 parallel-scale-nfsv3 test_compilebench: @@@@@@ FAIL: compilebench failed: 1 

Info required for matching: parallel-scale-nfsv3 compilebench

Lustre Build: http://build.whamcloud.com/job/lustre-b2_3/17

Console log on MDS (fat-intel-2):

Lustre: DEBUG MARKER: /usr/sbin/lctl mark == parallel-scale-nfsv3 test compilebench: compilebench == 03:11:31 \(1347531091\)
Lustre: DEBUG MARKER: == parallel-scale-nfsv3 test compilebench: compilebench == 03:11:31 (1347531091)
Lustre: ctl-lustre-MDT0000: super-sequence allocation rc = 0 [0x0000000200000400-0x0000000240000400):0:mdt
Lustre: DEBUG MARKER: /usr/sbin/lctl mark .\/compilebench -D \/mnt\/lustre\/d0.compilebench -i 2         -r 2 --makej
Lustre: DEBUG MARKER: ./compilebench -D /mnt/lustre/d0.compilebench -i 2 -r 2 --makej
BUG: unable to handle kernel NULL pointer dereference at (null)
IP: [<ffffffff8112ace9>] put_page+0x9/0x40
PGD 332173067 PUD 33213c067 PMD 0 
Oops: 0000 [#1] SMP 
last sysfs file: /sys/devices/system/cpu/cpu23/cache/index2/shared_cpu_map
CPU 13 
Modules linked in: lmv(U) nfs fscache cmm(U) osd_ldiskfs(U) mdt(U) mdd(U) mds(U) fsfilt_ldiskfs(U) mgs(U) mgc(U) ldiskfs(U) jbd2 lustre(U) lquota(U) lov(U) osc(U) mdc(U) fid(U) fld(U) ksocklnd(U) ptlrpc(U) obdclass(U) lnet(U) lvfs(U) sha512_generic sha256_generic libcfs(U) nfsd lockd nfs_acl auth_rpcgss exportfs autofs4 sunrpc cpufreq_ondemand acpi_cpufreq freq_table mperf ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 ib_sa mlx4_ib ib_mad ib_core mlx4_en mlx4_core e1000e microcode serio_raw i2c_i801 i2c_core sg iTCO_wdt iTCO_vendor_support ioatdma dca i7core_edac edac_core shpchp ext3 jbd mbcache sd_mod crc_t10dif ahci dm_mirror dm_region_hash dm_log dm_mod [last unloaded: scsi_wait_scan]

Pid: 0, comm: swapper Not tainted 2.6.32-279.5.1.el6_lustre.g634f764.x86_64 #1 Supermicro X8DTT-H/X8DTT-H
RIP: 0010:[<ffffffff8112ace9>]  [<ffffffff8112ace9>] put_page+0x9/0x40
RSP: 0018:ffff8800282e3b50  EFLAGS: 00010206
RAX: 0000000000000030 RBX: 0000000000000001 RCX: ffff8802c5cf6000
RDX: ffff8802c5cf6680 RSI: 0000000000000000 RDI: 0000000000000000
RBP: ffff8800282e3b50 R08: ffff880630580800 R09: 0000000000000000
R10: ffff88032cc7ddb8 R11: 0000000000000000 R12: ffff88032cc7dd80
R13: ffff88032cc7dd80 R14: 0000000000000008 R15: ffffffff81c05f20
FS:  0000000000000000(0000) GS:ffff8800282e0000(0000) knlGS:0000000000000000
CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
CR2: 0000000000000000 CR3: 000000032ec88000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process swapper (pid: 0, threadinfo ffff880637cfe000, task ffff880337e2a040)
Stack:
 ffff8800282e3b70 ffffffff8143030f ffff88032cc7dd80 ffffffff81470f09
<d> ffff8800282e3b90 ffffffff8142fe9e ffff880630580800 ffff88032cc7dd80
<d> ffff8800282e3bc0 ffffffff8142ffe2 ffff8802c5cf6030 ffff88032cc7dd80
Call Trace:
 <IRQ> 
 [<ffffffff8143030f>] skb_release_data+0x7f/0x110
 [<ffffffff81470f09>] ? ip_rcv_finish+0x199/0x440
 [<ffffffff8142fe9e>] __kfree_skb+0x1e/0xa0
 [<ffffffff8142ffe2>] kfree_skb+0x42/0x90
 [<ffffffff81470f09>] ip_rcv_finish+0x199/0x440
 [<ffffffff81471425>] ip_rcv+0x275/0x350
 [<ffffffff8143ac2b>] __netif_receive_skb+0x49b/0x6f0
 [<ffffffff8143cea8>] netif_receive_skb+0x58/0x60
 [<ffffffff8143cfb0>] napi_skb_finish+0x50/0x70
 [<ffffffff8143f4e9>] napi_gro_receive+0x39/0x50


 Comments   
Comment by Jian Yu [ 13/Sep/12 ]

Lustre Build: http://build.whamcloud.com/job/lustre-b2_3/17

parallel-scale-nfsv4 subtest test_compilebench hit the same issue:
https://maloo.whamcloud.com/test_sets/1404aa52-fd96-11e1-afe5-52540035b04c

Comment by Jian Yu [ 14/Sep/12 ]

This is fixed in LU-1881.

Lustre Build: http://build.whamcloud.com/job/lustre-b2_3/19

parallel-scale-nfsv3/v4 passed:

https://maloo.whamcloud.com/test_sets/1b3e8424-fe44-11e1-b4cd-52540035b04c
https://maloo.whamcloud.com/test_sets/211c7810-fe44-11e1-b4cd-52540035b04c

Generated at Sat Feb 10 01:20:55 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.