[LU-1932] sanity-quota test 1: Oops IP: [<ffffffffa068cf11>] class_export_get+0x11/0x90 [obdclass]
Created: 13/Sep/12  Updated: 01/May/13  Resolved: 01/May/13

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.3.0
Fix Version/s: Lustre 2.4.0

Type: Bug
Priority: Minor
Reporter: Minh Diep
Assignee: Keith Mannthey (Inactive)
Resolution: Cannot Reproduce
Votes: 0
Labels: USE_OFD

Severity: 3
Rank (Obsolete): 6057

 Description   

Lustre: DEBUG MARKER: /usr/sbin/lctl mark == sanity-quota test 1: Block hard limit (normal use and out of quota) ===== 07:44:04 (1347547444)
Lustre: DEBUG MARKER: == sanity-quota test 1: Block hard limit (normal use and out of quota) ===== 07:44:04 (1347547444)
Lustre: DEBUG MARKER: lctl set_param lquota.lustre-OST*.quota_btune_sz=3154944
Lustre: DEBUG MARKER: lctl set_param lquota.lustre-OST*.quota_bunit_sz=3398656
Lustre: DEBUG MARKER: /usr/sbin/lctl mark User quota (limit: 38041 kbytes)
Lustre: DEBUG MARKER: User quota (limit: 38041 kbytes)
LustreError: 7296:0:(obd_class.h:1735:obd_quota_adjust_qunit()) obd_quota_adjust_qunit: dev 2 no operation
LustreError: 7296:0:(obd_class.h:1735:obd_quota_adjust_qunit()) Skipped 1 previous similar message
LustreError: 7297:0:(lquota_lib.c:106:lquotactl_slv()) (null): Unsupported quotactl command: 800101
LustreError: 7297:0:(lquota_lib.c:106:lquotactl_slv()) Skipped 1 previous similar message
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Write ...
Lustre: DEBUG MARKER: Write ...
BUG: unable to handle kernel NULL pointer dereference at 0000000000000038
IP: [<ffffffffa068cf11>] class_export_get+0x11/0x90 [obdclass]
PGD 0
Oops: 0002 [#1] SMP
last sysfs file: /sys/devices/system/cpu/possible
CPU 0
Modules linked in: lustre(U) ofd(U) ost(U) cmm(U) mdt(U) osd_ldiskfs(U) fsfilt_ldiskfs(U) ldiskfs(U) mdd(U) mds(U) mgs(U) obdecho(U) mgc(U) lquota(U) lov(U) osc(U) mdc(U) lmv(U) fid(U) fld(U) ptlrpc(U) obdclass(U) lvfs(U) ksocklnd(U) lnet(U) libcfs(U) jbd2 sha512_generic sha256_generic nfs fscache nfsd lockd nfs_acl auth_rpcgss exportfs autofs4 sunrpc ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 ib_sa ib_mad ib_core zfs(P)(U) zcommon(P)(U) znvpair(P)(U) zavl(P)(U) zunicode(P)(U) spl(U) zlib_deflate microcode virtio_balloon 8139too 8139cp mii i2c_piix4 i2c_core ext3 jbd mbcache virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: libcfs]

Pid: 9790, comm: ll_ost_io00_008 Tainted: P --------------- 2.6.32-279.5.1.el6_lustre.g293c36b.x86_64 #1 Red Hat KVM
RIP: 0010:[<ffffffffa068cf11>] [<ffffffffa068cf11>] class_export_get+0x11/0x90 [obdclass]
RSP: 0018:ffff8800757dba70 EFLAGS: 00010282
RAX: ffff88002301c000 RBX: 0000000000000000 RCX: 0000000000000002
RDX: 0000000000000008 RSI: ffffffffa1011368 RDI: 0000000000000000
RBP: ffff8800757dba80 R08: 0000000000000246 R09: 0000000000000000
R10: ffff88002301c000 R11: 0000000000001088 R12: ffff880015f1e400
R13: 0000000000000100 R14: 0000000000000000 R15: 0000000000000001
FS: 00007fc7dc5cf700(0000) GS:ffff880002200000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000000000038 CR3: 000000007d1b9000 CR4: 00000000000006f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process ll_ost_io00_008 (pid: 9790, threadinfo ffff8800757da000, task ffff8800561f2ae0)
Stack:
0000000000000000 ffff88002301c000 ffff8800757dbac0 ffffffffa0830dea
<d> 0000000000000000 ffff880015f1e400 ffff880015f1e400 0000000000000000
<d> ffff880040df5800 ffff8800757dbc80 ffff8800757dbc30 ffffffffa0dffc77
Call Trace:
[<ffffffffa0830dea>] ptlrpc_prep_bulk_exp+0x6a/0x180 [ptlrpc]
[<ffffffffa0dffc77>] ost_brw_write+0xb67/0x1600 [ost]
[<ffffffff8127ce76>] ? vsnprintf+0x2b6/0x5f0
[<ffffffffa083920c>] ? lustre_msg_get_version+0x8c/0x100 [ptlrpc]
[<ffffffffa0839368>] ? lustre_msg_check_version+0xe8/0x100 [ptlrpc]
[<ffffffffa0e0602c>] ost_handle+0x360c/0x4850 [ost]
[<ffffffffa0fd3561>] ? libcfs_debug_msg+0x41/0x50 [libcfs]
[<ffffffffa0fcf364>] ? libcfs_id2str+0x74/0xb0 [libcfs]
[<ffffffffa08489cd>] ptlrpc_server_handle_request+0x40d/0xea0 [ptlrpc]
[<ffffffffa0fc365e>] ? cfs_timer_arm+0xe/0x10 [libcfs]
[<ffffffffa083ff67>] ? ptlrpc_wait_event+0xa7/0x2a0 [ptlrpc]
[<ffffffff810533f3>] ? __wake_up+0x53/0x70
[<ffffffffa0849fb9>] ptlrpc_main+0xb59/0x1860 [ptlrpc]
[<ffffffffa0849460>] ? ptlrpc_main+0x0/0x1860 [ptlrpc]
[<ffffffff8100c14a>] child_rip+0xa/0x20
[<ffffffffa0849460>] ? ptlrpc_main+0x0/0x1860 [ptlrpc]
[<ffffffffa0849460>] ? ptlrpc_main+0x0/0x1860 [ptlrpc]
[<ffffffff8100c140>] ? child_rip+0x0/0x20
Code: 66 2e 0f 1f 84 00 00 00 00 00 55 48 89 e5 0f 1f 44 00 00 e8 52 ff ff ff c9 c3 55 48 89 e5 53 48 83 ec 08 0f 1f 44 00 00 48 89 fb <3e> ff 47 38 f6 05 88 57 96 00 40 74 63 f6 05 7b 57 96 00 20 74
RIP [<ffffffffa068cf11>] class_export_get+0x11/0x90 [obdclass]
RSP <ffff8800757dba70>
CR2: 0000000000000038



 Comments   
Comment by Keith Mannthey (Inactive) [ 13/Sep/12 ]

Are there more logs?

Can you describe how you produced this issue, and do you see it all the time or just once?

What was the setup like?

On what part of the system was the error seen?

Comment by Minh Diep [ 13/Sep/12 ]

The setup is on VMs in the lab, with any b2_3 build. In the local.sh file, set export USE_OFD=yes and LOAD_MODULES_REMOTE=true, then run ./auster -rvs sanity-quota
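
As a rough sketch, the steps above amount to something like the following. The variable names and the auster flags are the ones quoted above; the guard around auster is an addition for illustration only, so that the snippet does nothing outside a directory that contains the script.

```shell
# Settings named above; in the real setup these lines live in local.sh.
export USE_OFD=yes
LOAD_MODULES_REMOTE=true

# Run the sanity-quota suite with the flags quoted above.
# The -x guard is an assumption added for this sketch, not part of the setup.
if [ -x ./auster ]; then
    ./auster -rvs sanity-quota
fi
```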

Let me know if you want to access one of my systems to debug.

Comment by Keith Mannthey (Inactive) [ 14/Sep/12 ]

I will try on my local system first. I will let you know if I need one of yours.

Comment by Keith Mannthey (Inactive) [ 23/Jan/13 ]

Has this issue been seen with Master post 2.3?

Comment by Keith Mannthey (Inactive) [ 01/May/13 ]

I have looked through Master runs for the last while and I don't see any sign of this error in sanity-quota test 1. I am going to close this; please reopen if it is hit again on something other than the 2.3 branch.
