[LU-409] Oops: RIP: _spin_lock_irq+0x15/0x40 Created: 13/Jun/11  Updated: 04/Feb/13  Resolved: 25/Oct/11

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.1.0, Lustre 1.8.6
Fix Version/s: Lustre 2.1.0, Lustre 1.8.6

Type: Bug Priority: Minor
Reporter: Jian Yu Assignee: Yang Sheng
Resolution: Fixed Votes: 0
Labels: None
Environment:

Lustre Branch: v1_8_6_RC2
Lustre Build: http://newbuild.whamcloud.com/job/lustre-b1_8/80/
e2fsprogs Build: http://newbuild.whamcloud.com/job/e2fsprogs-master/40/
Distro/Arch: RHEL6/x86_64(patchless client, in-kernel OFED, kernel version: 2.6.32-131.2.1.el6)
RHEL5/x86_64(server, OFED 1.5.3.1, kernel version: 2.6.18-238.12.1.el5_lustre)


Issue Links:
Related
is related to LU-2706 blockdev_attach fails Closed
is related to LU-2707 blockdev_attach can trigger LBUG Closed
Severity: 3
Rank (Obsolete): 4271

 Description   

After mounting and unmounting a Lustre filesystem, running lustre_rmmod caused the Lustre client node to crash as follows:

BUG: unable to handle kernel NULL pointer dereference at (null)
IP: [<ffffffff814dcf35>] _spin_lock_irq+0x15/0x40
PGD 31ae08067 PUD 312eae067 PMD 0
Oops: 0002 [#1] SMP
last sysfs file: /sys/devices/system/cpu/cpu3/cache/index2/shared_cpu_map
CPU 2
Modules linked in: llite_lloop(-)(U) lustre(U) mgc(U) lov(U) osc(U) mdc(U) lquota(U) ko2iblnd(U) ptlrpc(U) obdclass(U) lvfs(U) ksocklnd(U) lnet(U) libcfs(U) nfs lockd fscache(T) nfs_acl auth_rpcgss autofs4 sunrpc cpufreq_ondemand acpi_cpufreq freq_table ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 ib_sa dm_mirror dm_region_hash dm_log mlx4_ib ib_mad ib_core mlx4_en mlx4_core igb serio_raw ghes hed i2c_i801 i2c_core sg iTCO_wdt iTCO_vendor_support ioatdma dca i7core_edac edac_core shpchp ext3 jbd mbcache sd_mod crc_t10dif ahci dm_mod [last unloaded: microcode]
Pid: 4826, comm: rmmod Tainted: G           ---------------- T 2.6.32-131.2.1.el6.x86_64 #1 X8DTT
RIP: 0010:[<ffffffff814dcf35>]  [<ffffffff814dcf35>] _spin_lock_irq+0x15/0x40
RSP: 0018:ffff880318cd9da8  EFLAGS: 00010092 
RAX: 0000000000010000 RBX: ffff880328bda000 RCX: 000000000000b1a0
RDX: 0000000000000000 RSI: ffff88031ce09a90 RDI: 0000000000000000
RBP: ffff880318cd9da8 R08: 0000000000000001 R09: ffffffff817c3f86
R10: 0000000000000001 R11: 0000000000000000 R12: ffff88031ce09800
R13: ffff880328bda000 R14: ffff88031ce0b560 R15: 0000000000000001
FS:  00007fb1de18d700(0000) GS:ffff880032e40000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000000000000 CR3: 000000031ae78000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process rmmod (pid: 4826, threadinfo ffff880318cd8000, task ffff88032123ca80)
Stack:
 ffff880318cd9dd8 ffffffff8125689c ffff880328bda000 ffff880328bda328
<0> ffff880328bda328 ffff88031ce0b560 ffff880318cd9df8 ffffffff8124ba66
<0> ffffffff81a8a820 ffff880328bda360 ffff880318cd9e28 ffffffff81264a2d
Call Trace:
 [<ffffffff8125689c>] blk_throtl_exit+0x3c/0xd0
 [<ffffffff8124ba66>] blk_release_queue+0x26/0x80
 [<ffffffff81264a2d>] kobject_release+0x8d/0x240
 [<ffffffff812649a0>] ? kobject_release+0x0/0x240
 [<ffffffff81265fd7>] kref_put+0x37/0x70
 [<ffffffff812648a7>] kobject_put+0x27/0x60  
 [<ffffffff81247687>] blk_cleanup_queue+0x57/0x70
 [<ffffffffa08070b1>] lloop_exit+0x61/0x300 [llite_lloop]
 [<ffffffff81069012>] ? put_online_cpus+0x52/0x70
 [<ffffffff810a8ef8>] ? module_refcount+0x58/0x70
 [<ffffffff810a9a74>] sys_delete_module+0x194/0x260
 [<ffffffff8100b172>] system_call_fastpath+0x16/0x1b
Code: c1 74 0e f3 90 0f b7 0f eb f5 83 3f 00 75 f4 eb df 48 89 d0 c9 c3 55 48 89 e5 0f 1f 44 00 00 fa 66 0f 1f 44 00 00 b8 00 00 01 00 <f0> 0f c1 07 0f b7 d0 c1 e8 10 39 c2 74 0e f3 90 0f b7 17 eb f5
RIP  [<ffffffff814dcf35>] _spin_lock_irq+0x15/0x40
 RSP <ffff880318cd9da8>
CR2: 0000000000000000

This failure could be easily reproduced by running llmount.sh and then llmountcleanup.sh.
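
When reading a trace like the one above, it can help to reduce it to the bare call chain. The filter below is only a sketch: the function name oops_call_chain is made up for illustration, and it assumes the RHEL6-style frame format seen in this report ("[<address>] symbol+0xoffset/0xsize"). Frames that the kernel marks with "?" are dropped as unreliable.

```shell
#!/bin/bash
# Sketch: print the function names from the "Call Trace:" part of a kernel oops.
# oops_call_chain is a hypothetical helper name, not part of any Lustre tool.

oops_call_chain() {
    # 1. keep the lines from "Call Trace:" up to the "Code:" line
    # 2. keep only reliable frames (no "?" between the address and the symbol)
    # 3. strip the address, the +offset/size suffix and any [module] tag
    sed -n '/Call Trace:/,/^Code:/p' \
        | grep -E '^ *\[<[0-9a-f]+>\] [A-Za-z_]' \
        | sed -E 's/^ *\[<[0-9a-f]+>\] ([A-Za-z0-9_.]+)\+.*/\1/'
}

# Usage (reads the log on stdin), for example:
#   dmesg | oops_call_chain
```

Fed the oops above, this lists the frames from blk_throtl_exit down to system_call_fastpath, which makes it easy to see that the fault sits under lloop_exit -> blk_cleanup_queue -> blk_release_queue -> blk_throtl_exit.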



 Comments   
Comment by Peter Jones [ 13/Jun/11 ]

YangSheng

Can you please look into this failure as your top priority?

Thanks

Peter

Comment by Andreas Dilger [ 13/Jun/11 ]

This looks at first glance to be related to the lloop virtual block device,
probably because the block device interface has changed in newer kernels
and we have not updated the llite_lloop.c code to match the new interface.

This is an unsupported feature, and if this is causing problems then I would
suggest that we simply stop loading/unloading this module from load_modules.
It looks like we already disable sanity.sh test_68, which was testing this
code.

Comment by Jian Yu [ 14/Jun/11 ]

After removing the "load_module llite/llite_lloop" line from load_modules_local() in test-framework.sh, the auster test run could proceed. Testing is now ongoing.

Comment by Yang Sheng [ 14/Jun/11 ]

This is a known issue that has been discussed on the linux-kernel mailing list:

Date:	Wed, 16 Feb 2011 18:31:14 +1100
From:	NeilBrown <neilb@suse.de>
To:	Vivek Goyal <vgoyal@redhat.com>
Cc:	Jens Axboe <jaxboe@fusionio.com>, linux-kernel@vger.kernel.org
Subject: blk_throtl_exit  taking q->queue_lock is problematic
Message-ID: <20110216183114.26a3613b@notabene.brown>



Hi,

 I recently discovered that blk_throtl_exit takes ->queue_lock when a blockdev
is finally released.

This is a problem because by that time the queue_lock doesn't exist any
more.  It is in a separate data structure controlled by the RAID personality
and by the time that the block device is being destroyed the raid personality
has shutdown and the data structure containing the lock has been freed.

This has not been a problem before.  Nothing else takes queue_lock after
blk_cleanup_queue.

I could of course set queue_lock to point to __queue_lock and initialise that,
but it seems untidy and probably violates some locking requirements.

Is there some way you could use some other lock - maybe a global lock, or
maybe use __queue_lock directly?

Thanks,
NeilBrown


A fix has already been provided and has landed upstream. Should we carry it in our own patch series, or just report it to Red Hat and wait for the next RHEL6 update release? It appears to affect only the llite_lloop module, which, as Andreas pointed out, is not in use at the moment.

Comment by Richard Henwood (Inactive) [ 15/Jun/11 ]

I believe I'm seeing this with 2.1 on RHEL6. To reproduce:

1. uname -a = 2.6.32-131.0.15.el6_lustre.x86_64
2. /usr/lib64/lustre/tests/llmount.sh
3. /usr/lib64/lustre/tests/llmountcleanup.sh

This is performed in a VM, watching dmesg over netcat.
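
The steps above can be sketched as a small script. This is an illustration only: reproduce_lu409 and LUSTRE_TESTS are hypothetical names, and the default path is simply the one quoted in the steps. On an affected kernel the last step is expected to oops the client, so it should only be run on a disposable test node or VM.

```shell
#!/bin/bash
# Sketch of the reproduction steps. WARNING: on an affected kernel this is
# expected to crash the client node during cleanup.

# LUSTRE_TESTS is an assumed variable name; the default is the path quoted above.
LUSTRE_TESTS=${LUSTRE_TESTS:-/usr/lib64/lustre/tests}

reproduce_lu409() {
    local dir=$1 script
    # Refuse to continue unless both test scripts are present and executable.
    for script in llmount.sh llmountcleanup.sh; do
        if [ ! -x "$dir/$script" ]; then
            echo "missing $dir/$script" >&2
            return 2
        fi
    done
    uname -r                        # step 1: record the running kernel
    "$dir/llmount.sh" || return 1   # step 2: set up and mount the test filesystem
    "$dir/llmountcleanup.sh"        # step 3: clean up; the oops was reported here
}

# Usage (deliberately not run automatically):
#   reproduce_lu409 "$LUSTRE_TESTS"
```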

Comment by Yang Sheng [ 15/Jun/11 ]

Hi Andreas, I think we need a decision on this issue. The options are:
1. Take the upstream fix into our kernel patch series;
2. Simply don't load the llite_lloop module.
Could you point out which direction we should pursue? TIA.

Comment by Andreas Dilger [ 16/Jun/11 ]

Yang Sheng,
since this problem only hits the client, and we do not patch the client, I don't think we can fix this problem in the kernel.

I think for the time being we should just disable the llite_lloop module for 2.6.32 kernels.
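
A minimal sketch of such a gate follows. It is not the code that landed in test-framework.sh: kernel_older_than and should_load_lloop are hypothetical names, and treating LOAD_LLOOP as an explicit opt-in flag is an assumption based on the commit subject that appears later in this ticket ("only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP").

```shell
#!/bin/bash
# Sketch of a kernel-version gate for loading llite_lloop (hypothetical helpers).

# Return 0 if kernel release $1 (e.g. from `uname -r`) is older than version $2.
kernel_older_than() {
    local have=${1%%-*}      # 2.6.32-131.2.1.el6.x86_64 -> 2.6.32
    local want=$2
    local -a hv wv
    local i h w
    IFS=. read -r -a hv <<< "$have"
    IFS=. read -r -a wv <<< "$want"
    # Compare major, minor and patch numerically, left to right.
    for i in 0 1 2; do
        h=${hv[i]:-0}
        w=${wv[i]:-0}
        if [ "$h" -lt "$w" ]; then return 0; fi
        if [ "$h" -gt "$w" ]; then return 1; fi
    done
    return 1                 # equal versions are not "older"
}

# Return 0 if llite_lloop should be loaded.
# $1 = kernel release, $2 = optional opt-in value (assumed to mirror LOAD_LLOOP).
should_load_lloop() {
    local release=$1 opt_in=${2:-}
    if [ -n "$opt_in" ]; then return 0; fi
    kernel_older_than "$release" 2.6.32
}

# Usage, for example inside a module-loading helper:
#   if should_load_lloop "$(uname -r)" "$LOAD_LLOOP"; then
#       load_module llite/llite_lloop
#   fi
```

The comparison is done on the numeric version prefix only, so distro suffixes such as "-131.2.1.el6" do not affect the result.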

Comment by Yang Sheng [ 16/Jun/11 ]

Hi Yujian, could you please push your working patch to Gerrit? That way we can save time testing it and make sure it works well.

Comment by Jian Yu [ 16/Jun/11 ]

Could you please push your working patch to gerrit? So we can save time to test it and ensure it works well.

Sure. Patch for b1_8: http://review.whamcloud.com/954.

Comment by Richard Henwood (Inactive) [ 16/Jun/11 ]

Change 954, ported to 2.1, works for me.

I'll await review of the 1.8 version before I submit a change.

Comment by Peter Jones [ 16/Jun/11 ]

Richard

This is likely to land sooner on master due to 1.8.x release testing, so if you can submit a patch for master then it could even get into the next tag.

Peter

Comment by Richard Henwood (Inactive) [ 16/Jun/11 ]

Change set for 2.1 is here:

http://review.whamcloud.com/956

Comment by Build Master (Inactive) [ 17/Jun/11 ]

Integrated in lustre-b1_8 » i686,client,el5,inkernel #83
LU-409 stop loading the llite_lloop module for >= 2.6.32 kernels

Johann Lombardi : 2ed811cb0149c805a19a278a8350202e47724d46
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 17/Jun/11 ]

Integrated in lustre-b1_8 » x86_64,server,el5,inkernel #83
LU-409 stop loading the llite_lloop module for >= 2.6.32 kernels

Johann Lombardi : 2ed811cb0149c805a19a278a8350202e47724d46
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 17/Jun/11 ]

Integrated in lustre-b1_8 » x86_64,client,ubuntu1004,inkernel #83
LU-409 stop loading the llite_lloop module for >= 2.6.32 kernels

Johann Lombardi : 2ed811cb0149c805a19a278a8350202e47724d46
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 17/Jun/11 ]

Integrated in lustre-b1_8 » i686,server,el5,ofa #83
LU-409 stop loading the llite_lloop module for >= 2.6.32 kernels

Johann Lombardi : 2ed811cb0149c805a19a278a8350202e47724d46
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 17/Jun/11 ]

Integrated in lustre-b1_8 » x86_64,client,el6,inkernel #83
LU-409 stop loading the llite_lloop module for >= 2.6.32 kernels

Johann Lombardi : 2ed811cb0149c805a19a278a8350202e47724d46
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 17/Jun/11 ]

Integrated in lustre-b1_8 » x86_64,client,el5,ofa #83
LU-409 stop loading the llite_lloop module for >= 2.6.32 kernels

Johann Lombardi : 2ed811cb0149c805a19a278a8350202e47724d46
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 17/Jun/11 ]

Integrated in lustre-b1_8 » i686,client,el5,ofa #83
LU-409 stop loading the llite_lloop module for >= 2.6.32 kernels

Johann Lombardi : 2ed811cb0149c805a19a278a8350202e47724d46
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 17/Jun/11 ]

Integrated in lustre-b1_8 » x86_64,server,el5,ofa #83
LU-409 stop loading the llite_lloop module for >= 2.6.32 kernels

Johann Lombardi : 2ed811cb0149c805a19a278a8350202e47724d46
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 17/Jun/11 ]

Integrated in lustre-b1_8 » i686,server,el5,inkernel #83
LU-409 stop loading the llite_lloop module for >= 2.6.32 kernels

Johann Lombardi : 2ed811cb0149c805a19a278a8350202e47724d46
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 17/Jun/11 ]

Integrated in lustre-b1_8 » i686,client,el6,inkernel #83
LU-409 stop loading the llite_lloop module for >= 2.6.32 kernels

Johann Lombardi : 2ed811cb0149c805a19a278a8350202e47724d46
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 19/Jun/11 ]

Integrated in lustre-b1_8 » x86_64,client,el5,inkernel #84
LU-409 stop loading the llite_lloop module for >= 2.6.32 kernels

Johann Lombardi : 2ed811cb0149c805a19a278a8350202e47724d46
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 22/Jun/11 ]

Integrated in lustre-master » x86_64,client,sles11,inkernel #180
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : ff9f95abb13642fd2a1a183e2f92f390ffdbb1ae
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 22/Jun/11 ]

Integrated in lustre-master » x86_64,client,el5,inkernel #180
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : ff9f95abb13642fd2a1a183e2f92f390ffdbb1ae
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 22/Jun/11 ]

Integrated in lustre-master » i686,client,el5,inkernel #180
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : ff9f95abb13642fd2a1a183e2f92f390ffdbb1ae
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 22/Jun/11 ]

Integrated in lustre-master » x86_64,client,ubuntu1004,inkernel #180
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : ff9f95abb13642fd2a1a183e2f92f390ffdbb1ae
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 22/Jun/11 ]

Integrated in lustre-master » x86_64,server,el5,inkernel #180
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : ff9f95abb13642fd2a1a183e2f92f390ffdbb1ae
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 22/Jun/11 ]

Integrated in lustre-master » i686,server,el5,ofa #180
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : ff9f95abb13642fd2a1a183e2f92f390ffdbb1ae
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 22/Jun/11 ]

Integrated in lustre-master » x86_64,client,el6,inkernel #180
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : ff9f95abb13642fd2a1a183e2f92f390ffdbb1ae
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 22/Jun/11 ]

Integrated in lustre-master » x86_64,client,el5,ofa #180
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : ff9f95abb13642fd2a1a183e2f92f390ffdbb1ae
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 22/Jun/11 ]

Integrated in lustre-master » i686,client,el5,ofa #180
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : ff9f95abb13642fd2a1a183e2f92f390ffdbb1ae
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 22/Jun/11 ]

Integrated in lustre-master » x86_64,server,el5,ofa #180
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : ff9f95abb13642fd2a1a183e2f92f390ffdbb1ae
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 22/Jun/11 ]

Integrated in lustre-master » i686,server,el5,inkernel #180
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : ff9f95abb13642fd2a1a183e2f92f390ffdbb1ae
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 22/Jun/11 ]

Integrated in lustre-master » x86_64,server,el6,inkernel #180
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : ff9f95abb13642fd2a1a183e2f92f390ffdbb1ae
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 22/Jun/11 ]

Integrated in lustre-master » i686,server,el6,inkernel #180
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : ff9f95abb13642fd2a1a183e2f92f390ffdbb1ae
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 22/Jun/11 ]

Integrated in lustre-master » i686,client,el6,inkernel #180
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : ff9f95abb13642fd2a1a183e2f92f390ffdbb1ae
Files :

  • lustre/tests/test-framework.sh
Comment by Peter Jones [ 22/Jun/11 ]

Landed for 2.1

Comment by Jinshan Xiong (Inactive) [ 23/Jun/11 ]

I tend to think the queue has already been cleaned up in del_gendisk(). If so, we don't need to clean it up again in lloop_exit(), and dropping that call would fix this issue:

diff --git a/lustre/llite/lloop.c b/lustre/llite/lloop.c
index 6975c85..c3b8fb0 100644
--- a/lustre/llite/lloop.c
+++ b/lustre/llite/lloop.c
@@ -878,7 +878,7 @@ static void lloop_exit(void)
         ll_iocontrol_unregister(ll_iocontrol_magic);
         for (i = 0; i < max_loop; i++) {
                 del_gendisk(disks[i]);
-                blk_cleanup_queue(loop_dev[i].lo_queue);
+//                blk_cleanup_queue(loop_dev[i].lo_queue);
                 put_disk(disks[i]);
         }
         if (ll_unregister_blkdev(lloop_major, "lloop"))
Comment by Jinshan Xiong (Inactive) [ 23/Jun/11 ]

I pushed a patch to http://review.whamcloud.com/1011; please take a look. The patch itself still needs polishing by adding a macro that checks whether it is being built for a 2.6.32+ kernel.

Comment by Build Master (Inactive) [ 23/Jun/11 ]

Integrated in lustre-master » x86_64,client,el5,inkernel #184
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : 5b840606641c3b227c451056c37a941cc13696c9
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 23/Jun/11 ]

Integrated in lustre-master » x86_64,server,el5,inkernel #184
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : 5b840606641c3b227c451056c37a941cc13696c9
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 23/Jun/11 ]

Integrated in lustre-master » x86_64,client,sles11,inkernel #184
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : 5b840606641c3b227c451056c37a941cc13696c9
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 23/Jun/11 ]

Integrated in lustre-master » x86_64,client,el5,ofa #184
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : 5b840606641c3b227c451056c37a941cc13696c9
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 23/Jun/11 ]

Integrated in lustre-master » x86_64,client,ubuntu1004,inkernel #184
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : 5b840606641c3b227c451056c37a941cc13696c9
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 23/Jun/11 ]

Integrated in lustre-master » i686,server,el5,ofa #184
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : 5b840606641c3b227c451056c37a941cc13696c9
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 23/Jun/11 ]

Integrated in lustre-master » x86_64,server,el5,ofa #184
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : 5b840606641c3b227c451056c37a941cc13696c9
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 23/Jun/11 ]

Integrated in lustre-master » i686,client,el5,inkernel #184
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : 5b840606641c3b227c451056c37a941cc13696c9
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 23/Jun/11 ]

Integrated in lustre-master » i686,client,el5,ofa #184
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : 5b840606641c3b227c451056c37a941cc13696c9
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 23/Jun/11 ]

Integrated in lustre-master » i686,server,el5,inkernel #184
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : 5b840606641c3b227c451056c37a941cc13696c9
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 23/Jun/11 ]

Integrated in lustre-master » x86_64,client,el6,inkernel #184
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : 5b840606641c3b227c451056c37a941cc13696c9
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 23/Jun/11 ]

Integrated in lustre-master » i686,server,el6,inkernel #184
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : 5b840606641c3b227c451056c37a941cc13696c9
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 23/Jun/11 ]

Integrated in lustre-master » x86_64,server,el6,inkernel #184
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : 5b840606641c3b227c451056c37a941cc13696c9
Files :

  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 23/Jun/11 ]

Integrated in lustre-master » i686,client,el6,inkernel #184
LU-409: only load llite_lloop module if kernel < 2.6.32 or LOAD_LLOOP

Oleg Drokin : 5b840606641c3b227c451056c37a941cc13696c9
Files :

  • lustre/tests/test-framework.sh
Comment by Peter Jones [ 28/Jun/11 ]

Workaround in place so landing. More correct fix is lower priority.

Comment by Sarah Liu [ 28/Jun/11 ]

I hit this problem again when running sanity test_68a with the latest master build (RHEL6/x86_64, #190):

Lustre: DEBUG MARKER: == sanity test 68a: lloop driver - basic test ========================== 14:48:58 (1309297738)
Lustre: 8193:0:(lloop.c:711:lloop_ioctl()) Enter llop_ioctl
BUG: unable to handle kernel NULL pointer dereference at (null)
IP: [<ffffffff814dcf35>] _spin_lock_irq+0x15/0x40
PGD 30e626067 PUD 30f053067 PMD 0
Oops: 0002 [#1] SMP
last sysfs file: /sys/devices/virtual/block/lloop11/range
CPU 2
Modules linked in: llite_lloop(U) ext2 lustre(U) mgc(U) lov(U) osc(U) mdc(U) lmv(U) fid(U) fld(U) lquota(U) ko2iblnd(U) ptlrpc(U) obdclass(U) lvfs(U) ksocklnd(U) lnet(U) libcfs(U) nfs lockd fscache(T) nfs_acl auth_rpcgss autofs4 sunrpc cpufreq_ondemand acpi_cpufreq freq_table ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 ib_sa dm_mirror dm_region_hash dm_log mlx4_ib ib_mad ib_core mlx4_en mlx4_core igb serio_raw ghes hed i2c_i801 i2c_core sg iTCO_wdt iTCO_vendor_support ioatdma dca i7core_edac edac_core shpchp ext3 jbd mbcache sd_mod crc_t10dif ahci dm_mod [last unloaded: microcode]

Pid: 8201, comm: rmmod Tainted: G ---------------- T 2.6.32-131.2.1.el6.x86_64 #1 X8DTT
RIP: 0010:[<ffffffff814dcf35>] [<ffffffff814dcf35>] _spin_lock_irq+0x15/0x40
RSP: 0018:ffff88030ec31da8 EFLAGS: 00010092
RAX: 0000000000010000 RBX: ffff880326822aa0 RCX: 000000000000720e
RDX: 0000000000000000 RSI: ffff88030e6e1e90 RDI: 0000000000000000
RBP: ffff88030ec31da8 R08: 000000000000000c R09: 0000000000000000
R10: 0000000000000001 R11: 0000000000000000 R12: ffff88030e6e1c00
R13: ffff880326822aa0 R14: ffff8802fefa8740 R15: 0000000000000001
FS: 00007f58858a0700(0000) GS:ffff880032e40000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000000000000 CR3: 000000030e8ae000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process rmmod (pid: 8201, threadinfo ffff88030ec30000, task ffff88030e886b00)
Stack:
ffff88030ec31dd8 ffffffff8125689c ffff880326822aa0 ffff880326822dc8
<0> ffff880326822dc8 ffff8802fefa8740 ffff88030ec31df8 ffffffff8124ba66
<0> ffffffff81a8a820 ffff880326822e00 ffff88030ec31e28 ffffffff81264a2d
Call Trace:
[<ffffffff8125689c>] blk_throtl_exit+0x3c/0xd0
[<ffffffff8124ba66>] blk_release_queue+0x26/0x80
[<ffffffff81264a2d>] kobject_release+0x8d/0x240
[<ffffffff812649a0>] ? kobject_release+0x0/0x240
[<ffffffff81265fd7>] kref_put+0x37/0x70
[<ffffffff812648a7>] kobject_put+0x27/0x60
[<ffffffff81247687>] blk_cleanup_queue+0x57/0x70
[<ffffffffa00410b1>] lloop_exit+0x61/0x2f0 [llite_lloop]
[<ffffffff81069012>] ? put_online_cpus+0x52/0x70
[<ffffffff810a8ef8>] ? module_refcount+0x58/0x70
[<ffffffff810a9a74>] sys_delete_module+0x194/0x260
[<ffffffff8100b172>] system_call_fastpath+0x16/0x1b
Code: c1 74 0e f3 90 0f b7 0f eb f5 83 3f 00 75 f4 eb df 48 89 d0 c9 c3 55 48 89 e5 0f 1f 44 00 00 fa 66 0f 1f 44 00 00 b8 00 00 01 00 <f0> 0f c1 07 0f b7 d0 c1 e8 10 39 c2 74 0e f3 90 0f b7 17 eb f5
RIP [<ffffffff814dcf35>] _spin_lock_irq+0x15/0x40
RSP <ffff88030ec31da8>
CR2: 0000000000000000
Initializing cgroup subsys cpuset
Initializing cgroup subsys cpu
Linux version 2.6.32-131.2.1.el6.x86_64 (mockbuild@x86-003.build.bos.redhat.com) (gcc version 4.4.5 20110214 (Red Hat 4.4.5-6) (GCC) ) #1 SMP Wed May 18 07:07:37 EDT 2011
Command line: ro root=UUID=e41f2282-ba65-4051-97ff-6b7f533b8a60 rd_NO_LUKS rd_NO_LVM rd_NO_MD rd_NO_DM LANG=en_US.UTF-8 SYSFONT=latarcyrheb-sun16 KEYBOARDTYPE=pc KEYTABLE=us console=ttyS0,115200 irqpoll maxcpus=1 reset_devices cgroup_disable=memory memmap=exactmap memmap=640K@0K memmap=131436K@33408K elfcorehdr=164844K memmap=104K$920K memmap=8K$3136952K memmap=56K#3136960K memmap=328K#3137016K memmap=64K$3137344K memmap=8272K$3137456K memmap=262144K$3670016K memmap=4K$4175872K memmap=4096K$4190208K
KERNEL supported cpus:
Intel GenuineIntel
AMD AuthenticAMD
Centaur CentaurHauls
BIOS-provided physical RAM map:

Comment by Sarah Liu [ 13/Jul/11 ]

Reproduced on the latest rhel6-x86_64 build, #201.

Comment by Yang Sheng [ 28/Jul/11 ]

Patch uploaded to: http://review.whamcloud.com/#change,1150

Comment by Build Master (Inactive) [ 02/Aug/11 ]

Integrated in lustre-master » x86_64,client,el5,ofa #239
LU-409 Oops: RIP: _spin_lock_irq+0x15/0x40

Oleg Drokin : 167f2a4ec9c577fcafa07ac5356708c3cc09bdea
Files :

  • lustre/tests/sanity.sh
  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 02/Aug/11 ]

Integrated in lustre-master » i686,client,el6,inkernel #239
LU-409 Oops: RIP: _spin_lock_irq+0x15/0x40

Oleg Drokin : 167f2a4ec9c577fcafa07ac5356708c3cc09bdea
Files :

  • lustre/tests/test-framework.sh
  • lustre/tests/sanity.sh
Comment by Build Master (Inactive) [ 02/Aug/11 ]

Integrated in lustre-master » x86_64,client,el5,inkernel #239
LU-409 Oops: RIP: _spin_lock_irq+0x15/0x40

Oleg Drokin : 167f2a4ec9c577fcafa07ac5356708c3cc09bdea
Files :

  • lustre/tests/test-framework.sh
  • lustre/tests/sanity.sh
Comment by Build Master (Inactive) [ 02/Aug/11 ]

Integrated in lustre-master » x86_64,server,el6,inkernel #239
LU-409 Oops: RIP: _spin_lock_irq+0x15/0x40

Oleg Drokin : 167f2a4ec9c577fcafa07ac5356708c3cc09bdea
Files :

  • lustre/tests/test-framework.sh
  • lustre/tests/sanity.sh
Comment by Build Master (Inactive) [ 02/Aug/11 ]

Integrated in lustre-master » x86_64,client,el6,inkernel #239
LU-409 Oops: RIP: _spin_lock_irq+0x15/0x40

Oleg Drokin : 167f2a4ec9c577fcafa07ac5356708c3cc09bdea
Files :

  • lustre/tests/sanity.sh
  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 02/Aug/11 ]

Integrated in lustre-master » x86_64,client,sles11,inkernel #239
LU-409 Oops: RIP: _spin_lock_irq+0x15/0x40

Oleg Drokin : 167f2a4ec9c577fcafa07ac5356708c3cc09bdea
Files :

  • lustre/tests/test-framework.sh
  • lustre/tests/sanity.sh
Comment by Build Master (Inactive) [ 02/Aug/11 ]

Integrated in lustre-master » i686,client,el5,inkernel #239
LU-409 Oops: RIP: _spin_lock_irq+0x15/0x40

Oleg Drokin : 167f2a4ec9c577fcafa07ac5356708c3cc09bdea
Files :

  • lustre/tests/test-framework.sh
  • lustre/tests/sanity.sh
Comment by Build Master (Inactive) [ 02/Aug/11 ]

Integrated in lustre-master » x86_64,server,el5,ofa #239
LU-409 Oops: RIP: _spin_lock_irq+0x15/0x40

Oleg Drokin : 167f2a4ec9c577fcafa07ac5356708c3cc09bdea
Files :

  • lustre/tests/sanity.sh
  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 02/Aug/11 ]

Integrated in lustre-master » x86_64,client,ubuntu1004,inkernel #239
LU-409 Oops: RIP: _spin_lock_irq+0x15/0x40

Oleg Drokin : 167f2a4ec9c577fcafa07ac5356708c3cc09bdea
Files :

  • lustre/tests/test-framework.sh
  • lustre/tests/sanity.sh
Comment by Build Master (Inactive) [ 02/Aug/11 ]

Integrated in lustre-master » x86_64,server,el5,inkernel #239
LU-409 Oops: RIP: _spin_lock_irq+0x15/0x40

Oleg Drokin : 167f2a4ec9c577fcafa07ac5356708c3cc09bdea
Files :

  • lustre/tests/sanity.sh
  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 02/Aug/11 ]

Integrated in lustre-master » i686,server,el6,inkernel #239
LU-409 Oops: RIP: _spin_lock_irq+0x15/0x40

Oleg Drokin : 167f2a4ec9c577fcafa07ac5356708c3cc09bdea
Files :

  • lustre/tests/sanity.sh
  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 02/Aug/11 ]

Integrated in lustre-master » i686,server,el5,ofa #239
LU-409 Oops: RIP: _spin_lock_irq+0x15/0x40

Oleg Drokin : 167f2a4ec9c577fcafa07ac5356708c3cc09bdea
Files :

  • lustre/tests/sanity.sh
  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 02/Aug/11 ]

Integrated in lustre-master » i686,server,el5,inkernel #239
LU-409 Oops: RIP: _spin_lock_irq+0x15/0x40

Oleg Drokin : 167f2a4ec9c577fcafa07ac5356708c3cc09bdea
Files :

  • lustre/tests/sanity.sh
  • lustre/tests/test-framework.sh
Comment by Build Master (Inactive) [ 02/Aug/11 ]

Integrated in lustre-master » i686,client,el5,ofa #239
LU-409 Oops: RIP: _spin_lock_irq+0x15/0x40

Oleg Drokin : 167f2a4ec9c577fcafa07ac5356708c3cc09bdea
Files :

  • lustre/tests/sanity.sh
  • lustre/tests/test-framework.sh
Comment by Yang Sheng [ 25/Oct/11 ]

Close as fixed.
