[LU-8244] BUG: unable to handle kernel paging request at remove_proc_subtree
Created: 07/Jun/16  Updated: 18/Jun/21  Resolved: 18/Jun/21

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.9.0
Fix Version/s: None

Type: Bug
Priority: Minor
Reporter: Andreas Dilger
Assignee: WC Triage
Resolution: Cannot Reproduce
Votes: 0
Labels: None
Environment:

Parallels VM: RHEL7.2 kernel-3.10.0-327.13.1.el7_lustre
Lustre: 2.8.53_27_gae6fc01-3.10.0-327.13.1.el7_lustre.x86_64


Issue Links:
Related
is related to LU-4532 Test failure on test suite sanity, su... Resolved
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

I hit this crash twice while running Lustre and kernel RPMs built from master commit ae6fc0156d1. It appears to happen during the cleanup at the start of conf-sanity.sh. This is the first time I have run any tests in this VM, so it may be caused by a configuration problem.

[17289.106395] Lustre: DEBUG MARKER: -----============= acceptance-small: conf-sanity ============----- Mon Jun 6 22:44:28 MDT 2016
[17290.211370] Lustre: DEBUG MARKER: excepting tests: 32newtarball 84
[17290.235576] Lustre: DEBUG MARKER: skipping tests SLOW=no: 45 69
[17290.326440] BUG: unable to handle kernel paging request at 000000200000003f
[17290.326703] IP: [<ffffffff8124fd87>] remove_proc_subtree+0x67/0x190
[17290.326870] PGD 0
[17290.327025] Oops: 0000 [#1] SMP
[17290.335098] CPU: 1 PID: 29352 Comm: umount Tainted: P           OE  ------------   3.10.0-327.13.1.el7_lustre.x86_64 #1
[17290.335278] RIP: 0010:[<ffffffff8124fd87>]  [<ffffffff8124fd87>] remove_proc_subtree+0x67/0x190
[17290.336912] Call Trace:
[17290.336987]  [<ffffffff8124fecb>] proc_remove+0x1b/0x20
[17290.337079]  [<ffffffffa0631d65>] lprocfs_remove+0x15/0x20 [obdclass]
[17290.337155]  [<ffffffffa0ff2236>] osp_device_fini+0x186/0x540 [osp]
[17290.337230]  [<ffffffffa064737c>] class_cleanup+0x8dc/0xd70 [obdclass]
[17290.337321]  [<ffffffffa0649d5c>] class_process_config+0x1e2c/0x2f70 [obdclass]
[17290.337542]  [<ffffffffa064af8f>] class_manual_cleanup+0xef/0x810 [obdclass]
[17290.337608]  [<ffffffffa0ff1268>] osp_obd_disconnect+0x178/0x210 [osp]
[17290.337674]  [<ffffffffa0f93ef8>] lod_putref+0x258/0x960 [lod]
[17290.337738]  [<ffffffffa0f9608d>] lod_fini_tgt+0xdd/0x3a0 [lod]
[17290.337802]  [<ffffffffa0f8ac70>] lod_device_fini+0x60/0x1f0 [lod]
[17290.337875]  [<ffffffffa064737c>] class_cleanup+0x8dc/0xd70 [obdclass]
[17290.337949]  [<ffffffffa0649d5c>] class_process_config+0x1e2c/0x2f70 [obdclass]
[17290.338159]  [<ffffffffa064af8f>] class_manual_cleanup+0xef/0x810 [obdclass]
[17290.338224]  [<ffffffffa0f8abff>] lod_obd_disconnect+0x1bf/0x1d0 [lod]
[17290.338289]  [<ffffffffa0e820ba>] mdd_process_config+0x2ba/0x5c0 [mdd]
[17290.338359]  [<ffffffffa0edc4a3>] mdt_stack_fini+0x1b3/0xe00 [mdt]
[17290.338420]  [<ffffffffa0edd816>] mdt_device_fini+0x726/0xfc0 [mdt]
[17290.338496]  [<ffffffffa064737c>] class_cleanup+0x8dc/0xd70 [obdclass]
[17290.338569]  [<ffffffffa0649d5c>] class_process_config+0x1e2c/0x2f70 [obdclass]
[17290.338776]  [<ffffffffa064af8f>] class_manual_cleanup+0xef/0x810 [obdclass]
[17290.338854]  [<ffffffffa067c26e>] server_put_super+0x8de/0xcd0 [obdclass]
[17290.338917]  [<ffffffff811e0ad6>] generic_shutdown_super+0x56/0xe0
[17290.338976]  [<ffffffff811e0eb2>] kill_anon_super+0x12/0x20
[17290.339049]  [<ffffffffa064e6f2>] lustre_kill_super+0x32/0x50 [obdclass]
[17290.339109]  [<ffffffff811e1269>] deactivate_locked_super+0x49/0x60
[17290.339168]  [<ffffffff811e1866>] deactivate_super+0x46/0x60
[17290.339229]  [<ffffffff811fe7d5>] mntput_no_expire+0xc5/0x120
[17290.339289]  [<ffffffff811ff90f>] SyS_umount+0x9f/0x3c0
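
To compare this stack against another report such as LU-4532, it can help to reduce each frame to its function, offset, and module. The following is a minimal Python sketch, not part of the original report: the regular expression is an assumption about the frame layout seen above, the helper name `parse_frames` is hypothetical, and the label `vmlinux` for frames without a module tag is only a convention chosen here.

```python
import re

# Three frames copied from the call trace above; the remaining frames
# use the same layout and can be appended unchanged.
TRACE = """\
[17290.336987]  [<ffffffff8124fecb>] proc_remove+0x1b/0x20
[17290.337079]  [<ffffffffa0631d65>] lprocfs_remove+0x15/0x20 [obdclass]
[17290.337155]  [<ffffffffa0ff2236>] osp_device_fini+0x186/0x540 [osp]
"""

# Assumed frame layout: [timestamp] [<address>] func+0xOFF/0xSIZE [module]
FRAME_RE = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+"
    r"\[<(?P<addr>[0-9a-f]+)>\]\s+"
    r"(?P<func>[\w.]+)\+0x(?P<off>[0-9a-f]+)/0x(?P<size>[0-9a-f]+)"
    r"(?:\s+\[(?P<module>\w+)\])?"
)


def parse_frames(text):
    """Return one dict per stack frame found in `text` (hypothetical helper)."""
    frames = []
    for line in text.splitlines():
        m = FRAME_RE.search(line)
        if m is None:
            continue  # skip lines that are not stack frames
        frames.append({
            "func": m.group("func"),
            "offset": int(m.group("off"), 16),
            "size": int(m.group("size"), 16),
            # Frames without a [module] tag belong to the core kernel image.
            "module": m.group("module") or "vmlinux",
        })
    return frames


frames = parse_frames(TRACE)
for f in frames:
    print(f"{f['module']:>8}  {f['func']}+{f['offset']:#x}/{f['size']:#x}")
```

Comparing only the `func` and `module` fields of two traces avoids false mismatches from load addresses, which differ between boots for module code.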


 Comments   
Comment by Andreas Dilger [ 07/Jun/16 ]

LU-4532 has the same stack.

Comment by Andreas Dilger [ 08/Jun/16 ]

I checked the other crash, and it had the same bad pointer:

BUG: unable to handle kernel paging request at 000000200000003f
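
As a rough aid to reading that address, the Python sketch below splits it into 32-bit halves and checks which half of the x86_64 address space it falls in. It is an illustration only: the helper names are hypothetical, 48-bit virtual addressing is assumed, and the ticket does not confirm any interpretation of the value.

```python
# Faulting address from the "unable to handle kernel paging request" lines.
FAULT_ADDR = 0x000000200000003F


def is_canonical_x86_64(addr):
    """True if bits 63..47 are all equal (assumes 48-bit virtual addresses)."""
    top = addr >> 47
    return top == 0 or top == 0x1FFFF


def is_kernel_half(addr):
    """True if the address lies in the upper (kernel) half of the address space."""
    return addr >= 0xFFFF800000000000


high32 = FAULT_ADDR >> 32         # upper 32 bits of the faulting address
low32 = FAULT_ADDR & 0xFFFFFFFF   # lower 32 bits of the faulting address

print(f"high32={high32:#x} low32={low32:#x}")
print("canonical:", is_canonical_x86_64(FAULT_ADDR))
print("kernel half:", is_kernel_half(FAULT_ADDR))
```

The address is canonical but lies in the lower half, unlike the 0xffffffff... kernel addresses in the trace, so it does not look like a valid kernel pointer. The two halves are small integers (0x20 and 0x3f), which might indicate that the dereferenced pointer field held non-pointer data. That is only one possible reading, and the faulting address may also include a structure field offset added to the bad pointer.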

Comment by Andreas Dilger [ 18/Jun/21 ]

Have not seen this since initial reports.
