[LU-15097] ASSERTION( list_empty(&qmt->qmt_pool_list) Created: 13/Oct/21  Updated: 22/Aug/23  Resolved: 06/Jan/22

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Upstream
Fix Version/s: Lustre 2.15.0

Type: Bug Priority: Minor
Reporter: Alex Zhuravlev Assignee: Sergey Cheremencev
Resolution: Fixed Votes: 0
Labels: None

Issue Links:
Related
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

Lustre: DEBUG MARKER: == ost-pools test 27: Race pool_list and pool_remove ===== 09:49:59 (1634118599)

Trace:

PID: 57686  TASK: ffff880122776000  CPU: 1   COMMAND: "umount"
 #0 [ffff88011750f900] panic at ffffffff810af881
    /tmp/kernel/kernel/panic.c: 265
 #1 [ffff88011750f990] qmt_pool_fini at ffffffffa09d7126 [lquota]
    /home/lustre/master-mine/lustre/quota/qmt_pool.c: 462
 #2 [ffff88011750f9b8] qmt_device_fini at ffffffffa09c2174 [lquota]
    /home/lustre/master-mine/lustre/quota/qmt_dev.c: 104
 #3 [ffff88011750f9d0] class_cleanup at ffffffffa017e483 [obdclass]
    /home/lustre/master-mine/lustre/include/obd_class.h: 597
 #4 [ffff88011750fa48] class_process_config at ffffffffa017f81a [obdclass]
    /home/lustre/master-mine/libcfs/include/libcfs/libcfs_debug.h: 154
 #5 [ffff88011750faf8] class_manual_cleanup at ffffffffa018299c [obdclass]
    /home/lustre/master-mine/lustre/obdclass/obd_config.c: 2277
 #6 [ffff88011750fb90] qmt_device_obd_disconnect at ffffffffa09c1e86 [lquota]
    /home/lustre/master-mine/libcfs/include/libcfs/libcfs_debug.h: 154
 #7 [ffff88011750fba8] mdt_device_fini at ffffffffa0c95853 [mdt]
    /home/lustre/master-mine/libcfs/include/libcfs/libcfs_debug.h: 154
 #8 [ffff88011750fc00] class_cleanup at ffffffffa017e756 [obdclass]
    /home/lustre/master-mine/lustre/include/obd_class.h: 598
 #9 [ffff88011750fc78] class_process_config at ffffffffa017f81a [obdclass]
    /home/lustre/master-mine/libcfs/include/libcfs/libcfs_debug.h: 154
#10 [ffff88011750fd28] class_manual_cleanup at ffffffffa018299c [obdclass]
    /home/lustre/master-mine/lustre/obdclass/obd_config.c: 2277
#11 [ffff88011750fdc0] server_put_super at ffffffffa01bcb65 [obdclass]
    /home/lustre/master-mine/libcfs/include/libcfs/libcfs_fail.h: 79
#12 [ffff88011750fe90] generic_shutdown_super at ffffffff811fc7df
    /tmp/kernel/./include/linux/compiler.h: 188
#13 [ffff88011750fea8] kill_anon_super at ffffffff811fcae9
    /tmp/kernel/fs/super.c: 1047
#14 [ffff88011750feb8] deactivate_locked_super at ffffffff811fcf34
    /tmp/kernel/fs/super.c: 333
#15 [ffff88011750fed0] cleanup_mnt at ffffffff8121dc96
    /tmp/kernel/fs/namespace.c: 1125
#16 [ffff88011750fee0] task_work_run at ffffffff810cebca
    /tmp/kernel/kernel/task_work.c: 115
#17 [ffff88011750ff20] exit_to_usermode_loop at ffffffff81001d94
    /tmp/kernel/./include/linux/tracehook.h: 193
#18 [ffff88011750ff38] do_syscall_64 at ffffffff81002603
    /tmp/kernel/arch/x86/entry/common.c: 197
#19 [ffff88011750ff50] entry_SYSCALL_64_after_hwframe at ffffffff818000ae
    /tmp/kernel/arch/x86/entry/entry_64.S: 242
    RIP: 00007fb588e44dfb  RSP: 00007ffddcb4b5d8  RFLAGS: 00000246
    RAX: 0000000000000000  RBX: 000056404c20e400  RCX: 00007fb588e44dfb
    RDX: 0000000000000001  RSI: 0000000000000000  RDI: 000056404c2129a0
    RBP: 0000000000000000   R8: 000056404c2129c0   R9: 00007fb588ec8580
    R10: 0000000000000000  R11: 0000000000000246  R12: 000056404c2129a0
    R13: 00007fb589bf2184  R14: 000056404c20e5e0  R15: 00000000ffffffff
    ORIG_RAX: 00000000000000a6  CS: 0033  SS: 002b


 Comments   
Comment by Gerrit Updater [ 15/Oct/21 ]

"Sergey Cheremencev <sergey.cheremencev@hpe.com>" uploaded a new patch: https://review.whamcloud.com/45256
Subject: LU-15097 quota: stop pool_recalc before killing pool
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 1c8ccbaef5e90bea6681ac2072eb114526eb5185

Comment by Sergey Cheremencev [ 15/Oct/21 ]

Accidentally pushed test with it's original HPe ticket number - https://review.whamcloud.com/#/c/45255/.
It successfully reproduced the issue from the description - http://testing.linuxhacker.ru/lustre-reports/19063/testresults/sanity-quota-special10-ldiskfs-centos7_x86_64-centos7_x86_64/

Comment by Gerrit Updater [ 06/Jan/22 ]

"Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/45256/
Subject: LU-15097 quota: stop pool_recalc before killing pool
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 862f0baa7c21cb631b98d3886ef9e938f4519573

Comment by Cory Spitz [ 06/Jan/22 ]

Landed for 2.15.0.

https://review.whamcloud.com/#/c/45255 can be abandoned, because the test was landed along with the fix, https://review.whamcloud.com/#/c/45256/.

Generated at Sat Feb 10 03:15:25 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.