Details
-
Bug
-
Resolution: Unresolved
-
Minor
-
None
-
None
-
None
-
3
-
9223372036854775807
Description
This issue was created by maloo for Arshad <arshad.hussain@aeoncomputing.com>
This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/aec6b1c3-5f88-425a-8bf5-50415f59f012
Test session details:
clients: https://build.whamcloud.com/job/lustre-reviews/111944 - 4.18.0-553.44.1.el8_10.x86_64
servers: https://build.whamcloud.com/job/lustre-reviews/111944 - 4.18.0-553.44.1.el8_lustre.x86_64
Crashes executing sanity test 900 during umount:
[17722.241646] LustreError: MGC10.240.28.46@tcp: Connection to MGS (at 10.240.28.46@tcp) was lost; in progress operations using this service will fail [17727.828813] Lustre: lustre-MDT0001: Not available for connect from 10.240.28.46@tcp (stopping) [17729.469790] Lustre: lustre-MDT0001: Not available for connect from 10.240.24.216@tcp (stopping) [17729.471698] Lustre: Skipped 7 previous similar messages : [17771.667290] watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [umount:573821] [17772.371426] CPU: 1 PID: 573821 Comm: umount 4.18.0-553.44.1.el8_lustre.x86_64 #1 [17772.522202] RIP: 0010:cfs_hash_for_each_relax+0x17b/0x480 [obdclass] [17773.230656] Call Trace: [17774.042684] ? cleanup_resource+0x350/0x350 [ptlrpc] [17774.044248] ? cfs_hash_for_each_relax+0x17b/0x480 [obdclass] [17774.045444] ? cfs_hash_for_each_relax+0x172/0x480 [obdclass] [17774.046634] ? cleanup_resource+0x350/0x350 [ptlrpc] [17774.047719] ? cleanup_resource+0x350/0x350 [ptlrpc] [17774.048817] cfs_hash_for_each_nolock+0x124/0x200 [obdclass] [17774.049984] ldlm_namespace_cleanup+0x2b/0xc0 [ptlrpc] [17774.262073] __ldlm_namespace_free+0x52/0x4e0 [ptlrpc] [17774.263285] ldlm_namespace_free_prior+0x5e/0x200 [ptlrpc] [17774.264623] mdt_device_fini+0x480/0xf80 [mdt] [17775.511739] obd_precleanup+0xf4/0x220 [obdclass] [17775.514029] class_cleanup+0x322/0x900 [obdclass] [17775.515047] class_process_config+0x3bb/0x20a0 [obdclass] [17775.517336] class_manual_cleanup+0x45b/0x780 [obdclass] [17775.518435] server_put_super+0xd62/0x11f0 [ptlrpc] [17775.578275] generic_shutdown_super+0x6c/0x110 [17775.579220] kill_anon_super+0x14/0x30 [17775.580050] deactivate_locked_super+0x34/0x70 [17775.581003] cleanup_mnt+0x3b/0x70 [17775.581767] task_work_run+0x8a/0xb0 [17775.582579] exit_to_usermode_loop+0xef/0x100 [17775.583529] do_syscall_64+0x195/0x1a0 [17775.584330] entry_SYSCALL_64_after_hwframe+0x66/0xcb
+1 on master (sanity-pfl cleanup): https://testing.whamcloud.com/test_sets/517409d7-cdf2-4f70-b165-ada076d5414c