Lustre / LU-18850

sanity/900 Crash with cfs_hash_for_each_relax+0x17b/0x480 [obdclass]

Details

    • Type: Bug
    • Resolution: Unresolved
    • Priority: Minor

    Description

      This issue was created by maloo for Arshad <arshad.hussain@aeoncomputing.com>

      This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/aec6b1c3-5f88-425a-8bf5-50415f59f012

      Test session details:
      clients: https://build.whamcloud.com/job/lustre-reviews/111944 - 4.18.0-553.44.1.el8_10.x86_64
      servers: https://build.whamcloud.com/job/lustre-reviews/111944 - 4.18.0-553.44.1.el8_lustre.x86_64

      sanity test 900 crashes during umount, with a soft lockup in cfs_hash_for_each_relax():

      [17722.241646] LustreError: MGC10.240.28.46@tcp: Connection to MGS (at 10.240.28.46@tcp) was lost; in progress operations using this service will fail
      [17727.828813] Lustre: lustre-MDT0001: Not available for connect from 10.240.28.46@tcp (stopping)
      [17729.469790] Lustre: lustre-MDT0001: Not available for connect from 10.240.24.216@tcp (stopping)
      [17729.471698] Lustre: Skipped 7 previous similar messages
      :
      [17771.667290] watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [umount:573821]
      [17772.371426] CPU: 1 PID: 573821 Comm: umount 4.18.0-553.44.1.el8_lustre.x86_64 #1
      [17772.522202] RIP: 0010:cfs_hash_for_each_relax+0x17b/0x480 [obdclass]
      [17773.230656] Call Trace:
      [17774.042684] ? cleanup_resource+0x350/0x350 [ptlrpc]
      [17774.044248] ? cfs_hash_for_each_relax+0x17b/0x480 [obdclass]
      [17774.045444] ? cfs_hash_for_each_relax+0x172/0x480 [obdclass]
      [17774.046634] ? cleanup_resource+0x350/0x350 [ptlrpc]
      [17774.047719] ? cleanup_resource+0x350/0x350 [ptlrpc]
      [17774.048817] cfs_hash_for_each_nolock+0x124/0x200 [obdclass]
      [17774.049984] ldlm_namespace_cleanup+0x2b/0xc0 [ptlrpc]
      [17774.262073] __ldlm_namespace_free+0x52/0x4e0 [ptlrpc]
      [17774.263285] ldlm_namespace_free_prior+0x5e/0x200 [ptlrpc]
      [17774.264623] mdt_device_fini+0x480/0xf80 [mdt]
      [17775.511739] obd_precleanup+0xf4/0x220 [obdclass]
      [17775.514029] class_cleanup+0x322/0x900 [obdclass]
      [17775.515047] class_process_config+0x3bb/0x20a0 [obdclass]
      [17775.517336] class_manual_cleanup+0x45b/0x780 [obdclass]
      [17775.518435] server_put_super+0xd62/0x11f0 [ptlrpc]
      [17775.578275] generic_shutdown_super+0x6c/0x110
      [17775.579220] kill_anon_super+0x14/0x30
      [17775.580050] deactivate_locked_super+0x34/0x70
      [17775.581003] cleanup_mnt+0x3b/0x70
      [17775.581767] task_work_run+0x8a/0xb0
      [17775.582579] exit_to_usermode_loop+0xef/0x100
      [17775.583529] do_syscall_64+0x195/0x1a0
      [17775.584330] entry_SYSCALL_64_after_hwframe+0x66/0xcb
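
In a kernel call trace, frames prefixed with `?` are ones the unwinder could not verify, so they are usually set aside when reading the chain that led to the lockup. The following is a minimal sketch for extracting only the unmarked frames from a console log in the timestamped format shown above. The names `FRAME_RE`, `reliable_frames`, and `SAMPLE_TRACE` are hypothetical and are not part of Lustre or Maloo tooling; the regular expression assumes lowercase hex offsets and an optional `[module]` suffix, as in this log.

```python
import re

# One call-trace frame, e.g.
#   [17774.048817] cfs_hash_for_each_nolock+0x124/0x200 [obdclass]
# An optional "? " before the symbol marks a frame the unwinder is unsure about.
FRAME_RE = re.compile(
    r"^\s*\[\s*\d+\.\d+\]\s+(?P<unreliable>\?\s+)?"
    r"(?P<symbol>[A-Za-z_][\w.]*)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?:\s+\[(?P<module>\w+)\])?\s*$"
)


def reliable_frames(log_text):
    """Return (symbol, module) for every frame not marked with '?'.

    module is None for frames that belong to the core kernel.
    Lines that are not frames (RIP, "Call Trace:", messages) are ignored.
    """
    frames = []
    for line in log_text.splitlines():
        m = FRAME_RE.match(line)
        if m and not m.group("unreliable"):
            frames.append((m.group("symbol"), m.group("module")))
    return frames


# A few lines copied from the trace in this issue.
SAMPLE_TRACE = """\
[17772.522202] RIP: 0010:cfs_hash_for_each_relax+0x17b/0x480 [obdclass]
[17773.230656] Call Trace:
[17774.046634] ? cleanup_resource+0x350/0x350 [ptlrpc]
[17774.048817] cfs_hash_for_each_nolock+0x124/0x200 [obdclass]
[17774.049984] ldlm_namespace_cleanup+0x2b/0xc0 [ptlrpc]
[17775.578275] generic_shutdown_super+0x6c/0x110
"""

if __name__ == "__main__":
    for symbol, module in reliable_frames(SAMPLE_TRACE):
        print(symbol, module or "(kernel)")
```

Applied to the full trace, this leaves the path from `server_put_super` through `mdt_device_fini` and `ldlm_namespace_cleanup` into `cfs_hash_for_each_nolock`, with the RIP line showing the CPU inside `cfs_hash_for_each_relax`.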

            People

              arshad512 Arshad Hussain
              maloo Maloo