Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-17936

racer test_1: hang in mdt_reint_rename vs. mdt_object_local_lock with ldlm_completion_ast

    XMLWordPrintable

Details

    • Bug
    • Resolution: Unresolved
    • Minor
    • None
    • Lustre 2.16.0, Lustre 2.15.5
    • None
    • 3
    • 9223372036854775807

    Description

      This issue was created by maloo for sarah <sarah@whamcloud.com>

      This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/619bf499-6edf-42fe-8984-958c0273b87f

      test_1 failed with the following error:

      Timeout occurred after 711 minutes, last suite running was racer
      

      Test session details:
      clients: https://build.whamcloud.com/job/lustre-b2_15/87 - 5.14.21-150500.55.39-default
      servers: https://build.whamcloud.com/job/lustre-b2_15/87 - 4.18.0-477.27.1.el8_lustre.x86_64

      <<Please provide additional information about the failure here>>

      client dmesg

      [Fri May 31 18:55:06 2024] Lustre: DEBUG MARKER: /usr/sbin/lctl mark == racer test 1: racer on clients: onyx-75vm4,onyx-75vm5 DURATION=900 ========================================================== 18:55:07 \(1717181707\)
      [Fri May 31 18:55:06 2024] Lustre: DEBUG MARKER: == racer test 1: racer on clients: onyx-75vm4,onyx-75vm5 DURATION=900 ========================================================== 18:55:07 (1717181707)
      [Fri May 31 18:55:06 2024] Lustre: DEBUG MARKER: DURATION=900 			MDSCOUNT=1 OSTCOUNT=7			RACER_ENABLE_REMOTE_DIRS=false 			RACER_ENABLE_STRIPED_DIRS=false 			RACER_ENABLE_MIGRATION=false 			RACER_ENABLE_PFL=true 			RACER_ENABLE_DOM=true 			RACER_ENABLE_FLR=true 			RACER_MAX_CLEANUP_WAIT= 			RACER_ENABLE
      [Fri May 31 18:55:06 2024] Lustre: DEBUG MARKER: DURATION=900 			MDSCOUNT=1 OSTCOUNT=7			RACER_ENABLE_REMOTE_DIRS=false 			RACER_ENABLE_STRIPED_DIRS=false 			RACER_ENABLE_MIGRATION=false 			RACER_ENABLE_PFL=true 			RACER_ENABLE_DOM=true 			RACER_ENABLE_FLR=true 			RACER_MAX_CLEANUP_WAIT= 			RACER_ENABLE
      [Fri May 31 18:55:08 2024] 11[30738]: segfault at 8 ip 00007fadfa9754e8 sp 00007ffe9ed169a0 error 4 in ld-2.31.so[7fadfa968000+2a000]
      [Fri May 31 18:55:08 2024] Code: 85 4c 12 00 00 49 83 ba f0 00 00 00 00 48 c7 85 10 ff ff ff 00 00 00 00 0f 85 f2 10 00 00 49 8b 42 68 49 83 ba f8 00 00 00 00 <48> 8b 40 08 48 89 85 30 ff ff ff 0f 84 77 03 00 00 45 85 ed 0f 85
      [Fri May 31 18:55:08 2024] systemd-coredump[30819]: Not enough arguments passed by the kernel (0, expected 7).
      [Fri May 31 18:55:09 2024] 2[31099]: segfault at 8 ip 00007fc03af404e8 sp 00007ffc7cd120c0 error 4 in ld-2.31.so[7fc03af33000+2a000]
      [Fri May 31 18:55:09 2024] Code: 85 4c 12 00 00 49 83 ba f0 00 00 00 00 48 c7 85 10 ff ff ff 00 00 00 00 0f 85 f2 10 00 00 49 8b 42 68 49 83 ba f8 00 00 00 00 <48> 8b 40 08 48 89 85 30 ff ff ff 0f 84 77 03 00 00 45 85 ed 0f 85
      [Fri May 31 18:55:09 2024] LustreError: 30738:0:(file.c:242:ll_close_inode_openhandle()) lustre-clilmv-ffff9eb2697ae800: inode [0x2000301a1:0x2d:0x0] mdc close failed: rc = -13
      [Fri May 31 18:55:09 2024] systemd-coredump[31170]: Not enough arguments passed by the kernel (0, expected 7).
      [Fri May 31 18:55:09 2024] 7[31390]: segfault at 8 ip 00007fe40ee4b4e8 sp 00007ffe1187a8c0 error 4 in ld-2.31.so[7fe40ee3e000+2a000]
      [Fri May 31 18:55:09 2024] Code: 85 4c 12 00 00 49 83 ba f0 00 00 00 00 48 c7 85 10 ff ff ff 00 00 00 00 0f 85 f2 10 00 00 49 8b 42 68 49 83 ba f8 00 00 00 00 <48> 8b 40 08 48 89 85 30 ff ff ff 0f 84 77 03 00 00 45 85 ed 0f 85
      [Fri May 31 18:55:09 2024] systemd-coredump[31442]: Not enough arguments passed by the kernel (0, expected 7).
      [Fri May 31 18:55:16 2024] 6[6950]: segfault at 8 ip 00007f872bca14e8 sp 00007ffe59640b50 error 4 in ld-2.31.so[7f872bc94000+2a000]
      [Fri May 31 18:55:16 2024] Code: 85 4c 12 00 00 49 83 ba f0 00 00 00 00 48 c7 85 10 ff ff ff 00 00 00 00 0f 85 f2 10 00 00 49 8b 42 68 49 83 ba f8 00 00 00 00 <48> 8b 40 08 48 89 85 30 ff ff ff 0f 84 77 03 00 00 45 85 ed 0f 85
      [Fri May 31 18:55:16 2024] systemd-coredump[7076]: Not enough arguments passed by the kernel (0, expected 7).
      [Fri May 31 18:55:16 2024] LustreError: 6950:0:(file.c:242:ll_close_inode_openhandle()) lustre-clilmv-ffff9eb2697ae800: inode [0x2000301a4:0x2a6:0x0] mdc close failed: rc = -13
      [Fri May 31 18:55:16 2024] LustreError: 6950:0:(file.c:242:ll_close_inode_openhandle()) Skipped 1 previous similar message
      [Fri May 31 18:55:16 2024] 6[7325]: segfault at 8 ip 00007f35830d84e8 sp 00007ffddfd813e0 error 4 in ld-2.31.so[7f35830cb000+2a000]
      [Fri May 31 18:55:16 2024] Code: 85 4c 12 00 00 49 83 ba f0 00 00 00 00 48 c7 85 10 ff ff ff 00 00 00 00 0f 85 f2 10 00 00 49 8b 42 68 49 83 ba f8 00 00 00 00 <48> 8b 40 08 48 89 85 30 ff ff ff 0f 84 77 03 00 00 45 85 ed 0f 85
      [Fri May 31 18:55:16 2024] systemd-coredump[7404]: Not enough arguments passed by the kernel (0, expected 7).
      

      VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
      racer test_1 - Timeout occurred after 711 minutes, last suite running was racer

      Attachments

        Issue Links

          Activity

            People

              green Oleg Drokin
              maloo Maloo
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated: