Lustre / LU-19154

sanity test_56eaa: Connection to lustre-OST was lost


Details

    • Bug
    • Resolution: Unresolved
    • Medium
    • 3

    Description

      This issue was created by maloo for S Buisson <sbuisson@ddn.com>

      This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/d9255f74-6b3a-4ffd-895f-4763af42e41c

      test_56eaa failed with the following error:

      Timeout occurred after 143 minutes, last suite running was sanity
      

      Test session details:
      clients: https://build.whamcloud.com/job/lustre-reviews/114716 - 4.18.0-553.53.1.el8_10.x86_64
      servers: https://build.whamcloud.com/job/lustre-reviews/114716 - 4.18.0-553.53.1.el8_lustre.x86_64

      On the MDS:

      [ 3070.445290] Autotest: Test running for 50 minutes (lustre-reviews_review-dne-zfs-part-1_114716.35)
      [ 3088.326551] Lustre: 12272:0:(client.c:2453:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1751400349/real 1751400349]  req@ffff93e4aef76d80 x1836473309951360/t0(0) o13->lustre-OST0001-osc-MDT0002@10.240.25.37@tcp:7/4 lens 224/368 e 0 to 1 dl 1751400365 ref 1 fl Rpc:XQr/200/ffffffff rc 0/-1 job:'osp-pre-1-2.0' uid:0 gid:0 projid:4294967295
      [ 3088.333126] Lustre: lustre-OST0007-osc-MDT0002: Connection to lustre-OST0007 (at 10.240.25.37@tcp) was lost; in progress operations using this service will wait for recovery to complete
      [ 3088.338286] Lustre: 12272:0:(client.c:2453:ptlrpc_expire_one_request()) Skipped 1 previous similar message
      [ 3088.341065] Lustre: Skipped 12 previous similar messages
      [ 3089.350550] Lustre: 12272:0:(client.c:2453:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1751400350/real 1751400350]  req@ffff93e4affaea40 x1836473309956224/t0(0) o13->lustre-OST0007-osc-MDT0000@10.240.25.37@tcp:7/4 lens 224/368 e 0 to 1 dl 1751400366 ref 1 fl Rpc:XQr/200/ffffffff rc 0/-1 job:'osp-pre-7-0.0' uid:0 gid:0 projid:4294967295
      [ 3089.356221] Lustre: 12272:0:(client.c:2453:ptlrpc_expire_one_request()) Skipped 9 previous similar messages
      [ 3091.398529] Lustre: 12272:0:(client.c:2453:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1751400352/real 1751400352]  req@ffff93e4affad6c0 x1836473309960064/t0(0) o13->lustre-OST0000-osc-MDT0000@10.240.25.37@tcp:7/4 lens 224/368 e 0 to 1 dl 1751400368 ref 1 fl Rpc:XQr/200/ffffffff rc 0/-1 job:'osp-pre-0-0.0' uid:0 gid:0 projid:4294967295
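      For triage, the expired requests in the MDS log above can be summarized per target. The following is a minimal sketch, not part of the Maloo report: the regular expression is inferred only from the three ptlrpc_expire_one_request() lines quoted here, and the names EXPIRE_RE and summarize_expired are hypothetical. It extracts the target, the server NID, the opcode, and the gap between the "sent" timestamp and the "dl" deadline.

```python
import re

# Field layout inferred from the ptlrpc_expire_one_request() lines quoted in
# this ticket; other Lustre versions may format these messages differently.
EXPIRE_RE = re.compile(
    r"\[\s*(?P<uptime>\d+\.\d+)\].*?"               # kernel uptime stamp
    r"\[sent (?P<sent>\d+)/real \d+\].*?"           # epoch time the RPC was sent
    r"o(?P<opc>\d+)->(?P<target>\S+?)@(?P<nid>\S+?):\d+/\d+ .*?"
    r"dl (?P<deadline>\d+) "                        # epoch deadline of the RPC
)


def summarize_expired(lines):
    """Return one dict per expired-request line; other lines are skipped."""
    out = []
    for line in lines:
        if "ptlrpc_expire_one_request" not in line:
            continue
        m = EXPIRE_RE.search(line)
        if m is None:
            # e.g. "Skipped N previous similar messages" lines
            continue
        out.append({
            "uptime": float(m.group("uptime")),
            "target": m.group("target"),
            "nid": m.group("nid"),
            "opcode": int(m.group("opc")),
            # seconds between the send time and the deadline
            "deadline_after_sent": int(m.group("deadline")) - int(m.group("sent")),
        })
    return out
```

      Feeding the console log lines to summarize_expired() gives one record per timed-out request, which makes it easier to see whether all of the affected targets sit behind the same NID.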
      

      VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
      sanity test_56eaa - Timeout occurred after 143 minutes, last suite running was sanity


          People

            wc-triage WC Triage
            maloo Maloo