Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-15239

replay-single test_35 : Timeout occurred after 248 mins

    XMLWordPrintable

Details

    • Bug
    • Resolution: Unresolved
    • Minor
    • None
    • None
    • None
    • 3
    • 9223372036854775807

    Description

      https://testing.whamcloud.com/test_sets/68256292-28e9-4664-a117-da82cba40996
      == replay-single test 35: test recovery from llog for unlink op ========================================================== 22:10:18 (1637014218)
      CMD: onyx-124vm17 lctl set_param fail_loc=0x80000119
      fail_loc=0x80000119
      ...
      CMD: onyx-124vm17 zfs get -H -o value lustre:svname lustre-mdt1/mdt1 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}'
      pdsh@onyx-64vm10: onyx-124vm17: ssh exited with exit code 1
      CMD: onyx-124vm17 zfs get -H -o value lustre:svname lustre-mdt1/mdt1 2>/dev/null
      Started lustre-MDT0000
      rm: cannot remove '/mnt/lustre/f35.replay-single': Input/output error
      MDS:

      [Mon Nov 15 22:10:17 2021] Lustre: DEBUG MARKER: == replay-single test 35: test recovery from llog for unlink op ========================================================== 22:10:18 (1637014218)
      [Mon Nov 15 22:10:39 2021] LustreError: 167-0: lustre-MDT0000-mdc-ffff9c97fbee0800: This client was evicted by lustre-MDT0000; in progress operations using this service will fail.
      [Mon Nov 15 22:10:48 2021] Lustre: lustre-MDT0000-mdc-ffff9c97fbee0800: Connection to lustre-MDT0000 (at 10.240.30.48@tcp) was lost; in progress operations using this service will wait for recovery to complete
      [Mon Nov 15 22:10:48 2021] Lustre: Skipped 14 previous similar messages 

      Attachments

        Activity

          People

            wc-triage WC Triage
            scherementsev Sergey Cheremencev
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: