Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-1359

Test failure on test suite parallel-scale-nfsv3, subtest test_iorfpp

    XMLWordPrintable

Details

    • Bug
    • Resolution: Duplicate
    • Minor
    • None
    • Lustre 2.1.3
    • None
    • 3
    • 6410

    Description

      This issue was created by maloo for sarah <sarah@whamcloud.com>

      This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/b019eb0a-929d-11e1-9e8b-525400d2bfa6.

      The sub-test test_iorfpp failed with the following error:

      test failed to respond and timed out

      Got OOM on MDS:

      22:32:05:Out of memory: Killed process 2096, UID 51, (sendmail).
      22:32:09:Lustre: 3246:0:(client.c:1778:ptlrpc_expire_one_request()) @@@ Request x1400607847191960 sent from lustre-OST0005-osc-ffff810046783400 to NID 172.29.3.12@tcp has timed out for slow reply: [sent 1335753119] [real_sent 1335753119] [current 1335753127] [deadline 8s] [delay 0s] req@ffff8100478a3800 x1400607847191960/t0(0) o-1->lustre-OST0005_UUID@172.29.3.12@tcp:6/4 lens 488/424 e 0 to 1 dl 1335753127 ref 2 fl Rpc:X/ffffffff/ffffffff rc 0/-1
      22:32:09:Lustre: 3246:0:(client.c:1778:ptlrpc_expire_one_request()) Skipped 4 previous similar messages
      22:32:09:Lustre: lustre-OST0005-osc-ffff810046783400: Connection to service lustre-OST0005 via nid 172.29.3.12@tcp was lost; in progress operations using this service will wait for recovery to complete.
      22:32:09:Lustre: Skipped 4 previous similar messages
      22:32:09:crond invoked oom-killer: gfp_mask=0x200d2, order=0, oomkilladj=0
      22:32:09:
      22:32:09:Call Trace:
      22:32:09: [<ffffffff800c9fc2>] out_of_memory+0x8e/0x2f3
      22:32:09: [<ffffffff8000f67d>] __alloc_pages+0x27f/0x308
      22:32:09: [<ffffffff80032549>] read_swap_cache_async+0x45/0xd8
      22:32:09: [<ffffffff800cfe82>] swapin_readahead+0x60/0xd3
      22:32:10: [<ffffffff800092d9>] __handle_mm_fault+0xb64/0x103b
      22:32:10: [<ffffffff80063002>] thread_return+0x62/0xfe
      22:32:10: [<ffffffff80067202>] do_page_fault+0x499/0x842
      22:32:10: [<ffffffff8005a1aa>] hrtimer_cancel+0xc/0x16
      22:32:10: [<ffffffff80063cf9>] do_nanosleep+0x47/0x70
      22:32:10: [<ffffffff8005a097>] hrtimer_nanosleep+0x58/0x118
      22:32:10: [<ffffffff8005dde9>] error_exit+0x0/0x84

      Attachments

        Issue Links

          Activity

            People

              bogl Bob Glossman (Inactive)
              maloo Maloo
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: