Lustre / LU-12230

parallel-scale-nfsv3 test_connectathon times out with segfault


Details

    • Type: Bug
    • Resolution: Fixed
    • Priority: Minor
    • Fix Version/s: None
    • Affects Version/s: Lustre 2.13.0, Lustre 2.12.1, Lustre 2.14.0, Lustre 2.12.4, Lustre 2.12.5, Lustre 2.12.6, Lustre 2.12.8, Lustre 2.15.0
    • Environment: Ubuntu clients
    • Severity: 3

    Description

      parallel-scale-nfsv3 test_connectathon hangs only on Ubuntu 18.04 clients, and on those clients it hangs 100% of the time. The lock tests get as far as test #7 and then crash with a segmentation fault:

      Test #6 - Try to lock the MAXEOF byte.
      	Parent: 6.0  - F_TLOCK [7fffffffffffffff,               1] PASSED.
      	Child:  6.1  - F_TEST  [7ffffffffffffffe,               1] PASSED.
      	Child:  6.2  - F_TEST  [7ffffffffffffffe,               2] PASSED.
      	Child:  6.3  - F_TEST  [7ffffffffffffffe,          ENDING] PASSED.
      	Child:  6.4  - F_TEST  [7fffffffffffffff,               1] PASSED.
      	Child:  6.5  - F_TEST  [7fffffffffffffff,               2] PASSED.
      	Child:  6.6  - F_TEST  [7fffffffffffffff,          ENDING] PASSED.
      	Child:  6.7  - F_TEST  [8000000000000000,          ENDING] PASSED.
      	Child:  6.8  - F_TEST  [8000000000000000,               1] PASSED.
      	Child:  6.9  - F_TEST  [8000000000000000,7fffffffffffffff] PASSED.
      	Child:  6.10 - F_TEST  [8000000000000000,8000000000000000] PASSED.
      	Parent: 6.11 - F_ULOCK [7fffffffffffffff,               1] PASSED.
      
      Test #7 - Test parent/child mutual exclusion.
      	Parent: 7.0  - F_TLOCK [             ffc,               9] PASSED.
      runtests: line 48:  1175 Segmentation fault      (core dumped) $i $TESTARGS $NFSTESTDIR
      lock tests failed
      

      For a recent failure, with logs at https://testing.whamcloud.com/test_sets/4ba5183a-6661-11e9-aeec-52540065bddc , the console log for client 2 (vm9) shows the segfault in tlocklfs:

      [ 2567.134886] Lustre: DEBUG MARKER: ./runtests -N 10 -l -f /mnt/lustre/d0.parallel-scale-nfs/d0.connectathon
      [ 2570.861240] tlocklfs[658]: segfault at 3cfa3a80 ip 00007f68b6289800 sp 00007ffea83150c8 error 4 in libc-2.27.so[7f68b61d8000+1e7000]
      [ 2571.788153] Lustre: DEBUG MARKER: /usr/sbin/lctl mark  parallel-scale-nfsv3 test_connectathon: @@@@@@ FAIL: connectathon failed: 1 
      [ 2572.001779] Lustre: DEBUG MARKER: parallel-scale-nfsv3 test_connectathon: @@@@@@ FAIL: connectathon failed: 1
      

      The console logs for all other nodes do not contain errors.
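
The segfault line in the client 2 console log can be decoded a little further. The following is a minimal sketch, not part of the original report: it parses that kernel message, computes the offset of the faulting instruction inside libc-2.27.so (instruction pointer minus mapping base), and interprets the x86 page-fault error code. The function name `decode_segfault` and the regular expression are illustrative assumptions, and the offset is only meaningful as a file offset if the reported mapping starts at offset 0 of the library.

```python
import re

# Console line copied from the client 2 (vm9) log in this ticket.
LINE = ("[ 2570.861240] tlocklfs[658]: segfault at 3cfa3a80 "
        "ip 00007f68b6289800 sp 00007ffea83150c8 error 4 "
        "in libc-2.27.so[7f68b61d8000+1e7000]")

# Hypothetical pattern for the Linux x86 "segfault at ..." message format.
SEGFAULT_RE = re.compile(
    r"(?P<comm>\S+)\[(?P<pid>\d+)\]: segfault at (?P<addr>[0-9a-f]+) "
    r"ip (?P<ip>[0-9a-f]+) sp (?P<sp>[0-9a-f]+) error (?P<err>[0-9a-f]+) "
    r"in (?P<obj>\S+)\[(?P<base>[0-9a-f]+)\+(?P<size>[0-9a-f]+)\]"
)


def decode_segfault(line):
    """Return a dict describing the fault, or None if the line does not match."""
    m = SEGFAULT_RE.search(line)
    if m is None:
        return None
    ip = int(m["ip"], 16)
    base = int(m["base"], 16)
    size = int(m["size"], 16)
    err = int(m["err"], 16)
    return {
        "process": m["comm"],
        "pid": int(m["pid"]),
        "fault_addr": int(m["addr"], 16),
        "object": m["obj"],
        # Offset of the faulting instruction within the mapped object.
        "offset": ip - base,
        "ip_in_mapping": base <= ip < base + size,
        # x86 page-fault error code bits: 0 = page present,
        # 1 = write access, 2 = fault raised in user mode.
        "page_present": bool(err & 0x1),
        "write": bool(err & 0x2),
        "user_mode": bool(err & 0x4),
    }


info = decode_segfault(LINE)
print(info["process"], info["object"], hex(info["offset"]))
```

For this line, error 4 decodes as a user-mode read of an unmapped page at address 0x3cfa3a80, and the faulting instruction sits at offset 0xb1800 in libc. With a libc-2.27.so build that has debug symbols, that offset could be passed to a tool such as `addr2line -e` to find the function involved.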

      There are several other examples of this failure. A few additional links to logs:
      https://testing.whamcloud.com/test_sets/0a36df6c-411a-11e9-8e92-52540065bddc
      https://testing.whamcloud.com/test_sets/7891c378-4e62-11e9-b98a-52540065bddc
      https://testing.whamcloud.com/test_sets/2d506d0a-560b-11e9-b98a-52540065bddc

            People

              Deiter Alex Deiter
              jamesanunez James Nunez (Inactive)