Lustre / LU-17154

parallel-scale-nfsv4: hangs on umount after racer_on_nfs

Details

    • Type: Bug
    • Resolution: Unresolved
    • Priority: Minor
    • Fix Version/s: None
    • Affects Version/s: Lustre 2.16.0
    • Labels: None
    • Severity: 3
    • Rank (Obsolete): 9223372036854775807

    Description

      This issue was created by maloo for jianyu <yujian@whamcloud.com>

      This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/568e0c21-9347-476a-beac-081e9b2ee112

      Test session details:
      clients: https://build.whamcloud.com/job/lustre-master/4468 - 5.14.0-284.25.1.el9_2.x86_64
      servers: https://build.whamcloud.com/job/lustre-master/4468 - 5.14.0-284.25.1_lustre.el9.x86_64

      parallel-scale-nfsv4 hangs on:

      Stopping client trevis-27vm4 /mnt/lustre (opts:-f)
      CMD: trevis-27vm4 lsof -t /mnt/lustre
      pdsh@trevis-27vm1: trevis-27vm4: ssh exited with exit code 1
      CMD: trevis-27vm4 umount -f /mnt/lustre 2>&1
      

      Console log on trevis-27vm4:

      [70712.060132] Lustre: DEBUG MARKER: umount -f /mnt/lustre 2>&1
      [70712.213680] Lustre: setting import lustre-MDT0000_UUID INACTIVE by administrator request
      [70712.215066] LustreError: 2067684:0:(file.c:245:ll_close_inode_openhandle()) lustre-clilmv-ffffa03d96d7f000: inode [0x200000bd3:0x2c31:0x0] mdc close failed: rc = -108
      [70712.243116] Lustre: 1411383:0:(llite_lib.c:3965:ll_dirty_page_discard_warn()) lustre: dirty page discard: 10.240.38.143@tcp:/lustre/fid: [0x28000040a:0x3699:0x0]/ may get corrupted (rc -108)
      [70712.243167] Lustre: 1411382:0:(llite_lib.c:3965:ll_dirty_page_discard_warn()) lustre: dirty page discard: 10.240.38.143@tcp:/lustre/fid: [0x2c000040a:0x3318:0x0]/ may get corrupted (rc -108)
      <~snip~>
      [70742.217783] Lustre: lustre-MDT0000: haven't heard from client 0e545e12-9ad6-4857-a78b-e65f011477b4 (at 0@lo) in 31 seconds. I think it's dead, and I am evicting it. exp 00000000320f809c, cur 1695838270 expire 1695838240 last 1695838239
      [70745.262062] Lustre: lustre-MDT0002: haven't heard from client 0e545e12-9ad6-4857-a78b-e65f011477b4 (at 0@lo) in 34 seconds. I think it's dead, and I am evicting it. exp 00000000887f97a0, cur 1695838273 expire 1695838243 last 1695838239
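
The two eviction messages above can be cross-checked with a short script. This is only a sketch for triage, not part of the test suite: the regular expression, the `parse_evictions` helper, and the reading of `cur`, `expire`, and `last` as epoch seconds are assumptions made for illustration. It parses the lines and checks that `cur - last` matches the reported number of silent seconds.

```python
import re

# The two eviction lines, copied verbatim from the console log above.
LOG = """\
[70742.217783] Lustre: lustre-MDT0000: haven't heard from client 0e545e12-9ad6-4857-a78b-e65f011477b4 (at 0@lo) in 31 seconds. I think it's dead, and I am evicting it. exp 00000000320f809c, cur 1695838270 expire 1695838240 last 1695838239
[70745.262062] Lustre: lustre-MDT0002: haven't heard from client 0e545e12-9ad6-4857-a78b-e65f011477b4 (at 0@lo) in 34 seconds. I think it's dead, and I am evicting it. exp 00000000887f97a0, cur 1695838273 expire 1695838243 last 1695838239
"""

# Hypothetical pattern written for this message format only.
EVICT_RE = re.compile(
    r"\[(?P<ts>\d+\.\d+)\] Lustre: (?P<target>\S+): haven't heard from client "
    r"(?P<client>\S+) \(at (?P<nid>\S+)\) in (?P<silent>\d+) seconds\..*?"
    r"cur (?P<cur>\d+) expire (?P<expire>\d+) last (?P<last>\d+)"
)


def parse_evictions(text):
    """Return one dict per eviction message found in text."""
    evictions = []
    for line in text.splitlines():
        match = EVICT_RE.search(line)
        if match is None:
            continue
        fields = match.groupdict()
        evictions.append({
            "uptime": float(fields["ts"]),    # kernel timestamp of the message
            "target": fields["target"],       # e.g. lustre-MDT0000
            "client": fields["client"],       # client UUID
            "silent": int(fields["silent"]),  # seconds without contact
            "cur": int(fields["cur"]),        # assumed: epoch seconds at eviction
            "last": int(fields["last"]),      # assumed: epoch seconds of last contact
        })
    return evictions


evictions = parse_evictions(LOG)
for e in evictions:
    consistent = (e["cur"] - e["last"]) == e["silent"]
    print(e["target"], "last contact:", e["last"], "consistent:", consistent)
```

Under those assumptions, both targets report the same last-contact value (1695838239) for the same client UUID, which is consistent with the client going silent once, around the time of the `umount -f`, rather than being evicted for unrelated reasons.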
      

            People

              Assignee: Alex Deiter
              Reporter: Maloo
              Votes: 0
              Watchers: 4
