Details

    • Bug
    • Resolution: Duplicate
    • Minor
    • None
    • Lustre 1.8.8
    • None
    • 3
    • 6342

    Description

      NOAA hit a problem that looks a lot like LU-441. The clients were unable to mount the filesystem for a while after rebooting.

      Here's the client's syslog:
      Aug 30 15:00:09 s1 kernel: Lustre: 2524:0:(client.c:1487:ptlrpc_expire_one_request()) @@@ Request x1410486709891636 sent from MGC10.179.16.120@o2ib to NID 10.179.16.121@o2ib 0s ago has failed due to network error (35s prior to deadline).
      Aug 30 15:00:09 s1 kernel: req@ffff8805fc06e400 x1410486709891636/t0 o250->MGS@MGC10.179.16.120@o2ib_1:26/25 lens 368/584 e 0 to 1 dl 1346338844 ref 1 fl Rpc:N/0/0 rc 0/0
      Aug 30 15:00:09 s1 kernel: LustreError: 112398:0:(client.c:858:ptlrpc_import_delay_req()) @@@ IMP_INVALID req@ffff88041c05e800 x1410486709891637/t0 o501->MGS@MGC10.179.16.120@o2ib_1:26/25 lens 264/432 e 0 to 1 dl 0 ref 1 fl Rpc:/0/0 rc 0/0
      Aug 30 15:00:09 s1 kernel: LustreError: 15c-8: MGC10.179.16.120@o2ib: The configuration from log 'lfs2-client' failed (-108). This may be the result of communication errors between this node and the MGS, a bad configuration, or other errors. See the syslog for more information.
      Aug 30 15:00:09 s1 kernel: LustreError: 112398:0:(llite_lib.c:1095:ll_fill_super()) Unable to process log: -108
      Aug 30 15:00:09 s1 kernel: Lustre: client lfs2-client(ffff88041c2ea400) umount complete
      Aug 30 15:00:09 s1 kernel: LustreError: 112398:0:(obd_mount.c:2065:lustre_fill_super()) Unable to mount (-108)

      MDS logs to come.
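
      For triage, the source location and return code can be pulled out of messages like the ones above mechanically. The following is a minimal sketch and not part of the original report: it assumes the `pid:0:(file:line:function())` layout seen in the syslog excerpt, and the regular expressions and the helper name `parse_lustre_line` are illustrative only. The mapping of 108 to ESHUTDOWN is the Linux errno value and is hardcoded so the sketch does not depend on the host platform.

```python
import re

# Linux errno value seen in the messages above; hardcoded on purpose so the
# lookup does not depend on the errno table of the machine running this.
LINUX_ERRNO = {108: "ESHUTDOWN"}  # cannot send after transport endpoint shutdown

# Assumed layout: "<pid>:<n>:(<file>:<line>:<function>()) <message>"
LOC_RE = re.compile(
    r"(?P<pid>\d+):\d+:\((?P<file>[\w.]+):(?P<line>\d+):(?P<func>\w+)\(\)\)\s*(?P<msg>.*)"
)
# Trailing negative return code, written either as "(-108)" or ": -108"
RC_RE = re.compile(r"(?:\(|:\s)-(?P<rc>\d+)\)?\s*$")


def parse_lustre_line(text):
    """Return a dict with pid, file, line, func, msg, errno, errno_name,
    or None if the line has no "(file:line:function())" location part."""
    m = LOC_RE.search(text)
    if m is None:
        return None
    info = m.groupdict()
    info["pid"] = int(info["pid"])
    info["line"] = int(info["line"])
    rc = RC_RE.search(info["msg"])
    info["errno"] = int(rc.group("rc")) if rc else None
    info["errno_name"] = LINUX_ERRNO.get(info["errno"])
    return info


if __name__ == "__main__":
    sample = (
        "Aug 30 15:00:09 s1 kernel: LustreError: "
        "112398:0:(obd_mount.c:2065:lustre_fill_super()) Unable to mount (-108)"
    )
    print(parse_lustre_line(sample))
```

      Applied to the final `lustre_fill_super()` message, this reports errno 108 (ESHUTDOWN), which is consistent with the MGC import having been invalidated (IMP_INVALID) after the network error on the first request.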

          People

            keith Keith Mannthey (Inactive)
            kitwestneat Kit Westneat (Inactive)