Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-1976

SWL - mds hard crash

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Blocker
    • Lustre 2.3.0, Lustre 2.4.0
    • Lustre 2.3.0
    • None
    • 3
    • 4425

    Description

      Console [hyperion-rst6] log at 2012-09-18 18:00:00 PDT.
      2012-09-18 18:04:56 Lustre: lustre-MDT0000: haven't heard from client 3beba6a9-a86c-e3b3-e02d-311fe4e1c5ec (at 192.168.118.135@o2ib1) in 227 seconds. I think it's dead, and I am evicting it. exp ffff880288182400, cur 1348016696 expire 1348016546 last 1348016469
      2012-09-18 18:17:21 BUG: unable to handle kernel paging request at 000000008a5e6591
      2012-09-18 18:17:21 IP: [<ffffffffa0855018>] unlock_res_and_lock+0x18/0x40 [ptlrpc]
      2012-09-18 18:17:21 PGD 0
      2012-09-18 18:17:21 BUG: unable to handle kernel NULL pointer dereference at 0000000000000068
      2012-09-18 18:17:21 IP: [<ffffffff81043b49>] no_context+0x99/0x260

      MDS fails to dump a stack, but does dump vmcore. at the same time one client dumped vmcore. both dumps are on brent ~cliffw/hyperion/

      Attachments

        Activity

          People

            yong.fan nasf (Inactive)
            cliffw Cliff White (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: