Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-3005

MDT attempted to access beyond the disk

Details

    • Bug
    • Resolution: Duplicate
    • Blocker
    • None
    • Lustre 2.4.0
    • Lustre 2.3.62 running the servers as well as the clients.
    • 4
    • 7318

    Description

      While running mdtest the file system went into read only mode and the file system reported corruption on the MDS. The error is:

      [56736.791445] attempt to access beyond end of device
      [56736.804198] md5: rw=0, want=10484831811994872656, limit=282775552
      [56736.818141] LDISKFS-fs error (device md5): ldiskfs_xattr_delete_inode: inode 22016621: block 3616446985713053033 read error
      [56736.837277] Aborting journal on device md5-8.
      [56736.860700] LDISKFS-fs (md5): Remounting filesystem read-only
      [56736.881388] LDISKFS-fs error (device md5) in ldiskfs_free_inode: Journal has aborted
      [56736.896935] LustreError: 8976:0:(osd_handler.c:636:osd_trans_commit_cb()) transaction @0xffff880414777b80 commit error: 2
      [56736.915693] LustreError: 8976:0:(osd_handler.c:636:osd_trans_commit_cb()) transaction @0xffff880113757680 commit error: 2
      [56741.072627] LustreError: 9039:0:(llog.c:161:llog_cancel_rec()) lustre-OST0000-osc-MDT0000: fail to write header for llog #0x81#0x1#00000000: rc = -30
      [56741.101191] LustreError: 9039:0:(llog_cat.c:535:llog_cat_cancel_records()) lustre-OST0000-osc-MDT0000: fail to cancel 1 of 1 llog-records: rc = -30
      [56741.129587] LustreError: 9039:0:(osp_sync.c:720:osp_sync_process_committed()) lustre-OST0000-osc-MDT0000: can't cancel record: -30
      [56741.156530] LustreError: 9039:0:(llog.c:161:llog_cancel_rec()) lustre-OST0000-osc-MDT0000: fail to write header for llog #0x81#0x1#00000000: rc = -30
      [56741.157240] LustreError: 9041:0:(llog_cat.c:535:llog_cat_cancel_records()) lustre-OST0001-osc-MDT0000: fail to cancel 1 of 1 llog-records: rc = -30
      [56741.157245] LustreError: 9041:0:(osp_sync.c:720:osp_sync_process_committed()) lustre-OST0001-osc-MDT0000: can't cancel record: -30
      [56741.241180] LustreError: 9039:0:(llog.c:161:llog_cancel_rec()) Skipped 2 previous similar messages

      Attachments

        Issue Links

          Activity

            People

              bfaccini Bruno Faccini (Inactive)
              simmonsja James A Simmons
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: