Details
Type: Bug
Resolution: Duplicate
Priority: Blocker
Fix Version/s: None
Affects Version/s: Lustre 2.4.0
Environment: Lustre 2.3.62 running on the servers as well as the clients.
Severity: 4
Rank: 7318
Description
While running mdtest, the file system was remounted read-only and the MDS reported file system corruption. The console log on the MDS shows:
[56736.791445] attempt to access beyond end of device
[56736.804198] md5: rw=0, want=10484831811994872656, limit=282775552
[56736.818141] LDISKFS-fs error (device md5): ldiskfs_xattr_delete_inode: inode 22016621: block 3616446985713053033 read error
[56736.837277] Aborting journal on device md5-8.
[56736.860700] LDISKFS-fs (md5): Remounting filesystem read-only
[56736.881388] LDISKFS-fs error (device md5) in ldiskfs_free_inode: Journal has aborted
[56736.896935] LustreError: 8976:0:(osd_handler.c:636:osd_trans_commit_cb()) transaction @0xffff880414777b80 commit error: 2
[56736.915693] LustreError: 8976:0:(osd_handler.c:636:osd_trans_commit_cb()) transaction @0xffff880113757680 commit error: 2
[56741.072627] LustreError: 9039:0:(llog.c:161:llog_cancel_rec()) lustre-OST0000-osc-MDT0000: fail to write header for llog #0x81#0x1#00000000: rc = -30
[56741.101191] LustreError: 9039:0:(llog_cat.c:535:llog_cat_cancel_records()) lustre-OST0000-osc-MDT0000: fail to cancel 1 of 1 llog-records: rc = -30
[56741.129587] LustreError: 9039:0:(osp_sync.c:720:osp_sync_process_committed()) lustre-OST0000-osc-MDT0000: can't cancel record: -30
[56741.156530] LustreError: 9039:0:(llog.c:161:llog_cancel_rec()) lustre-OST0000-osc-MDT0000: fail to write header for llog #0x81#0x1#00000000: rc = -30
[56741.157240] LustreError: 9041:0:(llog_cat.c:535:llog_cat_cancel_records()) lustre-OST0001-osc-MDT0000: fail to cancel 1 of 1 llog-records: rc = -30
[56741.157245] LustreError: 9041:0:(osp_sync.c:720:osp_sync_process_committed()) lustre-OST0001-osc-MDT0000: can't cancel record: -30
[56741.241180] LustreError: 9039:0:(llog.c:161:llog_cancel_rec()) Skipped 2 previous similar messages
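For anyone triaging similar logs, the following is a minimal sketch (assuming Linux errno numbering; the function names are hypothetical and not part of any Lustre tool) that pulls out the two facts the log carries: the block request ends far past the end of device md5, and the later `-30` return codes are EROFS, i.e. a consequence of the read-only remount rather than a separate failure.

```python
import errno
import re

# Two lines copied verbatim from the console log above.
LOG = """\
[56736.804198] md5: rw=0, want=10484831811994872656, limit=282775552
[56741.129587] LustreError: 9039:0:(osp_sync.c:720:osp_sync_process_committed()) lustre-OST0000-osc-MDT0000: can't cancel record: -30
"""

# "want" is the end of the requested range, "limit" the device size.
WANT_LIMIT = re.compile(r"want=(\d+), limit=(\d+)")
# Negative return codes as they appear in the LustreError lines.
RC = re.compile(r"(?:rc = |record: )(-\d+)")


def out_of_range_requests(log):
    """Return (want, limit) pairs where the request ends past the device."""
    pairs = [(int(w), int(l)) for w, l in WANT_LIMIT.findall(log)]
    return [(w, l) for w, l in pairs if w > l]


def errno_names(log):
    """Map each negative return code found in the log to its errno symbol."""
    codes = {int(rc) for rc in RC.findall(log)}
    return {rc: errno.errorcode.get(-rc, "unknown") for rc in codes}


if __name__ == "__main__":
    print(out_of_range_requests(LOG))
    print(errno_names(LOG))
```

A `want` value this much larger than `limit` is consistent with a corrupted block pointer in the inode rather than a device that is slightly too small.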
Let's focus discussion under the original ticket, LU-2980.