[LU-4169] lfsck: FAIL: e2fsck returned 4, should be <= 1 Created: 28/Oct/13 Updated: 04/Dec/14 Resolved: 04/Dec/14 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.4.1, Lustre 2.5.0, Lustre 2.6.0, Lustre 2.4.2, Lustre 2.5.2, Lustre 2.5.3 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Minor |
| Reporter: | Maloo | Assignee: | James Nunez (Inactive) |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Environment: |
server: 2.4.1 RHEL6 ldiskfs |
||
| Issue Links: |
|
||||||||
| Severity: | 3 | ||||||||
| Rank (Obsolete): | 11298 | ||||||||
| Description |
|
This issue was created by maloo for sarah <sarah@whamcloud.com> This issue relates to the following test suite run: http://maloo.whamcloud.com/test_sets/74730ca2-3eb9-11e3-a21b-52540035b04c. 22:57:37:MDclient-32vm3: [QUOTA WARNING] Usage inconsistent for ID 0:actual (1929216, 465) != expected (1835008, 457) 22:57:37:client-32vm3: [QUOTA WARNING] Usage inconsistent for ID 0:actual (1929216, 465) != expected (1835008, 457) 22:57:37:T: dirfid [0x2:0x0:0x0] child [0x47:0xd5e863cf:0x0] file oi.16.58 22:57:37:MDT: inode 72, file oi.16.59, type 1 22:57:38:MDT: dirfid [0x2:0x0:0x0] child [0x48:0xd5e863d0:0x0] file oi.16.59 22:57:38:MDT: inode 73, file oi.16.60, type 1 22:57:38:MDT: dirfid [0x2:0x0:0x0] child [0x49:0xd5e863d1:0x0] file oi.16.60 22:57:38:MDT: inode 74, file oi.16.61, type 1 22:57:39:MDT: dirfid [0x2:0x0:0x0] child [0x4a:0xd5e863d2:0x0] file oi.16.61 22:57:39:MDT: inode 75, file oi.16.62, type 1 22:57:39:MDT: dirfid [0x2:0x0:0x0] child [0x4b:0xd5e863d3:0x0] file oi.16.62 22:57:40:MDT: inode 76, file oi.16.63, type 1 22:57:40:MDT: dirfid [0x2:0x0:0x0] child [0x4c:0xd5e863d4:0x0] file oi.16.63 22:57:40:MDT: inode 524326, file NIDTBL_VERSIONS, type 2 22:57:40:MDT: inode 83, file last_rcvd, type 1 22:57:41:MDT: inode 84, file fld, type 1 22:57:41:MDT: inode 85, file seq_ctl, type 1 22:57:41:MDT: inode 86, file seq_srv, type 1 22:57:41:MDT: inode 32769, file quota_master, type 2 22:57:41:MDT: inode 32772, file quota_slave, type 2 22:57:42:MDT: inode 32773, file ROOT, type 2 22:57:42:MDT: inode 32776, file PENDING, type 2 22:57:42:MDT: inode 96, file changelog_catalog, type 1 22:57:42:MDT: inode 97, file changelog_users, type 1 22:57:44:MDT: inode 98, file lfsck_bookmark, type 1 22:57:46:MDT: inode 99, file lfsck_namespace, type 1 22:57:49:MDT: inode 104, file lov_objid, type 1 22:57:51:MDT: inode 105, file lov_objseq, type 1 22:57:53:MDT: inode 106, file CATALOGS, type 1 22:57:53:MDS: max_files = 466 22:57:53:MDS: num_osts = 7 22:57:53:MDS: 'lustre-MDT0000_UUID' mdt idx 0: compat 0xc rocomp 0x1 incomp 0x21c 22:57:54:Update quota info for quota type 0? no 22:57:55: 22:57:55:Update quota info for quota type 1? no 22:57:57: 22:57:57: 22:57:57:lustre-MDT0000: ********** WARNING: Filesystem still has errors ********** 22:57:57: 22:57:58: 22:57:58: 466 inodes used (0.04%, out of 1048576) 22:57:58: 12 non-contiguous files (2.6%) 22:57:59: 0 non-contiguous directories (0.0%) 22:57:59: # of inodes with ind/dind/tind blocks: 0/0/0 22:57:59: 154106 blocks used (29.39%, out of 524288) 22:57:59: 0 bad blocks 22:58:00: 1 large file 22:58:00: 22:58:00: 282 regular files 22:58:01: 183 directories 22:58:01: 0 character device files 22:58:01: 0 block device files 22:58:01: 0 fifos 22:58:01: 10 links 22:58:02: 0 symbolic links (0 fast symbolic links) 22:58:02: 0 sockets 22:58:03:------------ 22:58:03: 475 files 22:58:03:Memory used: 2672k/21180k (1018k/1655k), time: 0.82/ 0.07/ 0.03 22:58:04:I/O read: 4MB, write: 0MB, rate: 4.86MB/s 22:58:05: lfsck : @@@@@@ FAIL: e2fsck -d -v -t -t -f -n --mdsdb /home/autotest/.autotest/shared_dir/2013-10-25/141244-69842626388280/mdsdb /dev/mapper/lvm--MDS-P1 returned 4, should be <= 1 22:58:06: Trace dump: |
| Comments |
| Comment by Jian Yu [ 18/Nov/13 ] |
|
One more instance: |
| Comment by Jian Yu [ 18/Nov/13 ] |
|
Lustre master build: http://build.whamcloud.com/job/lustre-master/1764/ lfsck test also failed with the same issue: |
| Comment by Jian Yu [ 20/Nov/13 ] |
|
Lustre b2_4 build: http://build.whamcloud.com/job/lustre-b2_4/54/ lfsck test also failed with the same issue in manual test run: However, it passed in autotest run: |
| Comment by Jian Yu [ 23/Dec/13 ] |
|
Lustre Tag: 2.4.2 RC2 The same failure occurred: |
| Comment by Jian Yu [ 05/Jan/14 ] |
|
Lustre Build: http://build.whamcloud.com/job/lustre-b2_5/5/ The same failure occurred: |
| Comment by Jian Yu [ 10/Feb/14 ] |
|
More instance on Lustre b2_5 branch: |
| Comment by Jian Yu [ 16/Jun/14 ] |
|
Lustre build: http://build.whamcloud.com/job/lustre-b2_5/63/ (2.5.2 RC1) The same failure occurred: https://maloo.whamcloud.com/test_logs/b31a15b8-f2fa-11e3-a3d9-52540035b04c/show_text |
| Comment by Jian Yu [ 31/Aug/14 ] |
|
Lustre Build: https://build.hpdd.intel.com/job/lustre-b2_5/86/ (2.5.3 RC1) The same failure occurred: |
| Comment by Jian Yu [ 22/Oct/14 ] |
|
Do we have a method to resolve this issue? If it was fixed, then we would likely have full green test session. E.g., https://testing.hpdd.intel.com/test_sessions/9bd48638-4fa5-11e4-9a74-5254006e85c2 |
| Comment by Peter Jones [ 22/Oct/14 ] |
|
James Can you please look into why this is failing? Thanks Peter |
| Comment by Andreas Dilger [ 24/Oct/14 ] |
|
It would probably be best to just turn off the old lfsck test and focus on the new LFSCK. |
| Comment by Jian Yu [ 24/Oct/14 ] |
|
I just created TEI-2860 for this. |
| Comment by James Nunez (Inactive) [ 04/Dec/14 ] |
|
The patch for TEI-2860 "turn off lfsck test in full test group" has landed. |