[LU-4169] lfsck: FAIL: e2fsck returned 4, should be <= 1 Created: 28/Oct/13  Updated: 04/Dec/14  Resolved: 04/Dec/14

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.4.1, Lustre 2.5.0, Lustre 2.6.0, Lustre 2.4.2, Lustre 2.5.2, Lustre 2.5.3
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: James Nunez (Inactive)
Resolution: Fixed Votes: 0
Labels: None
Environment:

server: 2.4.1 RHEL6 ldiskfs
client: lustre-b2_5 build #2 RHEL6 ldiskfs


Issue Links:
Duplicate
is duplicated by LU-5190 lfsck: FAIL: e2fsck returned 4, shoul... Resolved
Severity: 3
Rank (Obsolete): 11298

 Description   

This issue was created by maloo for sarah <sarah@whamcloud.com>

This issue relates to the following test suite run: http://maloo.whamcloud.com/test_sets/74730ca2-3eb9-11e3-a21b-52540035b04c.

22:57:37:MDclient-32vm3: [QUOTA WARNING] Usage inconsistent for ID 0:actual (1929216, 465) != expected (1835008, 457)
22:57:37:client-32vm3: [QUOTA WARNING] Usage inconsistent for ID 0:actual (1929216, 465) != expected (1835008, 457)
22:57:37:T: dirfid [0x2:0x0:0x0] child [0x47:0xd5e863cf:0x0] file oi.16.58
22:57:37:MDT: inode 72, file oi.16.59, type 1
22:57:38:MDT: dirfid [0x2:0x0:0x0] child [0x48:0xd5e863d0:0x0] file oi.16.59
22:57:38:MDT: inode 73, file oi.16.60, type 1
22:57:38:MDT: dirfid [0x2:0x0:0x0] child [0x49:0xd5e863d1:0x0] file oi.16.60
22:57:38:MDT: inode 74, file oi.16.61, type 1
22:57:39:MDT: dirfid [0x2:0x0:0x0] child [0x4a:0xd5e863d2:0x0] file oi.16.61
22:57:39:MDT: inode 75, file oi.16.62, type 1
22:57:39:MDT: dirfid [0x2:0x0:0x0] child [0x4b:0xd5e863d3:0x0] file oi.16.62
22:57:40:MDT: inode 76, file oi.16.63, type 1
22:57:40:MDT: dirfid [0x2:0x0:0x0] child [0x4c:0xd5e863d4:0x0] file oi.16.63
22:57:40:MDT: inode 524326, file NIDTBL_VERSIONS, type 2
22:57:40:MDT: inode 83, file last_rcvd, type 1
22:57:41:MDT: inode 84, file fld, type 1
22:57:41:MDT: inode 85, file seq_ctl, type 1
22:57:41:MDT: inode 86, file seq_srv, type 1
22:57:41:MDT: inode 32769, file quota_master, type 2
22:57:41:MDT: inode 32772, file quota_slave, type 2
22:57:42:MDT: inode 32773, file ROOT, type 2
22:57:42:MDT: inode 32776, file PENDING, type 2
22:57:42:MDT: inode 96, file changelog_catalog, type 1
22:57:42:MDT: inode 97, file changelog_users, type 1
22:57:44:MDT: inode 98, file lfsck_bookmark, type 1
22:57:46:MDT: inode 99, file lfsck_namespace, type 1
22:57:49:MDT: inode 104, file lov_objid, type 1
22:57:51:MDT: inode 105, file lov_objseq, type 1
22:57:53:MDT: inode 106, file CATALOGS, type 1
22:57:53:MDS: max_files = 466
22:57:53:MDS: num_osts = 7
22:57:53:MDS: 'lustre-MDT0000_UUID' mdt idx 0: compat 0xc rocomp 0x1 incomp 0x21c
22:57:54:Update quota info for quota type 0? no
22:57:55:
22:57:55:Update quota info for quota type 1? no
22:57:57:
22:57:57:
22:57:57:lustre-MDT0000: ********** WARNING: Filesystem still has errors **********
22:57:57:
22:57:58:
22:57:58:         466 inodes used (0.04%, out of 1048576)
22:57:58:          12 non-contiguous files (2.6%)
22:57:59:           0 non-contiguous directories (0.0%)
22:57:59:             # of inodes with ind/dind/tind blocks: 0/0/0
22:57:59:      154106 blocks used (29.39%, out of 524288)
22:57:59:           0 bad blocks
22:58:00:           1 large file
22:58:00:
22:58:00:         282 regular files
22:58:01:         183 directories
22:58:01:           0 character device files
22:58:01:           0 block device files
22:58:01:           0 fifos
22:58:01:          10 links
22:58:02:           0 symbolic links (0 fast symbolic links)
22:58:02:           0 sockets
22:58:03:------------
22:58:03:         475 files
22:58:03:Memory used: 2672k/21180k (1018k/1655k), time:  0.82/ 0.07/ 0.03
22:58:04:I/O read: 4MB, write: 0MB, rate: 4.86MB/s
22:58:05: lfsck : @@@@@@ FAIL: e2fsck -d -v -t -t -f -n --mdsdb /home/autotest/.autotest/shared_dir/2013-10-25/141244-69842626388280/mdsdb /dev/mapper/lvm--MDS-P1 returned 4, should be <= 1 
22:58:06:  Trace dump:


 Comments   
Comment by Jian Yu [ 18/Nov/13 ]

One more instance:
https://maloo.whamcloud.com/test_sets/ca44b5ac-4c17-11e3-bea4-52540035b04c

Comment by Jian Yu [ 18/Nov/13 ]

Lustre master build: http://build.whamcloud.com/job/lustre-master/1764/
Distro/Arch: RHEL6.4/x86_64

lfsck test also failed with the same issue:
https://maloo.whamcloud.com/test_sets/29cde8e0-5064-11e3-9f14-52540035b04c

Comment by Jian Yu [ 20/Nov/13 ]

Lustre b2_4 build: http://build.whamcloud.com/job/lustre-b2_4/54/
Distro/Arch: RHEL6.4/x86_64

lfsck test also failed with the same issue in manual test run:
https://maloo.whamcloud.com/test_sets/4dedb6a4-51e7-11e3-9472-52540035b04c

However, it passed in autotest run:
https://maloo.whamcloud.com/test_sets/4e293922-5192-11e3-ad30-52540035b04c

Comment by Jian Yu [ 23/Dec/13 ]

Lustre Tag: 2.4.2 RC2
Lustre Client: CentOS 6.5/x86_64 (kernel version: 2.6.32-431.1.2.0.1.el6.x86_64)
Lustre Server: CentOS 6.4/x86_64 (kernel version: 2.6.32-358.23.2.el6_lustre.x86_64)

The same failure occurred:
https://maloo.whamcloud.com/test_sets/c7d10f5c-6b82-11e3-91a4-52540035b04c

Comment by Jian Yu [ 05/Jan/14 ]

Lustre Build: http://build.whamcloud.com/job/lustre-b2_5/5/
Lustre Client: CentOS 6.5/x86_64
Lustre Server: CentOS 6.4/x86_64

The same failure occurred:
https://maloo.whamcloud.com/test_sets/d22a5f7c-75d1-11e3-a081-52540035b04c

Comment by Jian Yu [ 10/Feb/14 ]

More instance on Lustre b2_5 branch:
https://maloo.whamcloud.com/test_sets/f29451fe-9053-11e3-8d88-52540035b04c

Comment by Jian Yu [ 16/Jun/14 ]

Lustre build: http://build.whamcloud.com/job/lustre-b2_5/63/ (2.5.2 RC1)

The same failure occurred: https://maloo.whamcloud.com/test_logs/b31a15b8-f2fa-11e3-a3d9-52540035b04c/show_text

Comment by Jian Yu [ 31/Aug/14 ]

Lustre Build: https://build.hpdd.intel.com/job/lustre-b2_5/86/ (2.5.3 RC1)

The same failure occurred:
https://testing.hpdd.intel.com/test_sets/c3284f52-30a7-11e4-9f57-5254006e85c2
https://testing.hpdd.intel.com/test_sets/1b5e504e-3086-11e4-a3d9-5254006e85c2
https://testing.hpdd.intel.com/test_sets/baaa142c-307b-11e4-9e60-5254006e85c2
https://testing.hpdd.intel.com/test_sets/3f915928-30bf-11e4-9e60-5254006e85c2

Comment by Jian Yu [ 22/Oct/14 ]

Do we have a method to resolve this issue? If it was fixed, then we would likely have full green test session.

E.g., https://testing.hpdd.intel.com/test_sessions/9bd48638-4fa5-11e4-9a74-5254006e85c2

Comment by Peter Jones [ 22/Oct/14 ]

James

Can you please look into why this is failing?

Thanks

Peter

Comment by Andreas Dilger [ 24/Oct/14 ]

It would probably be best to just turn off the old lfsck test and focus on the new LFSCK.

Comment by Jian Yu [ 24/Oct/14 ]

I just created TEI-2860 for this.

Comment by James Nunez (Inactive) [ 04/Dec/14 ]

The patch for TEI-2860 "turn off lfsck test in full test group" has landed.

Generated at Sat Feb 10 01:40:17 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.