[LU-4154] lfsck fails in DNE mode
Created: 28/Oct/13  Updated: 14/Feb/14  Resolved: 20/Jan/14

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.4.1, Lustre 2.5.0, Lustre 2.6.0, Lustre 2.4.2
Fix Version/s: Lustre 2.6.0, Lustre 2.5.1

Type: Bug
Priority: Minor
Reporter: Maloo
Assignee: Emoly Liu
Resolution: Fixed
Votes: 0
Labels: dne, mn4
Environment:

client and server: lustre-b2_5 build #2 RHEL6 ldiskfs


Issue Links:
Duplicate
is duplicated by LU-4255 Failure on test suite lfsck: CMD not ... Closed
Severity: 3
Rank (Obsolete): 11277

 Description   

This issue was created by maloo for sarah <sarah@whamcloud.com>

This issue relates to the following test suite run: http://maloo.whamcloud.com/test_sets/babe2fbe-3f2e-11e3-b5b4-52540035b04c.

The suite log shows:

"CMD: wtm-75,wtm-76,wtm-87,wtm-88,wtm-89 PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/utils/gss:/usr/lib64/lustre/utils:/usr/lib64/openmpi/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/root/bin::/sbin:/bin:/usr/sbin: NAME=sarah-quota-dne-2mds sh rpc.sh check_logdir /home/sarah/test_logs 
 lfsck : @@@@@@ FAIL: CMD is not supported "


 Comments   
Comment by Andreas Dilger [ 29/Oct/13 ]

The old lfsck code is not going to be updated to handle DNE filesystems. Instead, generate_db() should be updated so that if there are multiple MDTs, e2fsck is still run but returns an error, now that http://review.whamcloud.com/7532 has landed.

Comment by Emoly Liu [ 04/Nov/13 ]

Does that mean that when running the old lfsck on a DNE filesystem, we only generate the mdsdb for the master MDS, and return an error when generating the ostdb?

Comment by Andreas Dilger [ 04/Nov/13 ]

Ideally there would also be an error when generating the mdsdb, but yes, your statement is correct. The goal is that the old e2fsck/lfsck tool is not allowed to run on a DNE filesystem, so that it cannot corrupt the filesystem or delete all of the files on the non-zero MDTs.
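
The behavior agreed on in the two comments above can be illustrated roughly as follows. This is only a sketch of the decision logic, not the actual generate_db() code from the Lustre test framework or the landed patch; the names `DbPlan` and `plan_generate_db` are hypothetical and exist only for this illustration. It assumes the master MDT is index 0 and models the confirmed behavior (mdsdb for the master MDT only, error at the ostdb step), not the "ideal" variant that would also fail at the mdsdb step.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class DbPlan:
    """Hypothetical summary of what the old lfsck test would do."""
    mdsdb_mdt_indices: List[int]  # MDTs for which an mdsdb is generated
    ostdb_allowed: bool           # whether ostdb generation may proceed
    error: Optional[str]          # error reported to the caller, if any


def plan_generate_db(mdt_count: int) -> DbPlan:
    """Decide how the old lfsck database generation should behave.

    Assumption: MDT index 0 is the master MDT.
    """
    if mdt_count < 1:
        raise ValueError("a filesystem needs at least one MDT")
    if mdt_count == 1:
        # Single-MDT filesystem: the old lfsck is supported as before.
        return DbPlan([0], True, None)
    # DNE filesystem: only the master MDT gets an mdsdb, and the ostdb
    # step reports an error so the old lfsck never runs against it.
    return DbPlan(
        [0],
        False,
        f"old lfsck does not support DNE ({mdt_count} MDTs)",
    )


if __name__ == "__main__":
    for count in (1, 4):
        print(count, plan_generate_db(count))
```

The point of returning an error instead of silently skipping is the one stated above: the old tool must refuse to run rather than risk deleting files that live on the non-zero MDTs.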

Comment by Emoly Liu [ 07/Nov/13 ]

Patch tracking at: http://review.whamcloud.com/8206

Comment by Jian Yu [ 17/Dec/13 ]

Lustre Build: http://build.whamcloud.com/job/lustre-b2_4/67/
MDSCOUNT=4

The same failure occurred:
https://maloo.whamcloud.com/test_sets/3245ab52-66a8-11e3-ab63-52540035b04c

Comment by Jian Yu [ 19/Dec/13 ]

Lustre Build: http://build.whamcloud.com/job/lustre-b2_4/69/ (2.4.2 RC1)
MDSCOUNT=4

The same failure occurred:
https://maloo.whamcloud.com/test_sets/5ce9b330-6874-11e3-a9a3-52540035b04c

Comment by Jian Yu [ 04/Jan/14 ]

Lustre Build: http://build.whamcloud.com/job/lustre-b2_5/5/
Distro/Arch: RHEL6.4/x86_64
MDSCOUNT=2

The same failure occurred:
https://maloo.whamcloud.com/test_sets/15acdebe-7497-11e3-8b21-52540035b04c

Comment by Emoly Liu [ 20/Jan/14 ]

Patch landed for 2.6.

Comment by Jian Yu [ 07/Feb/14 ]

Here is the back-ported patch for the Lustre b2_5 branch: http://review.whamcloud.com/9176
