[LU-3098] failure to auto trigger paused LFSCK causes mount to fail on slave MDTs Created: 03/Apr/13  Updated: 22/Apr/13  Resolved: 22/Apr/13

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.4.0
Fix Version/s: Lustre 2.4.0

Type: Bug Priority: Minor
Reporter: John Hammond Assignee: WC Triage
Resolution: Fixed Votes: 0
Labels: lfsck, mdt

Severity: 3
Rank (Obsolete): 7521

 Description   

After 7e915f5d, in mdt_prepare() if the OBD_IOC_START_LFSCK ioctl() fails and mdt->mdt_seq_site.ss_node_id != 0 then the MDT will fail to mount.

static int mdt_prepare(const struct lu_env *env,
                struct lu_device *pdev,
                struct lu_device *cdev)
{
	...

	rc = mdt->mdt_child->md_ops->mdo_iocontrol(env, mdt->mdt_child,
                                                   OBD_IOC_START_LFSCK,
                                                   0, NULL);
        if (rc != 0)
                CWARN("Fail to auto trigger paused LFSCK.\n");

	if (mdt->mdt_seq_site.ss_node_id == 0) {
                rc = mdt->mdt_child->md_ops->mdo_root_get(env, mdt->mdt_child, &mdt->mdt_md_root_fid);
                if (rc)
                        RETURN(rc);
        }

        ...
}

If this is really the intended behavior (I assume not) then the CWARN() should be upgraded to CERROR().



 Comments   
Comment by John Hammond [ 03/Apr/13 ]

Please see http://review.whamcloud.com/5931.

Comment by Jodi Levi (Inactive) [ 22/Apr/13 ]

Patch landed. If more work is needed let me know and I'll reopen this ticket.

Generated at Sat Feb 10 01:30:57 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.