[LU-350] Port bug 24050 to master ("lustre_start" caused client nodes to fail to mount) Created: 20/May/11 Updated: 18/Aug/11 Resolved: 18/Aug/11 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.1.0 |
| Fix Version/s: | Lustre 2.1.0 |
| Type: | Bug | Priority: | Minor |
| Reporter: | Hongchao Zhang | Assignee: | Hongchao Zhang |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Severity: | 3 |
| Bugzilla ID: | 24050 |
| Rank (Obsolete): | 4927 |
| Description |
|
For a newly formatted Lustre filesystem, the MDT must be started before the OSTs; otherwise clients cannot mount the filesystem. |
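The ordering constraint described above can be illustrated with a minimal sketch that sorts a list of targets into MGS, MDT, OST order before a first start. This is not the lustre_start code; the `target_type` enum, the `target` struct, and `order_for_first_start` are hypothetical names introduced only for illustration.

```c
#include <stdlib.h>

/* Hypothetical target kinds, numbered in the order they should be
 * started on a newly formatted filesystem: MGS, then MDT, then OST. */
enum target_type { TARGET_MGS = 0, TARGET_MDT = 1, TARGET_OST = 2 };

/* Hypothetical description of one server target. */
struct target {
    const char *name;
    enum target_type type;
};

/* qsort comparator: a lower enum value means the target starts earlier. */
static int cmp_first_start(const void *a, const void *b)
{
    const struct target *ta = a;
    const struct target *tb = b;

    return (int)ta->type - (int)tb->type;
}

/* Reorder targets in place so that the MGS comes first, MDTs next and
 * OSTs last. The relative order of targets of the same type is not
 * guaranteed, because qsort is not a stable sort. */
static void order_for_first_start(struct target *targets, size_t count)
{
    qsort(targets, count, sizeof(targets[0]), cmp_first_start);
}
```

A caller would fill an array of `struct target`, call `order_for_first_start`, and then start the targets in array order.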
| Comments |
| Comment by Brian Murrell (Inactive) [ 20/May/11 ] |
|
So, is the bug here the wrong order being used in lustre_start or is the actual bug the requirement that the MDT be mounted first for a newly formatted filesystem? To be sure, one has always been able to mount the servers in any order (for non-newly formatted filesystems at least) but I don't recall if that flexibility also applies/applied to newly formatted filesystems. We probably need somebody with a clearer picture (and/or memory) than I to chime in with an opinion on that in order to determine which is the real bug here. |
| Comment by Peter Jones [ 07/Jun/11 ] |
| Comment by Hongchao Zhang [ 14/Jun/11 ] |
|
Pasting the comments from Gerrit here for future reference:

Oleg Drokin: Huh? I think the entire point of starting the MGS first and then the OSTs was so that the MDS will start and be able to connect to all OSTs. Why would the MDT erase any config on the MGS at all? That sounds like a different problem that needs to be solved differently, I think.

Yu Jian: I also doubted this in bug 24050 comments #30, #48, #51 and #57, and got a partial answer from Johann in comment #55. I think the fix to the lustre_start utility was just a workaround that does not fix the underlying Lustre issue. Lustre really did support the MGS->OST->MDT starting order before. So, Hongchao, could you please investigate this? We need to figure out the real issue.

Oleg Drokin: The comment in Bugzilla indicates that this order only needs to happen on the first mount, which I guess I can believe, even though it is still somewhat strange that the first MDT connect would wipe config data. Perhaps we just need to incorporate a real mount (in MGS->MDT->OST order) in our formatting scripts instead? |
| Comment by Hongchao Zhang [ 21/Jun/11 ] |
|
This problem was introduced by the patch in bug 22464, which adds the "writeconf" option when formatting the disk. That patch only causes the problem in b1_8. However, when I try to start Lustre in MGS-OST-MDS order on master, it still fails, because there are bugs in "mgs_steal_llog_for_mdt_from_client". The problem is:

diff --git a/lustre/mgs/mgs_llog.c b/lustre/mgs/mgs_llog.c
+ sscanf(lustre_cfg_buf(lcfg, 2), "%d", &tmti->mti_stripe_index); |
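The added line in the diff fragment reads a decimal index out of a config buffer with `sscanf`. A minimal standalone sketch of that parsing pattern follows, assuming the buffer holds a NUL-terminated decimal string. The `target_info` struct and `parse_stripe_index` are hypothetical stand-ins, not the Lustre structures or the actual patch, and the return-value check is an addition of this sketch.

```c
#include <stdio.h>

/* Hypothetical stand-in for the target info structure; only the one
 * field relevant to this sketch is modeled. */
struct target_info {
    int stripe_index;
};

/*
 * Parse a decimal index from a NUL-terminated config buffer.
 * Returns 0 on success, or -1 if an argument is NULL or the buffer does
 * not start with a decimal integer. *ti is left untouched on failure.
 */
static int parse_stripe_index(const char *buf, struct target_info *ti)
{
    int index;

    if (buf == NULL || ti == NULL)
        return -1;

    /* sscanf returns the number of successful conversions, so anything
     * other than 1 means no integer was read. */
    if (sscanf(buf, "%d", &index) != 1)
        return -1;

    ti->stripe_index = index;
    return 0;
}
```

Checking the `sscanf` return value keeps a malformed or empty buffer from silently leaving the index unset.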
| Comment by Build Master (Inactive) [ 08/Aug/11 ] |
|
Integrated in Oleg Drokin : 3227436e11b4bc77ffd261e8f13adf905fae2353
|