[LU-6553] Recurrence of LU-5299: obd_mount_server.c:1690:osd_start()) ASSERTION( obd ) failed Created: 01/May/15 Updated: 13/Oct/21 Resolved: 13/Oct/21 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.7.0, Lustre 2.5.4 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Major |
| Reporter: | Patrick Farrell (Inactive) | Assignee: | Bruno Faccini (Inactive) |
| Resolution: | Cannot Reproduce | Votes: | 0 |
| Labels: | None | ||
| Environment: |
Combined MGT/MDT, racing multiple mount commands. |
||
| Attachments: |
|
||||||||||||
| Issue Links: |
|
||||||||||||
| Severity: | 3 | ||||||||||||
| Rank (Obsolete): | 9223372036854775807 | ||||||||||||
| Description |
|
The patch for Specifically, the code which enables the combined MGT/MDT to start correctly also disables the race protection for a combined MGT/MDT. So racing multiple mount commands on a combined MGT/MDT can still cause this problem. I've taken a look, and I don't see any easy way to fix this in the current context. I can provide dumps if needed, and I'll attach a log now. Note the attempts to start MDT0000. There are five, four of which start after the first one but before it has completed. |
| Comments |
| Comment by Bruno Faccini (Inactive) [ 02/May/15 ] |
|
Hello Patrick, |
| Comment by Patrick Farrell (Inactive) [ 04/May/15 ] |
|
Thanks, Bruno - Good luck. I couldn't find an easy way to do it, but I expect you know this code much better than me. |
| Comment by Wally Wang (Inactive) [ 10/Sep/15 ] |
|
Hi Bruno, any progress on this one? Thanks. |
| Comment by Bruno Faccini (Inactive) [ 11/Feb/16 ] |
|
Patrick, Wally, |
| Comment by Patrick Farrell (Inactive) [ 11/Feb/16 ] |
|
Bruno - We don't have a specific reproducer. It actually turned out we were doing concurrent mounts because our failover stuff was misconfigured on an internal system. |
| Comment by Wally Wang (Inactive) [ 15/Mar/16 ] |
|
Bruno, 1. create and start a Lustre file system with mgt/mdt combo cat test_mount for ((i=0;i<5;i++));do ./test_mount & done; |
| Comment by Patrick Farrell (Inactive) [ 15/Mar/16 ] |
|
Thanks, Wally! |