[LU-10843] snapshot_create fails after MGS restart Created: 23/Mar/18 Updated: 13/Feb/23 Resolved: 17/May/18 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.10.3 |
| Fix Version/s: | Lustre 2.12.0, Lustre 2.10.5 |
| Type: | Bug | Priority: | Major |
| Reporter: | Nathaniel Clark | Assignee: | Nathaniel Clark |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None |
| Severity: | 3 |
| Rank (Obsolete): | 9223372036854775807 |
| Description |
|
If the MGS is unmounted and remounted (or failed over), lctl snapshot_create fails with the following error:

# lctl snapshot_create -F scratch -n test
Fail to freeze barrier for scratch: No such device
Can't create the snapshot test

snapshot_destroy still works without issue.

WORKAROUND: unmount and mount the MDT, after which snapshot_create works again.
NOTE: mount -o remount is insufficient. |
| Comments |
| Comment by Nathaniel Clark [ 23/Mar/18 ] |
|
After the MGS is remounted, FSNAME-barrier is missing from mgs->mgs_fs_db_list. It is only added during mgs_handler.c::mgs_target_reg(). |
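The failure mode described here can be illustrated with a small toy model. This is a sketch only, not the Lustre code: every name prefixed with `toy_` is hypothetical, the list is a plain array rather than the real mgs_fs_db_list, and the restart is simplified to dropping every in-memory entry. The -ENODEV return is an assumption chosen because it matches the "No such device" text in the reported error.

```c
#include <assert.h>
#include <errno.h>
#include <stdio.h>
#include <string.h>

#define TOY_MAX_DBS  16
#define TOY_NAME_LEN 64

/* Hypothetical stand-in for the MGS's in-memory database list. */
struct toy_mgs {
    char dbs[TOY_MAX_DBS][TOY_NAME_LEN];
    int  count;
};

/* Return the index of the named entry, or -1 if it is not in the list. */
int toy_find_db(const struct toy_mgs *mgs, const char *name)
{
    for (int i = 0; i < mgs->count; i++)
        if (strcmp(mgs->dbs[i], name) == 0)
            return i;
    return -1;
}

/* Add an entry if it is not already present. */
int toy_add_db(struct toy_mgs *mgs, const char *name)
{
    assert(name != NULL);
    if (toy_find_db(mgs, name) >= 0)
        return 0;
    if (mgs->count == TOY_MAX_DBS)
        return -ENOMEM;
    snprintf(mgs->dbs[mgs->count], TOY_NAME_LEN, "%s", name);
    mgs->count++;
    return 0;
}

/* Models target registration: in this toy, the only path that
 * creates the FSNAME-barrier entry. */
int toy_target_reg(struct toy_mgs *mgs, const char *fsname)
{
    char barrier[TOY_NAME_LEN];
    int rc = toy_add_db(mgs, fsname);

    if (rc != 0)
        return rc;
    snprintf(barrier, sizeof(barrier), "%s-barrier", fsname);
    return toy_add_db(mgs, barrier);
}

/* Models an MGS unmount/mount: the in-memory list starts out empty
 * (a simplification made for this sketch). */
void toy_mgs_restart(struct toy_mgs *mgs)
{
    mgs->count = 0;
}

/* Models the barrier freeze step of snapshot_create: it fails when
 * the FSNAME-barrier entry cannot be found. */
int toy_barrier_freeze(const struct toy_mgs *mgs, const char *fsname)
{
    char barrier[TOY_NAME_LEN];

    snprintf(barrier, sizeof(barrier), "%s-barrier", fsname);
    return toy_find_db(mgs, barrier) >= 0 ? 0 : -ENODEV;
}
```

In this model, calling toy_barrier_freeze() after toy_mgs_restart() returns -ENODEV until toy_target_reg() runs again. That would be consistent with the workaround in the description, assuming an MDT unmount/mount causes the target to register with the MGS again.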
| Comment by Gerrit Updater [ 26/Mar/18 ] |
|
Nathaniel Clark (nathaniel.l.clark@intel.com) uploaded a new patch: https://review.whamcloud.com/31779 |
| Comment by Nathaniel Clark [ 30/Mar/18 ] |
|
While debugging #31779, I used Wireshark to figure out when the MGS actually started accepting barrier commands. To that end, I've added barrier parsing to Wireshark: |
| Comment by Gerrit Updater [ 17/May/18 ] |
|
Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/31779/ |
| Comment by Peter Jones [ 17/May/18 ] |
|
Landed for 2.12 |
| Comment by Gerrit Updater [ 23/May/18 ] |
|
Minh Diep (minh.diep@intel.com) uploaded a new patch: https://review.whamcloud.com/32519 |
| Comment by Gerrit Updater [ 01/Aug/18 ] |
|
John L. Hammond (jhammond@whamcloud.com) merged in patch https://review.whamcloud.com/32519/ |