[LU-6840] update memory reply data in DNE update replay Created: 13/Jul/15 Updated: 02/Sep/15 Resolved: 28/Aug/15 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.8.0 |
| Fix Version/s: | Lustre 2.8.0 |
| Type: | Bug | Priority: | Blocker |
| Reporter: | Di Wang | Assignee: | Di Wang |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Issue Links: |
|
||||||||||||||||||||||||||||
| Severity: | 3 | ||||||||||||||||||||||||||||
| Rank (Obsolete): | 9223372036854775807 | ||||||||||||||||||||||||||||
| Description |
|
DNE update replay might update last_rcvd file, but because it will be operated on OSD/OSP directly, so the memory structure (ted/lrd etc) will not be updated by update replay handler, so it needs update these memory structure by itself after each update replay. (see replay_request_or_update() ). In current implementation, this is done in target_update_lcd(), but multiple slot patch just changed this process, so target_update_lcd needs to be fixed as well. |
| Comments |
| Comment by Gerrit Updater [ 13/Jul/15 ] |
|
wangdi (di.wang@intel.com) uploaded a new patch: http://review.whamcloud.com/15576 |
| Comment by Gregoire Pichon [ 20/Jul/15 ] |
|
I am not sure to understand the context, but the metadata operations operated on OSD/OSP should not make use of the multiple slot patch. At the moment (lustre 2.8) multiple slot feature is only supported for MDT exports that have the OBD_CONNECT_MULTIMODRPCS flag, that is to say exports for MDC. This flag should not be set on exports for OSP. |
| Comment by Di Wang [ 20/Jul/15 ] |
|
For cross-MDT operation, the updates (including update last_rcvd and reply_data) will be recorded on all of MDTs. So during the recovery, once the updates on the master MDT are missing, then these updates needs to be redo on the master MDT, which means these last_rcvd and reply_data file needs to be updated as well in this process, and those memory stuff (lcd etc) needs to be updated as well, which is normally through the trans stop callback. In current implementation, due to the speciality of the DNE recovery and recent changes of multiple slot, stop callback can not be called correctly, which is this patch trying to resolve. |
| Comment by Gerrit Updater [ 28/Aug/15 ] |
|
Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/15576/ |
| Comment by Joseph Gmitter (Inactive) [ 28/Aug/15 ] |
|
Landed for 2.8. |