[LU-3751] disable OUT_PORTAL on OST for now Created: 13/Aug/13 Updated: 08/Jan/14 Resolved: 29/Aug/13 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.5.0 |
| Fix Version/s: | Lustre 2.5.0 |
| Type: | Bug | Priority: | Blocker |
| Reporter: | Di Wang | Assignee: | Di Wang |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | HB, dne | ||
| Issue Links: |
|
||||||||
| Severity: | 3 | ||||||||
| Rank (Obsolete): | 9674 | ||||||||
| Description |
|
Disable OUT_PORTAL on OST, otherwise it will confuse the receiver when MDT and OST are on the same node, which might make ll_ost_outxx handle CONNECT requests from MDTs. Then it would cause panic like Lustre: 13261:0:(client.c:1896:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1401492648/real 1401492648] req@ffff880044bf9000 x1469570233011472/t0(0) o1000->lustre-MDT0000-osp-MDT0001@0@lo:24/10 lens 8416/8416 e 0 to 1 dl 1401492655 ref 2 fl Rpc:X/0/ffffffff rc 0/-1 Call Trace: |
| Comments |
| Comment by Di Wang [ 13/Aug/13 ] |
| Comment by Alex Zhuravlev [ 15/Aug/13 ] |
|
I used the following: diff --git a/lustre/include/lustre/lustre_idl.h b/lustre/include/lustre/lustre_idl.h /* Portal 63 is reserved for the Cray Inc DVS - nic@cray.com, roe@cray.com, n8851@cray.com */ diff --git a/lustre/ost/ost_handler.c b/lustre/ost/ost_handler.c
|
| Comment by Di Wang [ 15/Aug/13 ] |
|
Alex, could you please push your patch to review and try to land it? |
| Comment by Andreas Dilger [ 23/Aug/13 ] |
|
What are the implications here for unified targets? Isn't the whole point of UT that the same RPC to the same portal will execute the same operation on an OSD? If the "OST OUT" has a different portal than the "MDT OUT", we will need to handle OST updates separately from MDT updates forever in the future, which doesn't make sense to me. My preference would be Di's patch that just disables this code for 2.5 (rather than changing the protocol forever in the future as Alex's patch does). However, disabling the OST OUT handler will cause problems for LFSCK Phase 2, which is supposed to be using this service for MDT->OST communications, though that will only become a problem in 2.6. Mike, there needs to be some way for OUT to handle RPCs for both MDT and OST devices. Is that part of your later UT patch series? |
| Comment by Alex Zhuravlev [ 23/Aug/13 ] |
|
I'm not saying that change is the right thing in the long term, but at least it gives me a way to develop stuff before UT is complete. |
| Comment by Mikhail Pershin [ 26/Aug/13 ] |
|
Problem should gone when MDT part of UT patch will be landed, no need to change protocol. Alex, could you base your changes on later patches in UT series? I expect it should work with http://review.whamcloud.com/6973 |
| Comment by Alex Zhuravlev [ 27/Aug/13 ] |
|
sure, I will do. |
| Comment by Jodi Levi (Inactive) [ 27/Aug/13 ] |
|
With Change, 7323 now landed to Master can this ticket be closed or is additional work needed? |
| Comment by Jodi Levi (Inactive) [ 29/Aug/13 ] |
|
The patch has landed to master. Let me know if more work is needed and I will reopen this ticket. |