[LU-8172] match 1 length 172 too big: 160 left, 160 allowed Created: 19/May/16 Updated: 24/May/16 Resolved: 24/May/16 |
|
| Status: | Closed |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.7.0 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Major |
| Reporter: | Mahmoud Hanafi | Assignee: | Doug Oucharek (Inactive) |
| Resolution: | Not a Bug | Votes: | 0 |
| Labels: | None | ||
| Environment: |
2.7.1 with |
||
| Severity: | 3 |
| Rank (Obsolete): | 9223372036854775807 |
| Description |
|
Trying to run simple lnet_selftest failed between 2 nodes with this error 00000400:00020000:13.0F:1463699033.377705:0:8764:0:(lib-ptl.c:190:lnet_try_match_md()) Matching packet from 12345-10.151.27.60@o2ib, match 1 length 172 too big: 160 left, 160 allowed 00000400:00000100:13.0:1463699033.420950:0:8764:0:(lib-move.c:1479:lnet_parse_put()) Dropping PUT from 12345-10.151.27.60@o2ib portal 51 match 1 offset 0 length 172: 4 It works within the same node. |
| Comments |
| Comment by Jay Lan (Inactive) [ 20/May/16 ] |
|
I rebased our nas-2.7.1-fe git repo with b2_7_fe on 5/12. There are some LU patches we carry not in b2_7_fe:
The git repo can be accessed at github. Peter Jones's account has access to it. |
| Comment by Peter Jones [ 20/May/16 ] |
|
Doug Could you please advise? Thanks Peter |
| Comment by Doug Oucharek (Inactive) [ 20/May/16 ] |
|
Is the exact same build running on all the nodes involved in the self test? My interpretation of the error is that the message size we are anticipating is 12 bytes less than the amount which was sent. |
| Comment by Mahmoud Hanafi [ 21/May/16 ] |
|
Are there any issue with running 2.7.1+ |
| Comment by Mahmoud Hanafi [ 21/May/16 ] |
|
There is incompatibility of lst between lu-3322 builds and non lu-3322. This was the issue I was having Should lst work with a build without |
| Comment by Jay Lan (Inactive) [ 21/May/16 ] |
|
Mahmoud, our nas-2.7.1-4.1nasS server and nas-2.7.1-4nasC client both have |
| Comment by Mahmoud Hanafi [ 21/May/16 ] |
|
after trying out different version it looks like " |
| Comment by Peter Jones [ 21/May/16 ] |
|
Mahmoud I recall that this patch was quite different between the two versions. Do you have the appropriate version on both releases? Does everything work fine if you remove it altogether? Peter |
| Comment by Jay Lan (Inactive) [ 24/May/16 ] |
|
Peter, We have correct versions of the |
| Comment by Doug Oucharek (Inactive) [ 24/May/16 ] |
|
The 3 new stats I added to the debug patch for As well as LNet, the debug patches for |
| Comment by Mahmoud Hanafi [ 24/May/16 ] |
|
It does look like this was the main issue and lst didn't match across all sides. You can close this issue. |