[LU-844] LBUG: ASSERTION(iobuf->dr_npages < iobuf->dr_max_pages) when run obdfilter_survey using rsz >= 2M Created: 14/Nov/11 Updated: 04/Jan/12 Resolved: 04/Jan/12 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | Lustre 2.2.0 |
| Type: | Bug | Priority: | Critical |
| Reporter: | Sean Xu (Inactive) | Assignee: | Zhenyu Xu |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Environment: |
On linux 2.6.18-238.12.1 |
||
| Attachments: |
|
| Severity: | 4 |
| Epic: | performance |
| Rank (Obsolete): | 4783 |
| Description |
|
Ran obdfilter_survey(lustre-iokit-1.2-200709210921) for rsz>=2M against local disk using command:
I hit an LBUG:
Nov 11 13:24:01 oss0 kernel: LustreError: 12359:0:(filter_io_26.c:293:filter_iobuf_add_page()) ASSERTION(iobuf->dr_npages < iobuf->dr_max_pages) failed
I tested with Lustre 1.8.5, 1.8.6-wc1, 1.8.7-wc1 and 2.1; all show exactly the same problem. On 1.8.5 and 2.1 in particular it also causes a kernel panic. The problem occurs on both physical and virtual machines. Regardless of the other parameters, as soon as I run obdfilter_survey with rsz larger than 1M, the problem occurs immediately. |
| Comments |
| Comment by Cliff White (Inactive) [ 15/Nov/11 ] |
|
I will see if I can replicate. Lustre does not use IO larger than 1M, so the 2M size may be unsupported. obdfilter-survey is now part of the main Lustre package, and should be installed in /usr/bin/obdfilter-survey. Can you re-try with that version, just for verification? We don't support the stand-alone version, and haven't since 2007 (hence the old date on the tarball) |
| Comment by Sean Xu (Inactive) [ 15/Nov/11 ] |
|
Thanks for the information. I tested using /usr/bin/obdfilter-survey on Lustre-1.8.6-wc1 and 2.1. rsz=2M still triggered the same LBUG, and a kernel panic on Lustre 2.1. |
| Comment by Cliff White (Inactive) [ 15/Nov/11 ] |
|
I can confirm that it LBUGs on a test machine here also. However, max Lustre IO size is 1M, so this is either a bug |
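A rough sketch of the page arithmetic behind the assertion, for illustration only. It assumes 4 KiB pages and that `dr_max_pages` corresponds to the number of pages in a 1M I/O; neither value is stated in this ticket, and `pages_for_rsz` is a hypothetical helper name.

```shell
#!/bin/sh
# Hypothetical helper: pages needed for one I/O of rsz_kb kilobytes.
# page_kb defaults to 4 (4 KiB pages, an assumption not taken from the ticket).
pages_for_rsz() {
    rsz_kb=$1
    page_kb=${2:-4}
    # Round up so a partial page still counts as a full page.
    echo $(( (rsz_kb + page_kb - 1) / page_kb ))
}

# Pages in a 1M I/O; assumed here to match the iobuf's dr_max_pages.
max_pages=$(pages_for_rsz 1024)

for rsz in 1024 2048; do
    need=$(pages_for_rsz "$rsz")
    if [ "$need" -gt "$max_pages" ]; then
        echo "rsz=${rsz}K needs $need pages, limit $max_pages: would overflow"
    else
        echo "rsz=${rsz}K needs $need pages, limit $max_pages: fits"
    fi
done
```

Under these assumptions a 2048K record needs twice as many pages as the buffer can hold, which is consistent with the ASSERTION(iobuf->dr_npages < iobuf->dr_max_pages) failure reported above.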
| Comment by Peter Jones [ 16/Nov/11 ] |
|
Bobi, could you please help with this one? Thanks, Peter |
| Comment by Zhenyu Xu [ 16/Nov/11 ] |
|
Which environment variables did you set? In particular, what rszhi did you set? |
| Comment by Sean Xu (Inactive) [ 17/Nov/11 ] |
|
I set rszlo=2048 and rszhi=2048 to run obdfilter-survey:
$ nobjhi=2 thrhi=2 rszlo=2048 rszhi=2048 size=10240 case=disk sh /usr/bin/obdfilter-survey
It triggered the LBUG immediately. |
| Comment by Zhenyu Xu [ 17/Nov/11 ] |
|
I think it's an obdfilter-survey issue: the "disk" case interacts with the filter module directly, which only supports 1M of I/O data per request. Please use an rszhi no larger than 1024KB with the "disk" test case. I'll come up with an obdfilter-survey patch to enforce the limit. |
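A minimal sketch of what such a limit could look like in a shell wrapper. This is not the patch tracked in this ticket: `clamp_rsz`, `MAX_DISK_RSZ_KB`, and `survey_case` are hypothetical names, and the 1024K ceiling is taken from the comment above. Only the `case`, `rszlo`, and `rszhi` environment variables come from the thread.

```shell
#!/bin/sh
# Hypothetical guard; the real obdfilter-survey patch may look different.
MAX_DISK_RSZ_KB=1024   # 1M limit for case=disk, per the comment above

clamp_rsz() {
    # $1 = test case name, $2 = requested record size in KB
    if [ "$1" = "disk" ] && [ "$2" -gt "$MAX_DISK_RSZ_KB" ]; then
        # Warn on stderr so the clamped value is the only thing on stdout.
        echo "rsz ${2}K exceeds the ${MAX_DISK_RSZ_KB}K limit for case=disk; clamping" >&2
        echo "$MAX_DISK_RSZ_KB"
    else
        echo "$2"
    fi
}

# Read the same environment variables used in the thread, with fallbacks.
survey_case=${case:-disk}
rszlo=$(clamp_rsz "$survey_case" "${rszlo:-1024}")
rszhi=$(clamp_rsz "$survey_case" "${rszhi:-1024}")
echo "case=$survey_case rszlo=$rszlo rszhi=$rszhi"
```

Clamping with a warning keeps a long survey running. Refusing to start with an error would be an equally reasonable choice, and the thread does not say which one the patch takes.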
| Comment by Sean Xu (Inactive) [ 18/Nov/11 ] |
|
Alright, thanks. |
| Comment by Zhenyu Xu [ 25/Nov/11 ] |
|
patch tracking at http://review.whamcloud.com/1741 |
| Comment by Build Master (Inactive) [ 04/Jan/12 ] |
|
Integrated in Result = SUCCESS
|
| Comment by Peter Jones [ 04/Jan/12 ] |
|
Landed for 2.2 |