[LU-275] I/O errors when lustre uses multipath devices Created: 04/May/11 Updated: 14/Jun/11 Resolved: 13/Jun/11 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.0.0 |
| Fix Version/s: | Lustre 2.1.0 |
| Type: | Bug | Priority: | Major |
| Reporter: | Gregoire Pichon | Assignee: | Zhenyu Xu |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Environment: |
RHEL 6.0 GA, ofed1.5.2, Lustre 2.0.0.1, Mellanox QDR Ib cards |
||
| Attachments: |
|
| Severity: | 3 |
| Epic: | mount, multipath, tuning |
| Rank (Obsolete): | 4992 |
| Description |
|
When the lustre servers have their MDT or OST configured with multipath devices, there are I/O errors that can lead to a server crash. The following error appears in the system log: Followed by several I/O errors Here is the corresponding code in blk_rq_check_limits() routine: ... This error comes from the block device tuning performed by lustre.mount in set_blockdev_tunables() routine. The max_sectors_kb value of the multipath device (dm-10) is updated from 1024 to 32767 which is the value of max_hw_sectors_kb. However, the slave devices are not tuned, which leads to build block requests that cannot be handled by the slave devices. A workaround to this issue is to tune the slave devices max_sectors_kb value with its max_hw_sectors_kb value. In attachment is a patch of set_blockdev_tunables() routines to make it support multipath device case. |
| Comments |
| Comment by Peter Jones [ 05/May/11 ] |
|
Gregoire Could you please upload your suggested change as a patch in gerrit. Then we can look into testing and landing this change Thanks Peter |
| Comment by Zhenyu Xu [ 05/May/11 ] |
|
patch with minor change tracking at http://review.whamcloud.com/504 |
| Comment by Gregoire Pichon [ 06/May/11 ] |
|
Zhenyu, I have detected a bug in my patch proposal. thanks, |
| Comment by Zhenyu Xu [ 06/May/11 ] |
|
thanks, looks like globfree() should be called even if glob() returns non-zero value. |
| Comment by Build Master (Inactive) [ 13/Jun/11 ] |
|
Integrated in Oleg Drokin : 515fd66ef9443ad6d95ff23bd865eb7923ab6eb6
|
| Comment by Build Master (Inactive) [ 13/Jun/11 ] |
|
Integrated in Oleg Drokin : 515fd66ef9443ad6d95ff23bd865eb7923ab6eb6
|
| Comment by Build Master (Inactive) [ 13/Jun/11 ] |
|
Integrated in Oleg Drokin : 515fd66ef9443ad6d95ff23bd865eb7923ab6eb6
|
| Comment by Build Master (Inactive) [ 13/Jun/11 ] |
|
Integrated in Oleg Drokin : 515fd66ef9443ad6d95ff23bd865eb7923ab6eb6
|
| Comment by Build Master (Inactive) [ 13/Jun/11 ] |
|
Integrated in Oleg Drokin : 515fd66ef9443ad6d95ff23bd865eb7923ab6eb6
|
| Comment by Build Master (Inactive) [ 13/Jun/11 ] |
|
Integrated in Oleg Drokin : 515fd66ef9443ad6d95ff23bd865eb7923ab6eb6
|
| Comment by Build Master (Inactive) [ 13/Jun/11 ] |
|
Integrated in Oleg Drokin : 515fd66ef9443ad6d95ff23bd865eb7923ab6eb6
|
| Comment by Build Master (Inactive) [ 13/Jun/11 ] |
|
Integrated in Oleg Drokin : 515fd66ef9443ad6d95ff23bd865eb7923ab6eb6
|
| Comment by Build Master (Inactive) [ 13/Jun/11 ] |
|
Integrated in Oleg Drokin : 515fd66ef9443ad6d95ff23bd865eb7923ab6eb6
|
| Comment by Build Master (Inactive) [ 13/Jun/11 ] |
|
Integrated in Oleg Drokin : 515fd66ef9443ad6d95ff23bd865eb7923ab6eb6
|
| Comment by Build Master (Inactive) [ 13/Jun/11 ] |
|
Integrated in Oleg Drokin : 515fd66ef9443ad6d95ff23bd865eb7923ab6eb6
|
| Comment by Build Master (Inactive) [ 13/Jun/11 ] |
|
Integrated in Oleg Drokin : 515fd66ef9443ad6d95ff23bd865eb7923ab6eb6
|
| Comment by Build Master (Inactive) [ 13/Jun/11 ] |
|
Integrated in Oleg Drokin : 515fd66ef9443ad6d95ff23bd865eb7923ab6eb6
|
| Comment by Build Master (Inactive) [ 13/Jun/11 ] |
|
Integrated in Oleg Drokin : 515fd66ef9443ad6d95ff23bd865eb7923ab6eb6
|
| Comment by Zhenyu Xu [ 13/Jun/11 ] |
|
landed on master branch for 2.1.0 |
| Comment by Build Master (Inactive) [ 14/Jun/11 ] |
|
Integrated in Oleg Drokin : 515fd66ef9443ad6d95ff23bd865eb7923ab6eb6
|