[LU-9552] lustre uses multipath devices I/O errors Created: 24/May/17  Updated: 28/May/17  Resolved: 28/May/17

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.7.0
Fix Version/s: None

Type: Bug Priority: Critical
Reporter: haoguozhen Assignee: WC Triage
Resolution: Duplicate Votes: 0
Labels: multipath, patch
Environment:

CentOS Linux release 7.3.1611 (Core),OFED.3.4.2.0.0.1,lustre-2.7.19.8,Mellanox Technologies MT27500 Family


Attachments: Text File oss01.log    
Issue Links:
Duplicate
duplicates LU-9551 I/O errors when lustre uses multipath... Resolved
Epic/Theme: centos7.3, lustre-2.7.19.8
Severity: 3
Epic: server
Rank (Obsolete): 9223372036854775807

 Description   

When the lustre servers have OST configured with multipath devices, there are I/O errors that can lead to a server crash.
The following error appears in the system log:
Mar 31 00:02:44 oss01 kernel: blk_cloned_rq_check_limits: over max size limit.
Mar 31 00:02:44 oss01 kernel: device-mapper: multipath: Failing path 8:160.

Followed by several I/O errors
Mar 31 00:17:30 oss01 kernel: blk_update_request: I/O error, dev dm-17, sector 1182279680
Mar 31 00:17:30 oss01 kernel: blk_update_request: I/O error, dev dm-17, sector 1182291968
Mar 31 00:17:30 oss01 kernel: blk_update_request: I/O error, dev dm-17, sector 1182267392
Mar 31 00:17:30 oss01 kernel: blk_update_request: I/O error, dev dm-17, sector 1182304256
Mar 30 21:04:22 oss01 kernel: LDISKFS-fs (dm-17): Remounting filesystem read-only


Generated at Sat Feb 10 02:27:12 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.