[LU-13455] HSM client disconnected and fail to reconnect to server Created: 15/Apr/20  Updated: 14/May/20  Resolved: 14/May/20

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: Lustre 2.14.0

Type: Bug Priority: Minor
Reporter: Andriy Skulysh Assignee: Andriy Skulysh
Resolution: Fixed Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   
[4354891.599839] Lustre: 14762:0:(client.c:2150:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1575163728/real 1575163728]  req@ffff9e568f3bb900 x1647812376
751168/t0(0) o59->snx11133-MDT0000-mdc-ffff9e787a482000@10.156.8.4@o2ib:12/10 lens 456/224 e 0 to 1 dl 1575164273 ref 2 fl Rpc:X/0/ffffffff rc 0/-1
Stack : 
__schedule
schedule
schedule_timeout
ptlrpc_set_wait
ptlrpc_queue_wait
mdc_ioc_hsm_ct_register
mdc_hsm_ct_reregister
libcfs_kkuc_group_foreach
mdc_import_event
ptlrpc_activate_import
ptlrpc_import_recovery_state_machine
ptlrpc_connect_interpret
ptlrpc_check_set
ptlrpc_check_set
ptlrpcd_check
ptlrpcd
kthread
Progs:  14762 "ptlrpcd_rcv"


 Comments   
Comment by Gerrit Updater [ 15/Apr/20 ]

Andriy Skulysh (c17819@cray.com) uploaded a new patch: https://review.whamcloud.com/38243
Subject: LU-13455 ptlrpc: connect to MDT stucks
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 4c77a360c3c29760ef8e9121c083f69afcd7cc2c

Comment by Gerrit Updater [ 14/May/20 ]

Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/38243/
Subject: LU-13455 ptlrpc: connect to MDT stucks
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 3d58403e62b7b2de32f76c7bdd6224325ab333bc

Comment by Peter Jones [ 14/May/20 ]

Landed for 2.14

Generated at Sat Feb 10 03:01:25 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.