Details
-
Bug
-
Resolution: Fixed
-
Major
-
Lustre 2.7.0
-
None
-
3
-
9223372036854775807
Description
In fld_client_rpc():
if (rc != 0) { if (imp->imp_state != LUSTRE_IMP_CLOSED && !imp->imp_deactive) { /* Since LWP is not replayable, so it will keep * trying unless umount happens, otherwise it would * cause unecessary failure of the application. */ ptlrpc_req_finished(req); rc = 0; goto again; } GOTO(out_req, rc); }
If the connection is broken, this function will run into an dead loop. I think we'd reshape the function somehow to make it interruptable, otherwise, if connection never being established, caller will stuck in this function forever.
Seems fld_update_from_controller() has similar problem.
Yang Sheng (yang.sheng@intel.com) uploaded a new patch: http://review.whamcloud.com/17041
Subject:
LU-7115fld: don't try again for no LWP deviceProject: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 4fb4d98b6432e2a9d2e0397599421a1c2032e51e