Details
-
Bug
-
Resolution: Fixed
-
Critical
-
Lustre 2.11.0, Lustre 2.10.2, Lustre 2.10.3
-
Soak performance cluster - Lustre version=2.10.2_4_gb151f34
-
3
-
9223372036854775807
Description
We do OSS failover, trigger LFSCK:
lctl lfsck_start -M soaked-MDT0000 -s 1000 -t all -A{code]
The lfsck start hangs, lfsck is not started, the clients wedge in state 'comp' the entire system wedges. I have dumped Lustre Logs from all MDS, attached. I have crash-dumped all the MDT nodes and the dumps are available on Spirit. lfsck_layout is unkillable.
John L. Hammond (jhammond@whamcloud.com) merged in patch https://review.whamcloud.com/30831/
Subject:
LU-10419lfsck: no delay for notify RPCProject: fs/lustre-release
Branch: b2_10
Current Patch Set:
Commit: 9fef9ad10b26a4338c22105e66308ead5408173e