Details
Bug
Resolution: Fixed
Critical
Lustre 2.8.0
3
9223372036854775807
Description
In DNE failover tests, we found that an MDT is being evicted by another MDT because of a pinger timeout.
Lustre: soaked-MDT0007: haven't heard from client soaked-MDT0004-mdtlov_UUID (at 192.168.1.110@o2ib10) in 230 seconds. I think it's dead, and I am evicting it. exp ffff88080c87d800, cur 1456388330 expire 1456388180 last 1456388100
Lustre: soaked-MDT0006: haven't heard from client soaked-MDT0004-mdtlov_UUID (at 192.168.1.110@o2ib10) in 232 seconds. I think it's dead, and I am evicting it. exp ffff88078f6f3c00, cur 1456388331 expire 1456388181 last 1456388099
This should not happen. It looks like we should not add MDS-MDS exports to the obd_chain_timed list. I will cook up a patch soon.
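
To make the idea concrete, here is a rough, self-contained sketch in C. It is not the actual patch and does not use the real Lustre structures: `struct export`, `struct timed_list`, `enum peer_kind`, `export_update_timer()` and `count_evictable()` are all hypothetical names invented for illustration. It only models the behaviour proposed above, assuming the export can tell whether its peer is another MDS: such exports still record their last request time but are never queued on the timed list, so the ping evictor never considers them.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <time.h>

/* Hypothetical, simplified stand-ins; NOT the real Lustre structures. */
enum peer_kind { PEER_CLIENT, PEER_MDS };

struct export {
    enum peer_kind kind;        /* who is on the other end of the export */
    time_t last_request;        /* corresponds to "last" in the log line */
    bool on_timed_list;         /* true once queued for ping eviction */
    struct export *next_timed;  /* singly linked timed list */
};

struct timed_list {
    struct export *head;
};

/*
 * Record activity on an export. Only exports from ordinary clients are
 * put on the timed list; MDS-MDS exports are skipped, which is the
 * change proposed in this ticket (modelled here, not the real code).
 */
static void export_update_timer(struct timed_list *list,
                                struct export *exp, time_t now)
{
    assert(list != NULL && exp != NULL);

    exp->last_request = now;
    if (exp->kind == PEER_MDS)
        return; /* never queue MDS-MDS exports for ping eviction */

    if (!exp->on_timed_list) {
        exp->next_timed = list->head;
        list->head = exp;
        exp->on_timed_list = true;
    }
}

/*
 * Count the exports the evictor would drop at time `now`, given a
 * timeout in seconds (the timeout value is an assumption of the caller).
 */
static size_t count_evictable(const struct timed_list *list,
                              time_t now, time_t timeout)
{
    size_t n = 0;

    for (const struct export *e = list->head; e != NULL; e = e->next_timed)
        if (now - e->last_request > timeout)
            n++;
    return n;
}
```

With the timestamps from the first log line (last 1456388100, cur 1456388330, i.e. 230 seconds of silence), an MDS-MDS export handled this way would never be counted by `count_evictable()`, whereas a regular client export that had been silent for the same period would be.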