Details
-
Bug
-
Resolution: Cannot Reproduce
-
Major
-
None
-
None
-
Sequoia login node. PPC64. 48 core Power 7. Lustre version [2.4.0-RC2_11chaos|https://github.com/chaos/lustre/tree/2.4.0-RC2_11chaos]
-
3
-
9945
Description
Using Lustre version 2.4.0-RC2_11chaos, we hit the following assertion on a login node (lustre client):
osc_cache.c:1267:osc_completion()) ASSERTION( (!!(page->cp_state == CPS_PAGEOUT) == !!(cmd == 0x02) ) failed:
The backtrace looks roughly like this:
osc_ap_compltion osc_extent_finish brw_interpret ptlrpc_check_set ptlrpcd_check ptlrpcd
There was not much going on in the client's log. About one minute earlier an OSC lost connection to one of the OSTs and then reconnected. But that is a fairly common occurance. It is not immediately obvious to me that there is a connection to this assertion.
This client is a Power7 login node with 48 cores and 64GB of RAM.