Details
-
Bug
-
Resolution: Fixed
-
Critical
-
Lustre 1.8.8
-
3
-
3998
Description
lnd_cb.c:558:kgnilnd_setup_phys_buffer()) failed to allocate tx_phys [2012-04-07 02:08:24][c5-0c0s5n2]LNet: 29099:0:(gnilnd_cb.c:1068:kgnilnd_tx_done()) $$ error -12 on tx 0xffff88000fe06b40-><?> id 0/0 state GNILND_TX_ALLOCD age 17481575s msg@0xffff88000fe06bc0 m/v/ty/ck/pck/pl b00fbabe/8/3/0/78db/0 x0:GNILND_MSG_PUT_REQ [2012-04-07 02:08:24][c5-0c0s5n2]LustreError: 29099:0:(events.c:198:client_bulk_callback()) event type 0, status -5, desc ffff880627c24000
The error is detected on both client and server; the server expects the client to retry but it doesn't. In the mean time, the OSS issues a lock callback to the client, but the client does not respond because it is waiting for the I/O to complete. Eventually the OSS evicts the client. Lustre does not retry the bulk op when it detects the error.
Attachments
Issue Links
- Trackbacks
-
Lustre 1.8.x known issues tracker While testing against Lustre b18 branch, we would hit known bugs which were already reported in Lustre Bugzilla https://bugzilla.lustre.org/. In order to move away from relying on Bugzilla, we would create a JIRA
Activity
Fix Version/s | New: Lustre 1.8.9 [ 10204 ] | |
Fix Version/s | Original: Lustre 1.8.x (1.8.0 - 1.8.5) [ 10086 ] |
Fix Version/s | New: Lustre 2.4.0 [ 10154 ] | |
Fix Version/s | New: Lustre 2.1.4 [ 10158 ] | |
Fix Version/s | New: Lustre 1.8.x (1.8.0 - 1.8.5) [ 10086 ] | |
Resolution | New: Fixed [ 1 ] | |
Status | Original: Open [ 1 ] | New: Resolved [ 5 ] |
Ok it seems the patches for 1.8,2.3 and Master has been merged. 2.2 and 2.3 are dead branches at this point.
I think this issue is safe to close. Please reopen if you disagree.