Details
-
Bug
-
Resolution: Fixed
-
Major
-
None
-
None
-
3
-
Orion
-
3019
Description
Observed when manually restarting zfs servers, specifically:
1) umount /mnt/lustre/server
2) export zpool
3) import zpool
4) mount zpool/server /mnt/lustre/server
There is some suspicion that we may be rolling back to a previous uberblock due to how things were shutdown but that's just a suspicion at the moment until we have more data.
2012-05-17 11:24:31 LustreError: 11-0: lstest-OST02e6-osc-ffff88063d2a2000: Communicating with 172.20.4.142@o2ib500, operation obd_ping failed with -107 2012-05-17 11:24:31 Lustre: lstest-OST02ef-osc-ffff88063d2a2000: Connection to lstest-OST02ef (at 172.20.4.151@o2ib500) was lost; in progress operations using this service will wait for recovery to complete 2012-05-17 11:24:31 Lustre: Skipped 19 previous similar messages 2012-05-17 11:24:31 LustreError: Skipped 27 previous similar messages 2012-05-17 11:24:53 Lustre: lstest-OST02b8-osc-ffff88063d2a2000: Connection to lstest-OST02b8 (at 172.20.4.96@o2ib500) was lost; in progress operations using this service will wait for recovery to complete 2012-05-17 11:24:53 Lustre: Skipped 30 previous similar messages 2012-05-17 11:24:53 LustreError: 23000:0:(import.c:930:ptlrpc_connect_interpret()) lstest-OST029a_UUID went back in time (transno 4315649123 was previously committed, server now claims 4295433349)! See https://bugzilla.lustre.org/show_bug.cgi?id=9646 2012-05-17 11:24:53 LustreError: 23000:0:(import.c:930:ptlrpc_connect_interpret()) Skipped 11 previous similar messages 2012-05-17 11:24:57 Lustre: lstest-OST0286-osc-ffff88063d2a2000: Connection restored to lstest-OST0286 (at 172.20.4.46@o2ib500) 2012-05-17 11:24:57 Lustre: Skipped 12 previous similar messages 2012-05-17 11:25:02 Lustre: lstest-OST02e9-osc-ffff88063d2a2000: Connection restored to lstest-OST02e9 (at 172.20.4.145@o2ib500) 2012-05-17 11:25:47 Lustre: lstest-OST02e3-osc-ffff88063d2a2000: Connection restored to lstest-OST02e3 (at 172.20.4.139@o2ib500) 2012-05-17 11:25:47 Lustre: Skipped 25 previous similar messages 2012-05-17 11:25:57 Lustre: lstest-OST028d-osc-ffff88063d2a2000: Connection restored to lstest-OST028d (at 172.20.4.53@o2ib500) 2012-05-17 11:25:57 Lustre: Skipped 3 previous similar messages 2012-05-17 11:26:11 LustreError: 23000:0:(import.c:930:ptlrpc_connect_interpret()) lstest-OST02e4_UUID went back in time (transno 4320876109 was previously committed, server now claims 4296801416)! See https://bugzilla.lustre.org/show_bug.cgi?id=9646 2012-05-17 11:26:11 LustreError: 23000:0:(import.c:930:ptlrpc_connect_interpret()) Skipped 67 previous similar messages
Attachments
Activity
Fix Version/s | Original: Lustre 2.4.0 [ 10154 ] |
Resolution | New: Fixed [ 1 ] | |
Status | Original: Open [ 1 ] | New: Resolved [ 5 ] |
Fix Version/s | New: Lustre 2.4.0 [ 10154 ] | |
Fix Version/s | Original: Sprint 17 (Cherokee) [ 10121 ] | |
Key |
Original:
|
New:
|
Project | Original: Orion [ 10051 ] | New: Lustre [ 10000 ] |
Rank (Obsolete) | Original: 6570000000 | |
Release Version History | Original: Sprint 16 (Saline Valley) [ 10120 ] | |
Workflow | Original: jira [ 14555 ] | New: Sub-task Blocking [ 16399 ] |
Labels | New: topsequoia |
Fix Version/s | New: Sprint 17 (Cherokee) [ 10121 ] | |
Fix Version/s | Original: Sprint 16 (Saline Valley) [ 10120 ] |
Fix Version/s | New: Sprint 16 (Saline Valley) [ 10120 ] |
The fix from patch 3519 was already landed to master as part of another patch.