[LU-13667] ptlrpc_pinger_main is stuck in endless loop Created: 12/Jun/20 Updated: 17/Oct/20 Resolved: 11/Jul/20 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | Lustre 2.14.0, Lustre 2.12.6 |
| Type: | Bug | Priority: | Minor |
| Reporter: | Hongchao Zhang | Assignee: | Hongchao Zhang |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | llnl | ||
| Issue Links: |
|
||||
| Severity: | 3 | ||||
| Rank (Obsolete): | 9223372036854775807 | ||||
| Description |
|
In ptlrpc_pinger_main, the process of the pingable imports or obd_update_maxusage |
| Comments |
| Comment by Gerrit Updater [ 12/Jun/20 ] |
|
Hongchao Zhang (hongchao@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/38915 |
| Comment by Olaf Faaland [ 02/Jul/20 ] |
|
I believe we hit this today on a Lustre 2.12.4 client. The pinger was taking 100% of a core. Over 498 seconds, the "next wakeup in" message appeared in the debug log 76,716 times, and the time_to_next_wake started at -41,852 and ended at -42,350 (getting more and more negative with time). |
| Comment by Olaf Faaland [ 02/Jul/20 ] |
|
Please let me know whether you agree my described symptoms match this issue, thanks. |
| Comment by Hongchao Zhang [ 02/Jul/20 ] |
|
Yes, it should be the same issue with this ticket. |
| Comment by Olaf Faaland [ 02/Jul/20 ] |
|
Thank you. This should go into b2_12 after it's merged to master. |
| Comment by Gerrit Updater [ 10/Jul/20 ] |
|
Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/38915/ |
| Comment by Peter Jones [ 11/Jul/20 ] |
|
Landed for 2.14 |
| Comment by Gerrit Updater [ 13/Jul/20 ] |
|
Minh Diep (mdiep@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/39344 |
| Comment by Gerrit Updater [ 07/Aug/20 ] |
|
Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/39344/ |