Details
Improvement
Resolution: Unresolved
Major
Description
LNet's neterr logs are turned off by default. I have been told this is because they are very noisy. Logically, if messages are logged that frequently, they describe normal operation rather than errors; if they were real errors, we should be fixing them.
The big problem is that when a networking issue occurs in the field, we have little to nothing in the logs to go on. Debugging either requires that the problem be easy to reproduce with neterr turned on (not usually the case for production errors) or becomes a discipline of the mind (i.e. guesswork).
This ticket is for cleaning up the neterr logs so that they report only true errors, which would allow neterr logging to be on by default.