Details
- Bug
- Resolution: Cannot Reproduce
- Major
- None
- Lustre 1.8.6
- None
- CentOS 5.5 on Lustre servers, RHEL 6.1 on clients
- 3
- 4030
Description
The customer reported that a number of clients were evicted. All of the affected clients had difficulty communicating with the OSTs on a single OSS. Johann has looked at the client logs, but I did not have the server logs at the time. I now have the server logs and have attached them to this ticket. I need recommendations on how to prevent this from happening in the future.
Should I consider changing the OBD timeout from the default 100s?
Should I consider reducing the number of OST service threads (default 256)?
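For reference, a sketch of where these two settings are normally changed on 1.8.x, assuming the standard `lctl conf_param` and `ost` module-option mechanisms. The filesystem name `testfs` and both values are placeholders for illustration only, not recommendations for this site.

```
# On the MGS: set the OBD timeout persistently for the whole filesystem.
# "testfs" is a hypothetical filesystem name; 300 is a placeholder value.
lctl conf_param testfs.sys.timeout=300

# On each OSS, in /etc/modprobe.conf: set the number of OST service threads.
# 128 is a placeholder value; it applies the next time the ost module is loaded.
options ost oss_num_threads=128
```

Whether either change is appropriate depends on what the server logs show.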
Attachments
Issue Links
- Trackbacks
- Lustre 1.8.x known issues tracker
While testing against Lustre b18 branch, we would hit known bugs which were already reported in Lustre Bugzilla https://bugzilla.lustre.org/. In order to move away from relying on Bugzilla, we would create a JIRA