Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-5639

Message is hashed to invalid match-table of LNet request portal

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Fixed
    • Icon: Blocker Blocker
    • Lustre 2.7.0, Lustre 2.5.4
    • Lustre 2.5.3
    • None
    • 3
    • 15789

      When new message arrived at LNet request portal, if there is no receiving buffer (ME/MD) in match-table of the partition that message arrived in, either because service does not post any buffer in that partition, or because server is under memory pressure, LNet will increase a integer (rotor) then hash this message to another match-table within a different partition which has receiving buffers. This rotor should be unsigned when hashing, otherwise we will refer to illegal address when the integer is overflowed.

      This issue should be rare because Lustre services will always post enough request buffers

            liang Liang Zhen (Inactive)
            liang Liang Zhen (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

              Created:
              Updated:
              Resolved: