Lustre / LU-5639

Message is hashed to invalid match-table of LNet request portal

Details

    • Type: Bug
    • Resolution: Fixed
    • Priority: Blocker
    • Fix Version/s: Lustre 2.7.0, Lustre 2.5.4
    • Affects Version/s: Lustre 2.5.3
    • Labels: None
    • 3
    • 15789

    Description

      When a new message arrives at an LNet request portal and the match-table of the partition it arrived in has no receive buffer (ME/MD), either because the service has not posted any buffer in that partition or because the server is under memory pressure, LNet increments an integer (the rotor) and uses it to hash the message to the match-table of a different partition that does have receive buffers. The rotor should be unsigned when hashing; otherwise, once the integer overflows, the hash can select an invalid match-table and LNet will reference an illegal address.

      This issue should be rare, because Lustre services normally post enough request buffers.
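
      The signedness problem described above can be illustrated with a minimal C sketch. This is not the LNet source: the helper names `pick_partition_signed` and `pick_partition_unsigned` and the `nparts` parameter are hypothetical, and the sketch assumes the rotor is reduced to a table index with the `%` operator. A negative value is passed in directly to stand in for a rotor that has already wrapped, since actually overflowing a signed integer is undefined behavior in C.

```c
#include <assert.h>
#include <limits.h>

/*
 * Hypothetical helper, not LNet code: reduce a SIGNED rotor to a
 * partition index. In C99 and later, % truncates toward zero, so a
 * negative rotor produces a negative (out-of-range) index.
 */
static int pick_partition_signed(int rotor, int nparts)
{
    assert(nparts > 0);
    return rotor % nparts;
}

/*
 * Hypothetical helper, not LNet code: the same reduction with an
 * UNSIGNED rotor. Unsigned arithmetic wraps modulo 2^N, so the
 * result always lies in [0, nparts).
 */
static unsigned int pick_partition_unsigned(unsigned int rotor,
                                            unsigned int nparts)
{
    assert(nparts > 0);
    return rotor % nparts;
}
```

      With four partitions, `pick_partition_signed(INT_MIN + 1, 4)` evaluates to -3, which would index before the start of a four-entry table, while `pick_partition_unsigned(UINT_MAX, 4)` evaluates to 3 and the next rotor value wraps cleanly to index 0.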

            People

              Assignee: Liang Zhen (Inactive)
              Reporter: Liang Zhen (Inactive)
              Votes: 0
              Watchers: 5
