Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-7650

ko2iblnd map_on_demand can't negotitate when page sizes are different between nodes.

Details

    • Bug
    • Resolution: Duplicate
    • Major
    • None
    • Lustre 2.8.0, Lustre 2.9.0
    • Power8 client nodes running Ubuntu 14.04 and Lustre servers running RHE6.6 with all running the latest pre-2.8 lustre stack.
    • 3
    • 9223372036854775807

    Description

      With the mlx5 driver support mostly complete I have been testing on our Power8 nodes the latest 2.8 stack plus some ko2iblnd patches to make it work. With the work from LU-5783 I can now set the peer_credits to 63 and using the patch from LU-7101 set the map_on_demand to 16 pages. The problem is that the Power8 clients can't connect to our servers which have map_on_demand set to 256 pages. The reason is the set test to validate the connection compares the page counts on each side. On Power8 16 pages equals 256 pages on x86 platforms so comparing page count is an invalid test.

      Attachments

        Issue Links

          Activity

            [LU-7650] ko2iblnd map_on_demand can't negotitate when page sizes are different between nodes.

            This is now a duplicate of LU-10157

            simmonsja James A Simmons added a comment - This is now a duplicate of LU-10157
            pjones Peter Jones added a comment -

            James

            Doug is focused on some of the upstream changes in the coming weeks so will not get to this in the 2.10 timeframe

            Peter

            pjones Peter Jones added a comment - James Doug is focused on some of the upstream changes in the coming weeks so will not get to this in the 2.10 timeframe Peter

            Doug do you have time to take another shot at this?

            simmonsja James A Simmons added a comment - Doug do you have time to take another shot at this?

            Is there any good "architecture" style document about this API? I don't find header files very useful in understanding "how" the API should be used.

            doug Doug Oucharek (Inactive) added a comment - Is there any good "architecture" style document about this API? I don't find header files very useful in understanding "how" the API should be used.

            Actually I just got a copy of the RHEL7.3 source and the new RDMA generic api is there. So the answer is the solution is ready for todays customers.

            simmonsja James A Simmons added a comment - Actually I just got a copy of the RHEL7.3 source and the new RDMA generic api is there. So the answer is the solution is ready for todays customers.

            James: how far back does the new RDMA RW api go (kernel version)? My understanding is the new api starts about Linux 4.6 which is very far out for current Lustre users. We need to band-aid ko2iblnd to keep going for the next few years.

            I agree that ko2iblnd is going to need a rewrite to stay relevant for the upstream kernel.

            doug Doug Oucharek (Inactive) added a comment - James: how far back does the new RDMA RW api go (kernel version)? My understanding is the new api starts about Linux 4.6 which is very far out for current Lustre users. We need to band-aid ko2iblnd to keep going for the next few years. I agree that ko2iblnd is going to need a rewrite to stay relevant for the upstream kernel.

            With the new RDMA RW api and map_on_demand being broken not only on Power8 but in general it looks like a big rewrite of the ko2ibllnd driver is needed

            simmonsja James A Simmons added a comment - With the new RDMA RW api and map_on_demand being broken not only on Power8 but in general it looks like a big rewrite of the ko2ibllnd driver is needed
            pjones Peter Jones added a comment -

            Yep. We'll need to take another run at this.

            pjones Peter Jones added a comment - Yep. We'll need to take another run at this.

            Appears support for Power8 broke things.

            simmonsja James A Simmons added a comment - Appears support for Power8 broke things.
            pjones Peter Jones added a comment -

            Landed for 2.9

            pjones Peter Jones added a comment - Landed for 2.9

            People

              ashehata Amir Shehata (Inactive)
              simmonsja James A Simmons
              Votes:
              0 Vote for this issue
              Watchers:
              12 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: