Details

    • New Feature
    • Resolution: Won't Do
    • Major
    • GDS 1.0 + MOFED 5.3

    Description

      NVIDIA has released the final version of its GDS project.
      Let's add support for it in the Lustre code.

          Activity

            [LU-14795] NVidia GDS support in lustre

            shadow Alexey Lyashkov added a comment -

            The WC approach has already landed.

            shadow Alexey Lyashkov added a comment -

            Run the 'configure' script in the GDS sources; the configure output is removed after the DKMS build.
            Alternatively, run make nv_configure in /usr/src/nvidia-fs-2.7.50/
            (I used make all, but configure is enough).

            sihara Shuichi Ihara added a comment -

            Checked out the head of LU-14795 (https://review.whamcloud.com/#/c/44099), but the build failed as shown below.

            # sh ./autogen.sh ; ./configure --with-o2ib=/usr/src/ofa_kernel/default --disable-server; make debs
            - snip - 
            checking if Linux kernel was built with CONFIG_MODULES... yes
            checking if Linux kernel was built with CONFIG_MODVERSIONS... yes
            checking if Linux kernel was built with CONFIG_KALLSYMS... yes
            checking if Linux kernel module loading is possible... yes
            checking for /usr/src/nvidia-470.57.02/nvidia//nv-p2p.h... yes
            NVIDIA path is /usr/src/nvidia-470.57.02/nvidia/
            checking for /usr/src/nvidia-fs-2.7.50/nvfs-dma.h... yes
            checking for /usr/src/nvidia-fs-2.7.50/config-host.h... no
            configure: error: GDS sources don't ready for build. run configure please
            make: *** No rule to make target 'debs'.  Stop.

            Testing with GDS-1.0.1.3, which is part of the latest CUDA 11.4.1:

            # /usr/local/cuda-11.4/gds/tools/gdscheck -v
             GDS release version: 1.0.1.3
             nvidia_fs version:  2.7 libcufile version: 2.4
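The build log in this comment shows Lustre's configure probing the nvidia-fs source tree for nvfs-dma.h and config-host.h and stopping when the second file is absent. A minimal sketch of the same pre-check, which could be run before ./configure, might look like the following. The function name gds_src_ready and its return codes are hypothetical and are not part of Lustre or nvidia-fs; only the two file names come from the log.

```shell
#!/bin/sh
# Hypothetical helper (not part of Lustre or nvidia-fs): report whether an
# nvidia-fs source directory contains the two files that the Lustre configure
# log checks for.
#   returns 0 and prints "ready" when both files exist
#   returns 1 when nvfs-dma.h is missing
#   returns 2 when config-host.h is missing (the tree has not been configured)
gds_src_ready() {
    if [ ! -f "$1/nvfs-dma.h" ]; then
        echo "missing $1/nvfs-dma.h" >&2
        return 1
    fi
    if [ ! -f "$1/config-host.h" ]; then
        echo "missing $1/config-host.h: run configure in the GDS sources" >&2
        return 2
    fi
    echo "ready"
}
```

For example, gds_src_ready /usr/src/nvidia-fs-2.7.50 would return 2 for the tree in the log above, since config-host.h was reported as absent there.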
            spitzcor Cory Spitz added a comment -

            This ticket "pre-dates" LU-14798.


            gerrit Gerrit Updater added a comment -

            Alexey Lyashkov (alexey.lyashkov@hpe.com) uploaded a new patch: https://review.whamcloud.com/44099
            Subject: LU-14795 lnet: add GPU <> IB device affinity handing.
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 31e16fcdf48de3e59bd0156f9e7918d854380082

            gerrit Gerrit Updater added a comment -

            Alexey Lyashkov (alexey.lyashkov@hpe.com) uploaded a new patch: https://review.whamcloud.com/44098
            Subject: LU-14795 lnet: LNet GDS support
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 0013ef246d0c8b5e506cdf31a5a9879f3962fe6e

            gerrit Gerrit Updater added a comment -

            Alexey Lyashkov (alexey.lyashkov@hpe.com) uploaded a new patch: https://review.whamcloud.com/44097
            Subject: LU-14795 ptlrpc: GDS bulk transfer support
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 647ce2dfab23de67b4b20b3187911db3419ff4ff

            gerrit Gerrit Updater added a comment -

            Alexey Lyashkov (alexey.lyashkov@hpe.com) uploaded a new patch: https://review.whamcloud.com/44096
            Subject: LU-14795 llite: detect an GPU transfer
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 632892ba1c00f44c48a9125bd364e81cbff7dd5a

            gerrit Gerrit Updater added a comment -

            Alexey Lyashkov (alexey.lyashkov@hpe.com) uploaded a new patch: https://review.whamcloud.com/44095
            Subject: LU-14795 lnet: export additional info
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 002f3c65b57aacf57bc23ca1a8fc4553ad0cdbe3

            gerrit Gerrit Updater added a comment -

            Alexey Lyashkov (alexey.lyashkov@hpe.com) uploaded a new patch: https://review.whamcloud.com/44094
            Subject: LU-14795 lnet: add GDS configure
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: d108819dbbfb58a980f2323906e38f510401c1d1

            People

              shadow Alexey Lyashkov
              shadow Alexey Lyashkov