gdrcopy: Fast CPU-GPU memory copy library based on Nvidia GPUDirect RDMA
github.com·6h·
Discuss: Hacker News
Flag this post

GDRCopy

A low-latency GPU memory copy library based on NVIDIA GPUDirect RDMA technology.

Introduction

While GPUDirect RDMA is meant for direct access to GPU memory from third-party devices, it is possible to use these same APIs to create perfectly valid CPU mappings of the GPU memory.

The advantage of a CPU driven copy is the very small overhead involved. That might be useful when low latencies are required.

What is inside

GDRCopy offers the infrastructure to create user-space mappings of GPU memory, which can then be manipulated as if it was plain host memory (caveats apply here).

A simple by-product of it is a copy library with the following characteristics:

very low overhead, as it is driven by the CPU. As a reference, currently a cudaMemcpy can incur in a 6-…

Similar Posts

Loading similar posts...