Memory Resources

The <cuda/memory_resource> header provides a standard C++ interface for heterogeneous, stream-ordered memory allocation tailored to the needs of CUDA C++ developers. This design builds off of the success of the RAPIDS Memory Manager (RMM) project and evolves the design based on lessons learned.

<cuda/memory_resource> is not intended to replace RMM, but instead moves the definition of the memory allocation interface to a more centralized home in CCCL. RMM will remain as a collection of implementations of the cuda::mr interfaces.

We are still experimenting with the design, so for now the contents of <cuda/memory_resource> are only available if LIBCUDACXX_ENABLE_EXPERIMENTAL_MEMORY_RESOURCE is defined.

At a high level, the header provides:

cuda::get_property

Infrastructure to tag a user defined type with a given property

CCCL 2.2.0 / CUDA 12.3

cuda::mr::{async}_resource and cuda::mr::{async}_resource_with

Concepts that provide proper constraints for arbitrary memory resources.

CCCL 2.2.0 / CUDA 12.3

cuda::mr::{async}_resource_ref

A type-erased memory resource wrapper that enables consumers to specify properties of resources that they expect.

CCCL 2.2.0 / CUDA 12.3

These features are an evolution of std::pmr::memory_resource that was introduced in C++17. While std::pmr::memory_resource provides a polymorphic memory resource that can be adopted through inheritance, it is not properly suited for heterogeneous systems.

With the current design it ranges from cumbersome to impossible to verify whether a memory resource provides allocations that are e.g. accessible on device, or whether it can utilize other allocation mechanisms.

To better support asynchronous CUDA stream-ordered allocations libcu++ provides cuda::stream_ref as a wrapper around cudaStream_t. The definition of cuda::stream_ref can be found in the <cuda/stream_ref> header.