CUDA Core Compute Libraries
Setup
Requirements
Getting libcu++
Building & Testing libcu++