Setup and Installation

This guide walks you through installing and setting up the CUDA Python Core Libraries (CCCL).

Prerequisites

Before installing cuda-cccl, ensure you have:

Python 3.9 or later
CUDA Toolkit 12.x
Compatible NVIDIA GPU with Compute Capability 6.0 or higher
Operating Systems: Linux (tested on Ubuntu 20.04+) or Windows 10/11 (with WSL2 support)

Installation

Install from PyPI

The easiest way to install cuda-cccl is using pip:

pip install cuda-cccl

This will install cuda-cccl along with all required dependencies including:

Install from Source

For development or to access the latest features:

git clone https://github.com/NVIDIA/cccl.git
cd cccl/python/cuda_cccl
pip install -e .

Development Setup

For contributing to cuda-cccl or advanced development:

# Clone the repository
git clone https://github.com/NVIDIA/cccl.git
cd cccl/python/cuda_cccl

# Install in development mode with test dependencies
pip install -e .[test]

# Run tests to verify everything works
pytest tests/

Next Steps

Now that you have cuda-cccl installed, check out:

parallel: Device-Level Parallel Algorithms - Device-level parallel algorithms
cooperative: Cooperative Algorithms - Block and warp-level cooperative primitives