NVTX#

The NVIDIA Tools Extension SDK (NVTX) is a cross-platform API for annotating source code to provide contextual information to developer tools. All device-scope algorithms in CUB are annotated with NVTX ranges, allowing their start and stop to be visualized in profilers like NVIDIA Nsight Systems. Only the public APIs available in the <cub/device/device_xxx.cuh> headers are annotated, excluding direct calls to the dispatch layer. NVTX annotations can be disabled by defining NVTX_DISABLE during compilation. When CUB device algorithms are called on a stream subject to graph capture, the NVTX range is reported for the duration of capture (where no execution happens), and not when a captured graph is executed later (the actual execution).