NVTX
The NVIDIA Tools Extension SDK (NVTX) is a cross-platform API
for annotating source code to provide contextual information to developer tools.
All device-scope algorithms in CUB are annotated with NVTX ranges,
allowing their start and stop to be visualized in profilers
like NVIDIA Nsight Systems.
Only the public APIs in the <cub/device/device_xxx.cuh>
headers are annotated;
direct calls to the dispatch layer are not.
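
As an illustration, the sketch below calls one such public API, cub::DeviceReduce::Sum from <cub/device/device_reduce.cuh>. The helper name sum_on_device and the input values are hypothetical, and error checking is omitted for brevity. When the program runs under a profiler such as NVIDIA Nsight Systems, the call is expected to show up as an NVTX range.

```cuda
#include <cub/device/device_reduce.cuh>
#include <cuda_runtime.h>

#include <cstdio>
#include <vector>

// Hypothetical helper (not part of CUB): sums h_in on the GPU through the
// public cub::DeviceReduce::Sum entry point. Error checking is omitted.
int sum_on_device(const std::vector<int>& h_in)
{
  const int num_items = static_cast<int>(h_in.size());

  int* d_in  = nullptr;
  int* d_out = nullptr;
  cudaMalloc(&d_in, num_items * sizeof(int));
  cudaMalloc(&d_out, sizeof(int));
  cudaMemcpy(d_in, h_in.data(), num_items * sizeof(int), cudaMemcpyHostToDevice);

  // First call: d_temp_storage is null, so only the required size is returned.
  void* d_temp_storage      = nullptr;
  size_t temp_storage_bytes = 0;
  cub::DeviceReduce::Sum(d_temp_storage, temp_storage_bytes, d_in, d_out, num_items);
  cudaMalloc(&d_temp_storage, temp_storage_bytes);

  // Second call: performs the reduction through the annotated public API.
  cub::DeviceReduce::Sum(d_temp_storage, temp_storage_bytes, d_in, d_out, num_items);

  int h_out = 0;
  cudaMemcpy(&h_out, d_out, sizeof(int), cudaMemcpyDeviceToHost);

  cudaFree(d_temp_storage);
  cudaFree(d_out);
  cudaFree(d_in);
  return h_out;
}

int main()
{
  const std::vector<int> data{1, 2, 3, 4};
  std::printf("sum = %d\n", sum_on_device(data));
  return 0;
}
```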
NVTX annotations can be disabled by defining NVTX_DISABLE
during compilation.
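
A minimal sketch of disabling the annotations for a single translation unit follows. It assumes that defining NVTX_DISABLE at the top of the source file, before the first CUB header is included, has the same effect as passing -DNVTX_DISABLE on the compiler command line. The helper name exclusive_sum_on_device is hypothetical, and error checking is omitted.

```cuda
// Assumption: this definition precedes every CUB include in the translation
// unit, which should be equivalent to compiling with -DNVTX_DISABLE.
#define NVTX_DISABLE

#include <cub/device/device_scan.cuh>
#include <cuda_runtime.h>

#include <cstdio>
#include <vector>

// Hypothetical helper (not part of CUB): exclusive prefix sum of h_in.
std::vector<int> exclusive_sum_on_device(const std::vector<int>& h_in)
{
  const int num_items = static_cast<int>(h_in.size());

  int* d_in  = nullptr;
  int* d_out = nullptr;
  cudaMalloc(&d_in, num_items * sizeof(int));
  cudaMalloc(&d_out, num_items * sizeof(int));
  cudaMemcpy(d_in, h_in.data(), num_items * sizeof(int), cudaMemcpyHostToDevice);

  // Size query followed by the actual scan; the algorithm still runs,
  // only the NVTX annotation is compiled out.
  void* d_temp_storage      = nullptr;
  size_t temp_storage_bytes = 0;
  cub::DeviceScan::ExclusiveSum(d_temp_storage, temp_storage_bytes, d_in, d_out, num_items);
  cudaMalloc(&d_temp_storage, temp_storage_bytes);
  cub::DeviceScan::ExclusiveSum(d_temp_storage, temp_storage_bytes, d_in, d_out, num_items);

  std::vector<int> h_out(h_in.size());
  cudaMemcpy(h_out.data(), d_out, num_items * sizeof(int), cudaMemcpyDeviceToHost);

  cudaFree(d_temp_storage);
  cudaFree(d_out);
  cudaFree(d_in);
  return h_out;
}

int main()
{
  for (int v : exclusive_sum_on_device({1, 2, 3, 4}))
  {
    std::printf("%d ", v);
  }
  std::printf("\n");
  return 0;
}
```

Defining the macro for the whole build through the compiler flag avoids any dependence on include order.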
When CUB device algorithms are called on a stream that is undergoing
graph capture,
the NVTX range covers the capture itself (during which nothing executes),
not the later launch of the captured graph (when the work actually runs).
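
The sketch below illustrates the graph capture case with cub::DeviceReduce::Sum. It assumes a CUDA toolkit that provides cudaGraphInstantiateWithFlags, and it queries and allocates the temporary storage before capture begins so that no allocation happens while the stream is capturing. The helper name sum_via_graph is hypothetical, and error checking is omitted.

```cuda
#include <cub/device/device_reduce.cuh>
#include <cuda_runtime.h>

#include <cstdio>
#include <vector>

// Hypothetical helper (not part of CUB): captures a reduction into a CUDA
// graph, launches the graph once, and returns the result.
int sum_via_graph(const std::vector<int>& h_in)
{
  const int num_items = static_cast<int>(h_in.size());

  int* d_in  = nullptr;
  int* d_out = nullptr;
  cudaMalloc(&d_in, num_items * sizeof(int));
  cudaMalloc(&d_out, sizeof(int));
  cudaMemcpy(d_in, h_in.data(), num_items * sizeof(int), cudaMemcpyHostToDevice);

  cudaStream_t stream;
  cudaStreamCreate(&stream);

  // Size query and allocation happen before capture starts.
  void* d_temp_storage      = nullptr;
  size_t temp_storage_bytes = 0;
  cub::DeviceReduce::Sum(d_temp_storage, temp_storage_bytes, d_in, d_out, num_items, stream);
  cudaMalloc(&d_temp_storage, temp_storage_bytes);

  // Capture: the CUB call only records work into the graph. Per the text
  // above, this is the interval the NVTX range is reported for.
  cudaGraph_t graph;
  cudaStreamBeginCapture(stream, cudaStreamCaptureModeGlobal);
  cub::DeviceReduce::Sum(d_temp_storage, temp_storage_bytes, d_in, d_out, num_items, stream);
  cudaStreamEndCapture(stream, &graph);

  // Launch: the reduction actually executes here, outside the NVTX range.
  cudaGraphExec_t graph_exec;
  cudaGraphInstantiateWithFlags(&graph_exec, graph, 0);
  cudaGraphLaunch(graph_exec, stream);
  cudaStreamSynchronize(stream);

  int h_out = 0;
  cudaMemcpy(&h_out, d_out, sizeof(int), cudaMemcpyDeviceToHost);

  cudaGraphExecDestroy(graph_exec);
  cudaGraphDestroy(graph);
  cudaStreamDestroy(stream);
  cudaFree(d_temp_storage);
  cudaFree(d_out);
  cudaFree(d_in);
  return h_out;
}

int main()
{
  std::printf("sum = %d\n", sum_via_graph({1, 2, 3, 4}));
  return 0;
}
```

In a profiler timeline, the kernels launched by cudaGraphLaunch would therefore not be expected to sit under the CUB range.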