Thread-level Primitives#

CUB thread-level algorithms are specialized for execution by a single thread.