Thread-level Primitives

CUB thread-level algorithms are specialized for execution by a single thread.