Device-Wide Primitives
CUB device-level single-problem parallel algorithms:
cub::DeviceAdjacentDifference
computes the difference between adjacent elements residing within device-accessible memory
cub::DeviceFor
provides device-wide, parallel operations for iterating over data residing within device-accessible memory
cub::DeviceHistogram
constructs histograms from data samples residing within device-accessible memory
cub::DevicePartition
partitions data residing within device-accessible memory
cub::DeviceMerge
merges two sorted sequences in device-accessible memory into a single one
cub::DeviceMergeSort
sorts items residing within device-accessible memory
cub::DeviceRadixSort
sorts items residing within device-accessible memory using the radix sorting method
cub::DeviceReduce
computes a reduction of items residing within device-accessible memory
cub::DeviceRunLengthEncode
demarcates “runs” of same-valued items within a sequence residing within device-accessible memory
cub::DeviceScan
computes a prefix scan across a sequence of data items residing within device-accessible memory
cub::DeviceSelect
compacts data residing within device-accessible memory
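To make the descriptions above concrete, the following is a minimal host-side sketch of what a few of these primitives compute. It is a sequential reference written with the C++ standard library, not CUB's API or implementation; the function names (`reduce_sum`, `exclusive_sum`, `run_length_encode`, `adjacent_difference_left`) are hypothetical and chosen only for illustration. The device-wide algorithms produce comparable results in parallel on data in device-accessible memory.

```cpp
#include <cstddef>
#include <numeric>
#include <utility>
#include <vector>

// Reduction (sum): collapses the whole sequence to a single value.
int reduce_sum(const std::vector<int>& in) {
    return std::accumulate(in.begin(), in.end(), 0);
}

// Exclusive prefix sum: out[i] is the sum of in[0..i-1], so out[0] == 0.
std::vector<int> exclusive_sum(const std::vector<int>& in) {
    std::vector<int> out(in.size());
    int running = 0;
    for (std::size_t i = 0; i < in.size(); ++i) {
        out[i] = running;   // sum of everything before position i
        running += in[i];
    }
    return out;
}

// Run-length encoding: one (value, length) pair per run of equal items.
std::vector<std::pair<int, int>> run_length_encode(const std::vector<int>& in) {
    std::vector<std::pair<int, int>> runs;
    for (int v : in) {
        if (!runs.empty() && runs.back().first == v) {
            ++runs.back().second;      // extend the current run
        } else {
            runs.emplace_back(v, 1);   // start a new run
        }
    }
    return runs;
}

// Adjacent difference (left-handed): out[0] = in[0],
// out[i] = in[i] - in[i-1] for i > 0.
std::vector<int> adjacent_difference_left(const std::vector<int>& in) {
    std::vector<int> out(in.size());
    std::adjacent_difference(in.begin(), in.end(), out.begin());
    return out;
}
```

For example, under these definitions the input `{7, 7, 8, 8, 8, 9}` encodes to the runs `(7, 2)`, `(8, 3)`, `(9, 1)`.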
CUB device-level segmented-problem (batched) parallel algorithms:
cub::DeviceSegmentedSort
computes batched sort across non-overlapping sequences of data residing within device-accessible memory
cub::DeviceSegmentedRadixSort
computes batched radix sort across non-overlapping sequences of data residing within device-accessible memory
cub::DeviceSegmentedReduce
computes reductions across multiple sequences of data residing within device-accessible memory
cub::DeviceCopy
provides device-wide, parallel operations for batched copying of data residing within device-accessible memory
cub::DeviceMemcpy
provides device-wide, parallel operations for batched copying of data residing within device-accessible memory
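The segmented (batched) variants apply one operation independently to each of several sequences packed into a single array. A minimal host-side sketch of a segmented sum follows, assuming segments are described by an offsets array in which segment `s` covers the half-open range `[offsets[s], offsets[s + 1])`. The function name `segmented_sum` is hypothetical, and this is a sequential reference for the semantics only, not CUB's API or implementation.

```cpp
#include <cstddef>
#include <numeric>
#include <vector>

// Sums each segment of `values` independently.
// `offsets` holds num_segments + 1 non-decreasing entries; segment s
// spans values[offsets[s]] up to, but not including, values[offsets[s + 1]].
// An empty segment (equal consecutive offsets) yields the initial value 0.
std::vector<int> segmented_sum(const std::vector<int>& values,
                               const std::vector<std::size_t>& offsets) {
    std::vector<int> out;
    for (std::size_t s = 0; s + 1 < offsets.size(); ++s) {
        auto first = values.begin() + static_cast<std::ptrdiff_t>(offsets[s]);
        auto last  = values.begin() + static_cast<std::ptrdiff_t>(offsets[s + 1]);
        out.push_back(std::accumulate(first, last, 0));
    }
    return out;
}
```

With values `{8, 6, 7, 5, 3, 0, 9}` and offsets `{0, 3, 3, 7}`, the three segments are `{8, 6, 7}`, an empty segment, and `{5, 3, 0, 9}`, so one output is produced per segment.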