CUTLASS
CUDA Templates for Linear Algebra Subroutines and Solvers
Internal details made public to facilitate introspection.
#include <regular_tile_access_iterator_tensor_op.h>
Static Public Attributes
static int const kAccessSizeInBits = 128
static int const kPointerCount
Member Data Documentation
static int const kAccessSizeInBits = 128
This iterator is specialized for an access size that is 128 bits in length.
static int const kPointerCount
Number of pointers
Note: TN kblock32 layouts need only 1 pointer, but, surprisingly, reducing the pointer count hurts performance.
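As an illustration of what a fixed 128-bit access size implies, the sketch below derives how many elements of a given type fit in one access. It is a minimal stand-alone example, not the CUTLASS implementation: `AccessSketch`, `kElementSizeInBits`, `kElementsPerAccess`, `elements_per_access`, and `bytes_moved` are hypothetical names introduced here, and the element width is assumed to come from `sizeof`, so sub-byte element types are not covered.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for illustration only; this is NOT the CUTLASS iterator.
template <typename Element>
struct AccessSketch {
  // Mirrors the documented attribute: every access moves 128 bits.
  static int const kAccessSizeInBits = 128;

  // Assumption: element width is derived from sizeof(), so only
  // byte-aligned element types are handled by this sketch.
  static int const kElementSizeInBits = int(sizeof(Element)) * 8;

  // An access must hold a whole number of elements.
  static_assert(kAccessSizeInBits % kElementSizeInBits == 0,
                "element width must evenly divide the access size");

  // Number of elements carried by one 128-bit access.
  static int const kElementsPerAccess = kAccessSizeInBits / kElementSizeInBits;
};

// Returns the number of elements per 128-bit access for Element.
template <typename Element>
constexpr int elements_per_access() {
  return AccessSketch<Element>::kElementsPerAccess;
}

// Returns the number of bytes moved by `accesses` 128-bit accesses.
inline int bytes_moved(int accesses) {
  assert(accesses >= 0);
  return accesses * (AccessSketch<std::uint8_t>::kAccessSizeInBits / 8);
}
```

Under these assumptions, a 16-bit element type gives eight elements per access and a 32-bit type gives four, which is why a single 128-bit access can replace several narrower loads or stores.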