CUTLASS
CUDA Templates for Linear Algebra Subroutines and Solvers
Defines basic properties needed by CTA-level GEMMs, assuming expectations about the data layout of the global memory fragments, data types, and internal tile sizes.
#include "cutlass/cutlass.h"
#include "cutlass/array.h"
#include "cutlass/numeric_types.h"
#include "cutlass/matrix_shape.h"
#include "cutlass/gemm/warp/mma.h"
#include "cutlass/gemm/threadblock/mma_pipelined.h"
#include "cutlass/gemm/threadblock/mma_singlestage.h"
Namespaces
cutlass
cutlass::gemm
cutlass::gemm::threadblock
Partial specializations for threadblock::Mma operations targeting TensorOp instructions.
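As an illustration of the partial-specialization pattern described above, the following is a minimal, self-contained sketch in plain C++. It does not use CUTLASS headers, and every name in it (TileShape, OpClassSimtTag, OpClassTensorOpTag, MmaCoreSketch, ExampleCore) is a hypothetical stand-in rather than the library's API. It assumes a warp size of 32 threads and derives the warp count and thread count of a threadblock from the threadblock and warp tile sizes.

```cpp
// Hypothetical operator-class tags; stand-ins, NOT the CUTLASS definitions.
struct OpClassSimtTag {};
struct OpClassTensorOpTag {};

// Hypothetical compile-time tile shape (M x N x K), for illustration only.
template <int M, int N, int K>
struct TileShape {
  static constexpr int kM = M;
  static constexpr int kN = N;
  static constexpr int kK = K;
};

// Primary template: declared but not defined, so an unsupported
// combination of parameters fails at compile time.
template <typename ThreadblockShape, typename WarpShape,
          typename Element, typename OperatorClass>
struct MmaCoreSketch;

// Partial specialization chosen when the operator class is the
// (hypothetical) tensor-op tag. It fixes the derived properties
// that the rest of a threadblock-level GEMM would consume.
template <typename ThreadblockShape, typename WarpShape, typename Element>
struct MmaCoreSketch<ThreadblockShape, WarpShape, Element, OpClassTensorOpTag> {
  static_assert(ThreadblockShape::kM % WarpShape::kM == 0 &&
                ThreadblockShape::kN % WarpShape::kN == 0,
                "Threadblock tile must be divisible by the warp tile.");

  // Number of warps along each dimension of the threadblock tile.
  static constexpr int kWarpCountM = ThreadblockShape::kM / WarpShape::kM;
  static constexpr int kWarpCountN = ThreadblockShape::kN / WarpShape::kN;

  // Assumption: 32 threads per warp.
  static constexpr int kWarpSize = 32;

  // Total threads participating in the threadblock.
  static constexpr int kThreads = kWarpCountM * kWarpCountN * kWarpSize;

  using ElementType = Element;
};

// Example instantiation: a 128x128x32 threadblock tile split into 64x64x32 warp tiles.
using ExampleCore = MmaCoreSketch<TileShape<128, 128, 32>,
                                  TileShape<64, 64, 32>,
                                  float,
                                  OpClassTensorOpTag>;

static_assert(ExampleCore::kWarpCountM == 2, "two warps along M");
static_assert(ExampleCore::kWarpCountN == 2, "two warps along N");
```

Because the primary template is left undefined, instantiating MmaCoreSketch with OpClassSimtTag in this sketch would be a compile error until a matching specialization is supplied. This shows how one specialization per instruction class and layout combination can select a different set of derived properties.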