CUTLASS
CUDA Templates for Linear Algebra Subroutines and Solvers
|
Implements several possible threadblock-swizzling functions mapping blockIdx to GEMM problems. More...
Go to the source code of this file.
Classes | |
struct | cutlass::gemm::threadblock::GemmIdentityThreadblockSwizzle |
Threadblock swizzling function for GEMMs. More... | |
struct | cutlass::gemm::threadblock::GemmHorizontalThreadblockSwizzle |
Threadblock swizzling function for GEMMs. More... | |
struct | cutlass::gemm::threadblock::GemmBatchedIdentityThreadblockSwizzle |
Threadblock swizzling function for batched GEMMs. More... | |
struct | cutlass::gemm::threadblock::GemmSplitKIdentityThreadblockSwizzle |
Threadblock swizzling function for split-K GEMMs. More... | |
struct | cutlass::gemm::threadblock::GemmSplitKHorizontalThreadblockSwizzle |
Threadblock swizzling function for split-K GEMMs. More... | |
struct | cutlass::gemm::threadblock::GemvBatchedStridedThreadblockDefaultSwizzle |
Threadblock swizzling function for batched GEMVs. More... | |
Namespaces | |
cutlass | |
cutlass::gemm | |
cutlass::gemm::threadblock | |
Functions | |
CUTLASS_DEVICE int | cutlass::gemm::threadblock::RematerializeThreadIdxX () |
Helper to rematerialize block Idx. Reduces register liveness. More... | |
CUTLASS_DEVICE int | cutlass::gemm::threadblock::RematerializeThreadIdxY () |
Helper to rematerialize block Idx. Reduces register liveness. More... | |
CUTLASS_DEVICE int | cutlass::gemm::threadblock::RematerializeThreadIdxZ () |
Helper to rematerialize block Idx. Reduces register liveness. More... | |
CUTLASS_DEVICE int | cutlass::gemm::threadblock::RematerializeBlockIdxX () |
Helper to rematerialize block Idx. Reduces register liveness. More... | |
CUTLASS_DEVICE int | cutlass::gemm::threadblock::RematerializeBlockIdxY () |
Helper to rematerialize block Idx. Reduces register liveness. More... | |
CUTLASS_DEVICE int | cutlass::gemm::threadblock::RematerializeBlockIdxZ () |
Helper to rematerialize block Idx. Reduces register liveness. More... | |
CUTLASS_DEVICE int | cutlass::gemm::threadblock::RematerializeBlockDimX () |
Helper to rematerialize block Dim. Reduces register liveness. More... | |
CUTLASS_DEVICE int | cutlass::gemm::threadblock::RematerializeBlockDimY () |
Helper to rematerialize block Dim. Reduces register liveness. More... | |
CUTLASS_DEVICE int | cutlass::gemm::threadblock::RematerializeBlockDimZ () |
Helper to rematerialize block Dim. Reduces register liveness. More... | |