CUTLASS
CUDA Templates for Linear Algebra Subroutines and Solvers
|
Files | |
file | default_gemm_configuration.h [code] |
Definitions for GEMM structures. | |
file | include/cutlass/gemm/device/gemm.h [code] |
Template for a pipelined GEMM kernel. Does not compute batching or support split-K. | |
file | device/gemm_batched.h [code] |
Template for a pipelined GEMM kernel. Does not compute batching or support split-K. | |
file | include/cutlass/gemm/device/gemm_complex.h [code] |
Template for a pipelined GEMM kernel. Does not compute batching or support split-K. | |
file | device/gemm_splitk_parallel.h [code] |
Template for GEMM performing a reduction over K partitions in parallel. | |