40 typename ThreadBlockShape_,
42 typename ThreadShape_,
56 typename ElementAccumulator_ = ElementCD_>
Describes the size of a matrix tile.
Definition: matrix_shape.h:42
Definition: aligned_buffer.h:35
ElementAccumulator_ ElementAccumulator
Data type of accumulators.
Definition: default_gemv.h:78
typename ThreadBlockGemv::IteratorB IteratorB
Definition: default_gemv.h:101
ThreadShape_ ThreadShape
Shape of warp-level matrix operation (concept: GemmShape)
Definition: default_gemv.h:63
LayoutCD_ LayoutCD
Layout of input/output matrix C/D.
Definition: default_gemv.h:87
LayoutCD_ LayoutAccumulator
Data type of accumulators (same as C/D)
Definition: default_gemv.h:81
Template defining a shape used by pitch-linear operators.
Definition: pitch_linear.h:43
ElementA_ ElementA
Data type of multiplicand A.
Definition: default_gemv.h:66
typename IteratorCD::Fragment FragmentCD
Fragment storage for C/D.
Definition: default_gemv.h:116
Structure to compute the matrix-vector product using SIMT math instructions.
Definition: gemv.h:50
LayoutA_ LayoutA
Layout of multiplicand A.
Definition: default_gemv.h:69
Definition: default_gemv_core.h:68
Definition: default_gemv.h:57
typename platform::conditional< platform::is_same< LayoutCD, layout::RowMajor >::value, cutlass::transform::PitchLinearTilePolicyStripminedThreadContiguous< layout::PitchLinearShape< ThreadBlockShape::kN, ThreadBlockShape::kM >, Core::kThreadsPerN, ThreadShape::kN >, cutlass::transform::PitchLinearTilePolicyStripminedThreadStrided< layout::PitchLinearShape< ThreadBlockShape::kM, ThreadBlockShape::kN >, Core::kThreadsPerN, ThreadShape::kM >>::type IteratorPolicyCD
Policy for the iterator that reads/writes C/D.
Definition: default_gemv.h:109
typename Core_::IteratorA IteratorA
Iterates over A in global memory.
Definition: gemv.h:58
Defines basic properties needed by CTA-level batched GEMV assuming expectations about data layout of ...
typename Core_::IteratorB IteratorB
Iterates over B in global memory.
Definition: gemv.h:61
Threadblock swizzling function for batched GEMVs.
Definition: gemm/threadblock/threadblock_swizzle.h:296
Template for a threadblock-scoped GEMV kernel.
typename ThreadBlockGemv::IteratorA IteratorA
Definition: default_gemv.h:98
Implements several possible threadblock-swizzling functions mapping blockIdx to GEMM problems...
typename cutlass::gemm::threadblock::DefaultGemvCore< ThreadBlockShape, ThreadShape, ElementA, LayoutA, ElementB, LayoutB, ElementAccumulator, LayoutAccumulator > Core
Definition: default_gemv.h:92
ElementB_ ElementB
Data type of multiplicand B.
Definition: default_gemv.h:72
LayoutB_ LayoutB
Layout of multiplicand B.
Definition: default_gemv.h:75
ThreadBlockShape_ ThreadBlockShape
Shape of Threadblock-level matrix operation (concept: GemmShape)
Definition: default_gemv.h:60
ElementCD_ ElementCD
Data type of input/output matrix C/D.
Definition: default_gemv.h:84