typename ElementOutput_,
typename ElementAccumulator_ = ElementOutput_,
return destination_converter(accumulator);
Definition: conversion_op.h:53
CUTLASS_HOST_DEVICE Convert(Params const &params=Params())
Constructs the function object, possibly loading from pointers in host memory.
Definition: conversion_op.h:83
FragmentAccumulator ComputeFragment
Definition: conversion_op.h:64
Definition: aligned_buffer.h:35
CUTLASS_HOST_DEVICE constexpr bool is_source_needed() const
Returns true if source is needed based on state of runtime arguments.
Definition: conversion_op.h:89
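A minimal sketch of how a caller might use is_source_needed() to skip loading the source operand. Everything here is hypothetical and written against std::array so it compiles without CUTLASS: PassThrough, AddSource, apply_epilogue and the source_loads counter are illustration-only names, not part of the library.

```cpp
#include <array>
#include <cstddef>

using Frag = std::array<float, 2>;

// Hypothetical functor that never reads the source operand.
struct PassThrough {
  constexpr bool is_source_needed() const { return false; }
  Frag operator()(Frag const &accumulator, Frag const & /*source*/) const {
    return accumulator;
  }
};

// Hypothetical functor that does read the source operand.
struct AddSource {
  constexpr bool is_source_needed() const { return true; }
  Frag operator()(Frag const &accumulator, Frag const &source) const {
    Frag result{};
    for (std::size_t i = 0; i < result.size(); ++i) {
      result[i] = accumulator[i] + source[i];
    }
    return result;
  }
};

// Hypothetical epilogue step: the source is copied (standing in for a memory
// load) only when the functor reports that it needs it. source_loads counts
// how many such loads happened.
template <typename OutputOp>
Frag apply_epilogue(OutputOp const &op, Frag const &accumulator,
                    Frag const &source_in_memory, int &source_loads) {
  Frag source{};  // stays zero-filled when the load is skipped
  if (op.is_source_needed()) {
    source = source_in_memory;
    ++source_loads;
  }
  return op(accumulator, source);
}
```

With PassThrough the load is skipped entirely, while AddSource triggers exactly one load per call; the same query pattern lets an epilogue avoid memory traffic for functors that ignore the source.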
static FloatRoundStyle const kRound
Definition: conversion_op.h:66
ElementAccumulator_ ElementAccumulator
Definition: conversion_op.h:57
Statically sized array of elements that accommodates all CUTLASS-supported numeric types and is safe ...
CUTLASS_HOST_DEVICE FragmentOutput operator()(FragmentAccumulator const &accumulator, FragmentOutput const &source, ElementCompute uniform=ElementCompute(0)) const
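Converts the accumulator fragment to the output element type; the body returns destination_converter(accumulator) rather than computing the linear scaling D = alpha * accumulator + beta * source.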
Definition: conversion_op.h:102
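The listing shows operator() returning destination_converter(accumulator), so the result depends only on the accumulator fragment. The following is a simplified analogue of that behaviour, not the CUTLASS class. ConvertSketch and Fragment are hypothetical names, std::array stands in for the CUTLASS Array type, and the sketch assumes is_source_needed() reports false. It also uses a plain static_cast, which truncates toward zero for float-to-integer conversion instead of honouring a configurable rounding style such as kRound.

```cpp
#include <array>
#include <cstddef>

// Hypothetical stand-in for the CUTLASS Array type, so the sketch compiles
// without the library.
template <typename T, std::size_t N>
using Fragment = std::array<T, N>;

// Simplified analogue of a conversion-only epilogue functor.
template <typename ElementOutput, std::size_t Count,
          typename ElementAccumulator = ElementOutput>
struct ConvertSketch {
  using FragmentOutput = Fragment<ElementOutput, Count>;
  using FragmentAccumulator = Fragment<ElementAccumulator, Count>;

  // Assumption: the source operand is never read by this functor.
  constexpr bool is_source_needed() const { return false; }

  // Element-wise conversion of the accumulator. The source parameter is kept
  // only to match the three-operand call shape and is ignored.
  FragmentOutput operator()(FragmentAccumulator const &accumulator,
                            FragmentOutput const & /*source*/) const {
    FragmentOutput result{};
    for (std::size_t i = 0; i < Count; ++i) {
      // static_cast truncates toward zero for float -> int.
      result[i] = static_cast<ElementOutput>(accumulator[i]);
    }
    return result;
  }
};
```

For instance, ConvertSketch<int, 4, float> applied to the accumulator {1.0f, 2.0f, -3.0f, 4.0f} yields {1, 2, -3, 4} regardless of what the source fragment contains.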
Boost-like numeric conversion operator for CUTLASS numeric types.
static int const kCount
Definition: conversion_op.h:60
#define CUTLASS_HOST_DEVICE
Definition: cutlass.h:89
Top-level include for all CUTLASS numeric types.
CUTLASS_HOST_DEVICE Params()
Definition: conversion_op.h:76
ElementOutput_ ElementOutput
Definition: conversion_op.h:56
Array< ElementAccumulator, kCount > FragmentAccumulator
Definition: conversion_op.h:63
ElementAccumulator_ ElementCompute
Definition: conversion_op.h:58
FloatRoundStyle
Definition: numeric_conversion.h:43
CUTLASS_HOST_DEVICE constexpr bool is_source_ever_needed() const
Definition: conversion_op.h:96
Conversion operator for Array.
Definition: numeric_conversion.h:294
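To illustrate how an array conversion operator can be parameterised by a rounding style, here is a small sketch. RoundStyleSketch and ArrayConverterSketch are hypothetical names that only mirror the idea of FloatRoundStyle and the Array conversion operator; they are not the CUTLASS definitions. The sketch assumes a floating-point source type and the default floating-point rounding mode.

```cpp
#include <array>
#include <cmath>
#include <cstddef>

// Hypothetical rounding-style selector with two illustrative modes.
enum class RoundStyleSketch { to_nearest, toward_zero };

// Hypothetical element-wise converter for fixed-size arrays.
// Assumes From is a floating-point type.
template <typename To, typename From, std::size_t N,
          RoundStyleSketch Round = RoundStyleSketch::to_nearest>
struct ArrayConverterSketch {
  std::array<To, N> operator()(std::array<From, N> const &in) const {
    std::array<To, N> out{};
    for (std::size_t i = 0; i < N; ++i) {
      // std::nearbyint follows the current rounding mode (round-to-nearest
      // by default); std::trunc always discards the fractional part.
      From rounded = (Round == RoundStyleSketch::to_nearest)
                         ? std::nearbyint(in[i])
                         : std::trunc(in[i]);
      out[i] = static_cast<To>(rounded);
    }
    return out;
  }
};
```

Making the rounding style a template parameter fixes the choice at compile time, so each instantiation contains only the rounding path it needs.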
Array< ElementOutput, kCount > FragmentOutput
Definition: conversion_op.h:62
Basic include for CUTLASS.
Host-constructable parameters structure.
Definition: conversion_op.h:69
Define basic numeric operators with specializations for Array<T, N>. SIMD-ize where possible...