Inference Phase
-
namespace trt_edgellm
-
namespace rt
Enums
-
enum class InferencePhase : uint8_t
Distinguishes the inference phase for preprocessor calls; replaces the earlier
bool isPrefill flag. Extend with richer phases only when a consumer (e.g. DecodingStrategy) actually needs them.
Values:
-
enumerator kPrefill
-
enumerator kDecode
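As an illustration of why a scoped enum reads better than a bool flag at call sites, here is a minimal sketch. It restates the enum as documented above; the enumerator order is an assumption, and the helpers phaseName and tokensThisStep are hypothetical names invented for this example, not part of the documented API. The token counts are a simplification (whole prompt during prefill, one token per decode step).

```cpp
#include <cassert>
#include <cstdint>

namespace trt_edgellm {
namespace rt {

// Mirrors the documented declaration. The enumerator order (and therefore
// the underlying values) is an assumption made for this sketch.
enum class InferencePhase : std::uint8_t { kPrefill, kDecode };

// Hypothetical helper, not part of the documented API:
// returns a human-readable name for logging.
inline const char* phaseName(InferencePhase phase) {
    switch (phase) {
        case InferencePhase::kPrefill: return "prefill";
        case InferencePhase::kDecode:  return "decode";
    }
    // Only reachable if an out-of-range value was cast to the enum.
    assert(false && "unhandled InferencePhase");
    return "unknown";
}

// Hypothetical helper, not part of the documented API:
// simplified token count for one step. Prefill consumes the whole prompt,
// decode consumes a single token.
inline std::int32_t tokensThisStep(InferencePhase phase, std::int32_t promptLength) {
    switch (phase) {
        case InferencePhase::kPrefill: return promptLength;
        case InferencePhase::kDecode:  return 1;
    }
    assert(false && "unhandled InferencePhase");
    return 0;
}

} // namespace rt
} // namespace trt_edgellm
```

A call such as tokensThisStep(InferencePhase::kDecode, 128) states its intent at the call site, whereas a bare true or false argument would not. Because neither switch has a default label, GCC and Clang can warn (-Wswitch) about unhandled cases if an enumerator is added later, which fits the guidance to extend the enum only when a consumer needs it.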