LLM Inference Runtime
-
namespace trt_edgellm
-
namespace rt
Typedefs
-
using LLMInferenceRuntime = LLMInferenceSpecDecodeRuntime
Compatibility typedef: LLMInferenceRuntime is now an alias for LLMInferenceSpecDecodeRuntime. The unified runtime supports both vanilla (no draft model) and Eagle speculative-decoding modes. Construct it without an EagleDraftingConfig to get vanilla-only behavior identical to the old LLMInferenceRuntime.
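The alias pattern described above can be illustrated with a minimal, self-contained sketch. Everything in it is a hypothetical stand-in: the `sketch` namespace, the `draftingSteps` field, the constructors, and `isSpecDecodeEnabled()` are assumptions made for illustration and are not taken from the real `trt_edgellm::rt` headers. It only shows how a `using` alias lets code written against the old name keep compiling while resolving to the unified type.

```cpp
#include <cassert>
#include <optional>
#include <type_traits>

namespace sketch {  // hypothetical stand-in, not the real trt_edgellm::rt

// Placeholder for the drafting configuration. The real EagleDraftingConfig
// fields are not shown in this reference, so this field is an assumption.
struct EagleDraftingConfig {
    int draftingSteps{1};
};

// Stand-in for the unified runtime: one class covering both modes.
class LLMInferenceSpecDecodeRuntime {
public:
    // Vanilla mode: no draft model is configured.
    LLMInferenceSpecDecodeRuntime() = default;

    // Eagle spec-decode mode: a drafting configuration is supplied.
    explicit LLMInferenceSpecDecodeRuntime(EagleDraftingConfig config)
        : mDraftingConfig(config) {
        assert(config.draftingSteps > 0);
    }

    // Hypothetical query used here only to make the mode observable.
    bool isSpecDecodeEnabled() const { return mDraftingConfig.has_value(); }

private:
    std::optional<EagleDraftingConfig> mDraftingConfig;
};

// Compatibility alias: the old name resolves to the unified type.
using LLMInferenceRuntime = LLMInferenceSpecDecodeRuntime;

static_assert(std::is_same_v<LLMInferenceRuntime, LLMInferenceSpecDecodeRuntime>,
              "the alias and the unified runtime are the same type");

// Code written against the old name compiles unchanged and runs in
// vanilla mode because no EagleDraftingConfig is passed.
inline bool legacyCallerUsesVanillaMode() {
    LLMInferenceRuntime runtime;
    return !runtime.isSpecDecodeEnabled();
}

}  // namespace sketch
```

Because a `using` alias introduces no new type, overloads, templates, and forward-compatible call sites that mention either name refer to the same class; the sketch's `static_assert` checks exactly that property.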