LLM Inference Runtime#

namespace trt_edgellm
namespace rt

Typedefs

using LLMInferenceRuntime = LLMInferenceSpecDecodeRuntime#

Compatibility typedef — LLMInferenceRuntime is now LLMInferenceSpecDecodeRuntime. The unified runtime supports both vanilla (no draft model) and Eagle spec-decode modes. Construct without EagleDraftingConfig for vanilla-only behavior identical to the old LLMInferenceRuntime.