Registry Builder#

TensorRegistry trt_edgellm::rt::buildRegistryForLLM(
LLMEngineConfig const &cfg,
std::optional<int32_t> specDecodeBaseOutputHiddenDim = std::nullopt
)#

Build a TensorRegistry for the base LLM engine.

Produces a registry with all tensor specs matching the engine’s I/O contract for llmEngineRunner. Config flags control which optional tensor groups (deepstack, Mamba/recurrent state, EAGLE, LoRA) are included.

Parameters:
  • cfg – The engine configuration.

  • specDecodeBaseOutputHiddenDim – Optional hidden-state output dim for a SpecDecode base engine. When absent, the legacy EAGLE-3 convention is used.

Returns:

A populated TensorRegistry.

TensorRegistry trt_edgellm::rt::buildRegistryForSpecDecodeDraft(
DeploymentConfig const &bundle
)#

Build a TensorRegistry for a SpecDecode draft engine.

Produces a registry with all tensor specs matching the engine’s I/O contract for the draft runner. The draft engine always uses plugin-based KV cache and proposal-attention tensors.

Parameters:

bundle – The deployment configuration. bundle.draft and bundle.specConfig must both be set; the draft registry needs the consolidated SpecDecode settings to size cross-engine bindings (e.g. base hidden states fed into the draft).

Returns:

A populated TensorRegistry.