Registry Builder#
- TensorRegistry trt_edgellm::rt::buildRegistryForLLM(
- LLMEngineConfig const &cfg,
- std::optional<int32_t> specDecodeBaseOutputHiddenDim = std::nullopt
Build a TensorRegistry for the base LLM engine.
Produces a registry with all tensor specs matching the engine’s I/O contract for
llmEngineRunner. Config flags control which optional tensor groups (deepstack, Mamba/recurrent state, EAGLE, LoRA) are included.- Parameters:
cfg – The engine configuration.
specDecodeBaseOutputHiddenDim – Optional hidden-state output dim for a SpecDecode base engine. When absent, the legacy EAGLE-3 convention is used.
- Returns:
A populated TensorRegistry.
- TensorRegistry trt_edgellm::rt::buildRegistryForSpecDecodeDraft(
- DeploymentConfig const &bundle
Build a TensorRegistry for a SpecDecode draft engine.
Produces a registry with all tensor specs matching the engine’s I/O contract for the draft runner. The draft engine always uses plugin-based KV cache and proposal-attention tensors.
- Parameters:
bundle – The deployment configuration.
bundle.draftandbundle.specConfigmust both be set; the draft registry needs the consolidated SpecDecode settings to size cross-engine bindings (e.g. base hidden states fed into the draft).- Returns:
A populated TensorRegistry.