mod adaptive_hints_intercept#
- module adaptive_hints_intercept#
AdaptiveHintsIntercept: opt-in LLM request intercept that injects AgentHints from HotCache trie.
This module provides
AdaptiveHintsIntercept, which buildsAgentHintsfrom the prediction trie inHotCacheand injects them into LLM request headers as a request intercept. AdaptiveHintsIntercept is opt-in and synchronously transforms theLlmRequestbefore it reaches the callable.Structs and Unions
- struct AdaptiveHintsIntercept#
Opt-in LLM request intercept that injects
AgentHintsinto request headers from the prediction trie inHotCache.Constructed via
AdaptiveHintsIntercept::newand converted to anLlmRequestInterceptFnviaAdaptiveHintsIntercept::into_request_fnfor registration with the NeMo Flow runtime.Implementations
- impl AdaptiveHintsIntercept#
Functions
- fn into_request_fn(self) -> LlmRequestInterceptFn#
Converts this intercept into an
LlmRequestInterceptFnsuitable for registration withregister_llm_request_intercept.The returned closure reads the HotCache trie, builds AgentHints, injects them into the request headers and body, and returns the transformed request.