C++ API Reference# This section provides documentation for the TensorRT Edge-LLM C++ API. Builder Module Builder Common Module Binding Names Check Macros CUDA Utils File Utils Hash Utils Logger MMAP Reader Safetensors Utils String Utils Tensor TRT Utils Version Kernels Module Apply Rope Write KV Batch Evict Kernels Context FMHA Runner Decoder XQA Runner Dequantize EAGLE Accept Kernels EAGLE Util Kernels Embedding Kernels FMHA Params V2 Image Util Kernels Initialize Cos Sin Cache Int4 Groupwise GEMM KV Cache Utils Kernels Util Kernels Vectorized Types Multimodal Module Image Utils Intern ViT Runner Model Types Multimodal Runner Phi4mm ViT Runner Qwen ViT Runner Plugins Module Attention Plugin Int4 Groupwise GEMM Plugin Plugin Utils Profiling Module Metrics Timer Runtime Module EAGLE Draft Engine Runner Image Utils Linear KV Cache LLM Engine Runner LLM Inference Runtime LLM Inference Spec Decode Runtime LLM Runtime Utils Sampler Module Sampling Tokenizer Module Pre Tokenizer Token Encoder Tokenizer Tokenizer Utils Unicode Data