TensorRT Edge-LLM Documentation
Welcome to the TensorRT Edge-LLM documentation. This library provides optimized inference for large language models (LLMs) and vision-language models (VLMs) on edge devices.
Getting Started
Examples
Features
Input & Chat Format
Performance
Software Design
Customization
Testing
APIs