Getting Started
Installation
LLM API
LLM API Examples
Model Definition API
C++ API
Command-Line Reference
Architecture
Advanced
Performance
Reference
Blogs
tensorrt_llm
tensorrt_llm.functional
tensorrt_llm.layers.activation
tensorrt_llm.layers.attention
tensorrt_llm.layers.cast
tensorrt_llm.layers.conv
tensorrt_llm.layers.embedding
tensorrt_llm.layers.linear
tensorrt_llm.layers.mlp
tensorrt_llm.layers.normalization
tensorrt_llm.layers.pooling
tensorrt_llm.models
tensorrt_llm.plugin
tensorrt_llm.quantization
tensorrt_llm.runtime