TensorRT Model Optimizer
default_config
Default EAGLE architecture config.