TensorRT Model Optimizer
Getting Started
Overview
Installation
Quick Start: PTQ - PyTorch
Quick Start: PTQ - ONNX
Quick Start: PTQ - Windows
Quick Start: QAT
Quick Start: Pruning
Quick Start: Distillation
Quick Start: Speculative Decoding
Quick Start: Sparsity
Guides
Support Matrix
Quantization
Saving & Restoring
Pruning
Distillation
Speculative Decoding
Sparsity
NAS
AutoCast (ONNX)
Deployment
TensorRT-LLM
DirectML
Unified HuggingFace Checkpoint
Examples
All GitHub Examples
Reference
Changelog
modelopt API
deploy
onnx
torch
distill
export
nas
opt
peft
prune
quantization
sparsity
speculative
config
eagle
medusa
mode
plugins
speculative_decoding
utils
trace
utils
Support
Contact us
FAQs
TensorRT Model Optimizer
modelopt API
torch
speculative
eagle
default_config
View page source
default_config
Default EAGLE architecture config.