TensorRT Model Optimizer
Changelog
Model Optimizer Changelog (Linux)
Model Optimizer Changelog (Windows)