quantization
Modules
| Module for advanced quantization algorithms. | |
| Quantization backends. | |
| Calibrator classes. | |
| Compress model weights of quantized model. | |
| This document lists the quantization formats supported by Model Optimizer and example quantization configs. | |
| Quantization conversion/restore utilities. | |
| Utility to export a quantized torch model to quantized ONNX. | |
| Module to load C++ / CUDA extensions. | |
| This module contains the mode descriptor for the quantization mode. | |
| Calibration utilities. | |
| User-facing quantization API. | |
| Modules with quantization support. | |
| Handles quantization plugins to correctly quantize third-party modules. | |
| Tensor Class for Real Quantization. | |
| Basic tensor quantization functions. | |
| Triton quantization kernels. | |
| Quantization utilities. | 
Quantization package.