Python API Reference

This section documents the TensorRT Edge-LLM Python package.

The tensorrt_edgellm package provides utilities for quantizing large language models and exporting them to ONNX format for efficient inference on edge devices.
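As a quick orientation before reading the reference, the sketch below lists the public names a module exposes, using only the Python standard library. It assumes nothing about the package's API: list_public_names is a hypothetical helper written for this illustration, not part of tensorrt_edgellm, and it returns an empty list when the package is not installed.

```python
import importlib
import importlib.util


def list_public_names(module_name: str) -> list[str]:
    """Return the sorted public attribute names of a top-level module.

    Hypothetical helper for illustration only. Returns an empty list
    if the module cannot be found in the current environment.
    """
    # find_spec returns None for a top-level module that is not installed.
    if importlib.util.find_spec(module_name) is None:
        return []
    module = importlib.import_module(module_name)
    # Treat names without a leading underscore as public.
    return sorted(name for name in dir(module) if not name.startswith("_"))


if __name__ == "__main__":
    names = list_public_names("tensorrt_edgellm")
    print(names or "tensorrt_edgellm is not installed in this environment")
```

The names printed depend on the installed version of the package, so treat the output as a starting point for the reference entries rather than a substitute for them.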

Main Module