llm_export_utils
Modules
Utilities for exporting LLM models to ONNX. |
|
Quantization utilities for LLM models. |
|
Utilities to surgeon ONNX graph after export. |
Deprecated shim for the legacy modelopt.onnx.llm_export_utils package.
The in-repo LLM ONNX export pipeline (formerly examples/torch_onnx/llm_export.py
plus this package) was removed in 0.44.0rc1 in favor of
TensorRT-Edge-LLM, which provides
a more complete and actively maintained pipeline.
This package is preserved only as a compatibility shim so external consumers that
still import modelopt.onnx.llm_export_utils (notably TensorRT-Edge-LLM 0.6.1
and earlier) continue to work. It will be removed in a future release.
New code should migrate to:
modelopt.onnx.export— quant exporters (FP8QuantExporter,NVFP4QuantExporter, etc.)modelopt.onnx.graph_surgery— graph transforms (GQA replacement, BF16 conversion, etc.)TensorRT-Edge-LLM — end-to-end LLM export.