llm_export_utils

Modules

`modelopt.onnx.llm_export_utils.export_utils`	Utilities for exporting LLM models to ONNX.
`modelopt.onnx.llm_export_utils.quantization_utils`	Quantization utilities for LLM models.
`modelopt.onnx.llm_export_utils.surgeon_utils`	Utilities to surgeon ONNX graph after export.

Deprecated shim for the legacy modelopt.onnx.llm_export_utils package.

The in-repo LLM ONNX export pipeline (formerly examples/torch_onnx/llm_export.py plus this package) was removed in 0.44.0rc1 in favor of TensorRT-Edge-LLM, which provides a more complete and actively maintained pipeline.

This package is preserved only as a compatibility shim so external consumers that still import modelopt.onnx.llm_export_utils (notably TensorRT-Edge-LLM 0.6.1 and earlier) continue to work. It will be removed in a future release.

New code should migrate to:

modelopt.onnx.export — quant exporters (FP8QuantExporter, NVFP4QuantExporter, etc.)
modelopt.onnx.graph_surgery — graph transforms (GQA replacement, BF16 conversion, etc.)
TensorRT-Edge-LLM — end-to-end LLM export.