llm

Modules

modelopt.deploy.llm.generate

A wrapper over the TensorRT-LLM high level API runner.

modelopt.deploy.llm.nemo_utils

The utils to support Nemo models.

LLM deployment utils with tensorrt_llm.