llm

Modules

modelopt.deploy.llm.generate

A wrapper over the TensorRT-LLM high-level API runner.

LLM deployment utilities built on tensorrt_llm.