PyTorch Backend#
Note
Note: This feature is currently experimental, and the related API is subjected to change in future versions.
To enhance the usability of the system and improve developer efficiency, TensorRT-LLM launches a new experimental backend based on PyTorch.
The PyTorch backend of TensorRT-LLM is available in version 0.17 and later. You can try it via importing tensorrt_llm._torch
.
Quick Start#
Here is a simple example to show how to use tensorrt_llm.LLM
API with Llama model.
Features#
Developer Guide#
Key Components#
Known Issues#
The PyTorch backend on SBSA is incompatible with bare metal environments like Ubuntu 24.04. Please use the PyTorch NGC Container for optimal support on SBSA platforms.