Skip to content

🚧 Llama-3.1 Optimized with NVIDIA TransformerEngine

This folder contains source code and tests for an Llama-3.1 model that inherits from the transformers PreTrainedModel class and uses TransformerEngine layers.

This folder is currently work in progress and is not yet ready for general use.