🚧 Llama-3.1 Optimized with NVIDIA TransformerEngine
This folder contains source code and tests for an Llama-3.1 model that inherits from the transformers PreTrainedModel
class and uses TransformerEngine layers.
This folder is currently work in progress and is not yet ready for general use.