NeMo-Skills

NeMo-Skills is a collection of pipelines to improve "skills" of large language models (LLMs). We support everything needed for LLM development, from synthetic data generation, to model training, to evaluation on a wide range of benchmarks. Start developing on a local workstation and move to a large-scale Slurm cluster with just a one-line change.

Here are some of the features we support:

To get started, follow these steps, browse available pipelines or run ns --help to see all available commands and their options.

You can find more examples of how to use NeMo-Skills in the tutorials page.

We've built and released many popular models and datasets using NeMo-Skills. See all of them in the Papers & Releases documentation.

We support many popular benchmarks and it's easy to add new in the future. The following categories of benchmarks are supported