Librispeech (mini)

This config can be used to prepare the Librispeech mini dataset in the NeMo format.

It produces manifests for the mini split of Librispeech.

This config performs the following data processing.

Required arguments.

workspace_dir: the workspace folder where all audio files will be stored.

Note that you can customize any part of this config either directly or from the command line.

Output format.

This config generates 2 output manifest files:

Each output manifest contains the following fields:
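NeMo manifests are JSON-lines files, with one JSON object per utterance on each line, so the fields actually present in a generated manifest can be checked with a short script. The following is a minimal sketch, not part of this config: the helper names `read_manifest` and `manifest_fields` and the manifest path are hypothetical, and the script makes no assumption about which field names the manifest contains.

```python
import json
from pathlib import Path


def read_manifest(path):
    """Read a JSON-lines manifest: one JSON object per non-empty line."""
    entries = []
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:  # skip blank lines
                entries.append(json.loads(line))
    return entries


def manifest_fields(entries):
    """Return the sorted list of field names used across all entries."""
    return sorted({key for entry in entries for key in entry})


if __name__ == "__main__":
    # Hypothetical path: replace with a manifest written under your workspace_dir.
    manifest_path = Path("workspace_dir") / "manifest.json"
    if manifest_path.exists():
        entries = read_manifest(manifest_path)
        print(len(entries), "entries; fields:", manifest_fields(entries))
```

Running this against each generated manifest is a quick way to confirm that every entry carries the same set of fields before the data is passed to training.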