Skip to main content
Back to top
Ctrl
+
K
NVIDIA NeMo SDP
Search
Ctrl
+
K
How to write config files?
How to add a new processor?
Supported datasets
MCV Italian
MCV Spanish
MCV Portuguese
MCV Kazakh
MCV Georgian
MCV Uzbek
Mozilla Common Voice Arabic (MCV)
MLS Italian (with P&C)
MLS Italian (no P&C)
MLS Spanish
MLS Spanish (no P&C)
MLS Portuguese
VoxPopuli Italian
VoxPopuli Spanish
Fisher Spanish
SLR83
CORAAL
Text MCV (Armenian)
Audio books (Armenian)
FLEURS
FLEURS
FLEURS
Librispeech
Librispeech (mini)
Librispeech (all)
Coraa Portuguese
MTEDX Portuguese
Kazakh Speech Dataset (KSD)
Kazakh Speech Corpus (KSC)
Kazakh Speech Corpus 2
uzbekvoice
Massive Arabic Speech Corpus (MASC)
Massive Arabic Speech Corpus (MASC): Extracting clean data from noisy train subset.
MediaSpeech
Tarteel AI’s Everyayah
API
.rst
.pdf
MLS Spanish
MLS Spanish
#
TBD
Config link:
dataset_configs/spanish_pc/mls/config.yaml