MediaSpeech#
Dataset must be downloaded manually prior.
The config creates initial manifest for MediaSpeech dataset.
Required arguments.
raw_data_dir: path to the tarred dataset.
workspace_dir: specify the workspace folder where all audio files will be stored.
Output format.
This config dumps the final manifest at ${manifest_dir}/manifest.json
and wav files ${manifest_dir}/audios
.
The output manifest contains the following fields:
audio_filepath (str): relative path to the audio files.
text (str): transcription.
duration (float): audio duration in seconds.
Config link: dataset_configs/arabic/mediaspeech/config.yaml