MediaSpeech

MediaSpeech#

Dataset must be downloaded manually prior.

The config creates initial manifest for MediaSpeech dataset.

Required arguments.

  • raw_data_dir: path to the tarred dataset.

  • workspace_dir: specify the workspace folder where all audio files will be stored.

Output format.

This config dumps the final manifest at ${manifest_dir}/manifest.json and wav files ${manifest_dir}/audios. The output manifest contains the following fields:

  • audio_filepath (str): relative path to the audio files.

  • text (str): transcription.

  • duration (float): audio duration in seconds.

Config link: dataset_configs/arabic/mediaspeech/config.yaml