prepare_dataset

Functions

process_and_save_dataset

process_and_save_dataset(dataset_name, output_dir, split=('code', 'math', 'stem', 'chat'), overwrite=False)
Parameters:
  • dataset_name (str)

  • output_dir (str)

  • split (tuple)

  • overwrite (bool)