Skip to main content

read speech

AIR-RS-DB: A dataset for classifying Spontaneous and Read Speech

 

A set of 1028 audio files generated from 7 mp3 files downloaded from All India Radio. https://newsonair.gov.in/ and converted into wav  and then speaker diarized is  using https://huggingface.co/pyannote/speaker-diarization (pyannote/speaker-diarization@2022072,model) and derive 1028 audio files.

These are available as air-rs-db.zip (which can be downloaded)

 

Categories: