Sunil Kumar Kopparapu's picture
Congratulations!  You have been automatically subscribed to IEEE DataPort and can access all datasets on IEEE DataPort!
First Name: 
Sunil Kumar
Last Name: 
Kopparapu
Affiliation: 
TCS Research
Job Title: 
Principal Scientist
Expertise: 
Speech Processing

Datasets & Competitions

AIR-RS-DB: A dataset for classifying Spontaneous and Read Speech

 

A set of 1028 audio files generated from 7 mp3 files downloaded from All India Radio. https://newsonair.gov.in/ and converted into wav  and then speaker diarized is  using https://huggingface.co/pyannote/speaker-diarization (pyannote/speaker-diarization@2022072,model) and derive 1028 audio files.

Categories:
96 Views

Spoken Indian Language Identification Database

(9 languages, 8 different utterance lengths)

Languages

  1. Assamese 
  2. Bengali 
  3. Gujarati 
  4. Hindi 
  5. Kannada 
  6. Malayalam 
  7. Marathi 
  8. Tamil 
  9. Telugu

Durations

  1. 30 sec
  2. 10 sec
  3. 5 sec
  4. 3 sec
  5. 1 sec
  6. 0.5 sec
  7. 0.2 sec
  8. 0.1 sec

 

 

Categories:
1129 Views

Speech Processing in noisy condition allows researcher to build solutions that work in real world conditions. Environmental noise in Indian conditions are very different from typical noise seen in most western countries. This dataset is a collection of various noises, both indoor and outdoor ollected over a period of several months. The audio files are of the format RIFF (little-endian) data, WAVE audio, Microsoft PCM, 8 bit, mono 11025 Hz and have been recorded using the Dialogic CTI card.

Categories:
1054 Views