mp3

This is the official Thaat and Raga Forest (TRF) Dataset

Please do cite our paper: Link to Paper

Dataset is also available here: Link to Dataset

Categories:
1123 Views

Spoken Indian Language Identification Database

(9 languages, 8 different utterance lengths)

Languages

  1. Assamese 
  2. Bengali 
  3. Gujarati 
  4. Hindi 
  5. Kannada 
  6. Malayalam 
  7. Marathi 
  8. Tamil 
  9. Telugu

Durations

  1. 30 sec
  2. 10 sec
  3. 5 sec
  4. 3 sec
  5. 1 sec
  6. 0.5 sec
  7. 0.2 sec
  8. 0.1 sec

 

 

Categories:
1181 Views

The steganography and steganalysis of audio, especially compressed audio, have drawn increasing attention in recent years, and various algorithms are proposed. However, there is no standard public dataset for us to verify the efficiency of each proposed algorithm. Therefore, to promote the study field, we construct a dataset including 33038 stereo WAV audio clips with a sampling rate of 44.1 kHz and duration of 10s. And, all audio files are from the Internet through data crawling, which is for a better simulation of a real detection environment.

Categories:
3640 Views