Speech Dataset in Hindi Language

Citation Author(s):
Shivam Shukla
Submitted by:
Shivam Shukla
Last updated:
Tue, 06/09/2020 - 05:49
DOI:
10.21227/5vgy-yb08

Abstract 

The dataset covers 100 speakers, each with 5 voice samples for training and 1 voice sample for testing, for a total of 600 voice samples. The samples were collected in different audio formats such as mpeg, mp4, mp3, and ogg, then preprocessed and converted to .wav format. Each voice sample is 5-10 seconds long; because the lengths differ, parameters should be tuned before use. The whole dataset is 600 MB in size and 1 hour 40 minutes in duration. It can be used for speech synthesis, speaker identification, speaker recognition, speech recognition, etc. Preprocessing of the data is required.
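The figures in the abstract can be checked with a little arithmetic, and the varying clip lengths usually mean each clip has to be padded or trimmed to a common length before training. The following is a minimal sketch using only the Python standard library; the helper names `wav_duration_seconds` and `pad_or_trim` are hypothetical and are not part of the dataset.

```python
import wave


def wav_duration_seconds(path):
    """Return the duration of a PCM .wav file in seconds."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getnframes() / wav_file.getframerate()


def pad_or_trim(samples, target_len, pad_value=0):
    """Force a sequence of samples to exactly target_len items.

    Longer inputs are cut at the end; shorter inputs are padded
    at the end with pad_value (silence for PCM audio).
    """
    samples = list(samples)
    if len(samples) >= target_len:
        return samples[:target_len]
    return samples + [pad_value] * (target_len - len(samples))


# Sanity check of the counts quoted in the abstract.
speakers, train_per_speaker, test_per_speaker = 100, 5, 1
total_clips = speakers * (train_per_speaker + test_per_speaker)  # 600 clips

# Upper bound on total audio if every clip were the maximum 10 s:
# 6000 s, which equals 1 hour 40 minutes.
max_total_seconds = total_clips * 10
```

The target length passed to `pad_or_trim` is one of the parameters to tune: a value near the 10-second maximum keeps all audio at the cost of padding shorter clips, while a value near 5 seconds avoids padding but discards the tail of longer clips.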

Instructions: 

-> Download the Dataset

-> Unzip the files

-> Add voice_samples._path.txt to your training model so that it can read the data from the listed locations.

-> Make changes to your path.txt file according to your needs.
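The steps above can be sketched in Python as follows. This is only an illustration under stated assumptions: the layout of the path file is not documented here, so the sketch assumes it is plain text with one audio file path per line, and the function name `load_sample_paths` and its `base_dir` parameter are hypothetical.

```python
from pathlib import Path, PureWindowsPath


def load_sample_paths(list_file, base_dir=None):
    """Read audio file paths from a text file, one path per line (assumed format).

    Blank lines are skipped. If base_dir is given, only the file name of
    each entry is kept and re-rooted under base_dir, which is one simple
    way to adapt the list to wherever the archive was unzipped.
    """
    paths = []
    with open(list_file, encoding="utf-8") as handle:
        for line in handle:
            entry = line.strip()
            if not entry:
                continue
            if base_dir is not None:
                # PureWindowsPath splits on both "/" and "\", so the file
                # name is recovered whichever separator the list uses.
                file_name = PureWindowsPath(entry).name
                entry = str(Path(base_dir) / file_name)
            paths.append(entry)
    return paths
```

Note that re-rooting by file name drops any per-speaker subfolders; if the unzipped archive keeps speakers in separate directories, editing the path file by hand, as the last instruction suggests, may be the safer option.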

Comments

thank you

Submitted by Engin Butun on Thu, 08/06/2020 - 10:37

i am not able to download

Submitted by Prapti Trivedi on Sat, 11/28/2020 - 05:24

thanks

Submitted by TSAI JAN CHANG on Mon, 12/21/2020 - 08:25

Thanks

Submitted by Neekhil Rj on Mon, 10/04/2021 - 23:15

unable to download

Submitted by Sridhar Koneru on Sat, 12/18/2021 - 07:47

Very nice

Submitted by Neekhil Rj on Sat, 12/18/2021 - 10:51

Thanks

Submitted by Neekhil Rj on Mon, 04/25/2022 - 22:25

How to Download this Data set plz Help?

Submitted by Neekhil Rj on Mon, 04/25/2022 - 22:30

Dataset Files