Voice Pre-Processing and Quality Assessment Dataset (VPQAD)

Citation Author(s):
Ajan
Ahmed
Clarkson University
Md Jahangir Alam
Khondkar
Clarkson University
Ansen
Herrick
Clarkson University
Masudul H.
Imtiaz
Clarkson University
Submitted by:
Ajan Ahmed
Last updated:
Tue, 09/17/2024 - 04:43
DOI:
10.21227/yb1h-hs38
Data Format:
License:
0
0 ratings - Please login to submit your rating.

Abstract 

Voice Pre-processing and Quality Assessment Dataset (VPQAD), a scalable resource has been developed to validate various pre-processing techniques and improve voice signal quality in noisy environments. The dataset comprises voice recordings from 50 participants aged 18 to 40, captured in controlled real-life conditions using Audio Technica AT2020 and SHURE SM58 microphones. These high-quality recordings, made under diverse noise levels and settings, could be used for testing and developing voice enhancement algorithms. The dataset includes detailed metadata on the environment and participant demographics for analyzing and improving speech clarity and intelligibility, particularly in challenging conditions. To protect privacy, all data have been anonymized. VPQAD has been made public to promote collaborative research and advance research in biometrics, telecommunications, assistive technologies, and other applications requiring clear voice communication.

Instructions: 

Instructions will be given in the Acknowledgement form.

Documentation

AttachmentSize
File readme.txt1.28 KB
File End User License Agreement.pdf121.78 KB