Abstract

The Voice Pre-processing and Quality Assessment Dataset (VPQAD) is a scalable resource developed to validate pre-processing techniques and improve voice signal quality in noisy environments. The dataset comprises voice recordings from 50 participants aged 18 to 40, captured in controlled real-life conditions using Audio-Technica AT2020 and Shure SM58 microphones. These high-quality recordings, made under diverse noise levels and settings, can be used to develop and test voice enhancement algorithms. The dataset includes detailed metadata on the recording environment and participant demographics to support the analysis and improvement of speech clarity and intelligibility, particularly in challenging conditions. To protect privacy, all data have been anonymized. VPQAD has been made publicly available to promote collaborative research and advance work in biometrics, telecommunications, assistive technologies, and other applications requiring clear voice communication.

Instructions:

Instructions are provided in the Acknowledgement form.