Congratulations! You have been automatically subscribed to IEEE DataPort and can access all datasets on IEEE DataPort!
Congratulations! You have been automatically subscribed to IEEE DataPort and can access all datasets on IEEE DataPort!
Introduced here is the Extended-Length Audio Dataset for Synthetic Voice Detection and Speaker Recognition (ELAD-SVDSR), a resource designed to advance research in synthetic voice (DeepFake) detection and automatic speaker recognition (ASR). It features around 45-minute audio recordings from 36 participants, each of whom read aloud different newspaper articles during controlled sessions, captured with five different high-quality microphones. Synthetic voices generated from 20 subjects of this dataset using open-source and commercial software are also included.
Voice Pre-processing and Quality Assessment Dataset (VPQAD), a scalable resource has been developed to validate various pre-processing techniques and improve voice signal quality in noisy environments. The dataset comprises voice recordings from 50 participants aged 18 to 40, captured in controlled real-life conditions using Audio Technica AT2020 and SHURE SM58 microphones. These high-quality recordings, made under diverse noise levels and settings, could be used for testing and developing voice enhancement algorithms.