Audio Dataset

This dataset contains audio recordings sourced from more than 57 TV shows provided by the Saudi Broadcasting Authority. The total number of hours published for these recordings is ~667 hours. The recordings are in Arabic, the majority are in Saudi dialects, and some are in other dialects. To enhance the usage of SADA, the dataset is split into training, validation, and testing sets. Each of validation and testing sets is around 10 hours in audio segments length while training set is 418 hours.

Categories:
78 Views

Introduced here is the Extended-Length Audio Dataset for Synthetic Voice Detection and Speaker Recognition (ELAD-SVDSR), a resource designed to advance research in synthetic voice (DeepFake) detection and automatic speaker recognition (ASR). It features around 45-minute audio recordings from 36 participants, each of whom read aloud different newspaper articles during controlled sessions, captured with five different high-quality microphones. Synthetic voices generated from 20 subjects of this dataset using open-source and commercial software are also included.

Categories:
74 Views