.wav

The dataset collected for the whole Quran; 114 sura (6236 ayah) recited by 35 Reciters (approximately 218000 audio files), downloaded from this website https://www.a-quran.com/showthread.php?t=11017, the audio files downloaded in mp3 format, all the downloaded files based on the Hafs from A’asim narration, the dataset figure shows reciters names who participate in this dataset.

 

Categories:
435 Views

Data are collected before and after percutaneous transluminal angiography (PTA) for dialysis patients.

Each sample is labeled as a-b-before.wav or a-b-after.wav and the associated txt, where a is the patient id and b is the location id.

The first position was the arterial-venous junction,  and the second point was 3 cm from the first position along the vein.

 The distances between the adjacent positions were also about 3 cm.

 

Categories:
460 Views

Speech Processing in noisy condition allows researcher to build solutions that work in real world conditions. Environmental noise in Indian conditions are very different from typical noise seen in most western countries. This dataset is a collection of various noises, both indoor and outdoor ollected over a period of several months. The audio files are of the format RIFF (little-endian) data, WAVE audio, Microsoft PCM, 8 bit, mono 11025 Hz and have been recorded using the Dialogic CTI card.

Categories:
1056 Views

[Now uploading... Total size is 300GB.]

Categories:
271 Views

Time Scale Modification (TSM) is a well-researched field; however, no effective objective measure of quality exists.  This paper details the creation, subjective evaluation, and analysis of a dataset for use in the development of an objective measure of quality for TSM. Comprised of two parts, the training component contains 88 source files processed using six TSM methods at 10 time scales, while the testing component contains 20 source files processed using three additional methods at four time scales.

Categories:
661 Views

With the development of audio synthesis techniques, the most state-of-art synthesis methods based on  Generative Adversarial Network(GAN) have been proposed. Whether the automatic speaker verification (ASV) systems are vulnerability to the GAN based synthesized audios is urgently needed to be verified. We present a publicly available set of GAN based synthesized audios generated by some open source schemes (WaveGAN,TifGAN,GANSynth,MelGAN), which allows researches to verify impact of the GAN-synthetic audio on security of ASV systems.

Categories:
656 Views