FMFCC-A DATASET

Citation Author(s):: Zhenyu Zhang

Yewei Gu

Xiaowei Yi

Xianfeng Zhao
Submitted by:: Zhenyu Zhang
Last updated:: Thu, 10/14/2021 - 07:55
DOI:: 10.21227/wsf7-6j26
Data Format:: *.wav; *.txt

264 views

Categories:

Artificial Intelligence
Signal Processing

Keywords:

synthetic speech

ACCESS DATASET CITE

Abstract

FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech Detection

Instructions:

The FMFCC-A dataset is by far the largest publicly-available Mandarin dataset for synthetic speech detection, which contains 40,000 synthesized Mandarin utterances that generated by 11 Mandarin TTS systems and two Mandarin VC systems, and 10,000 genuine Mandarin utterances collected from 58 speakers.

Due to the limitation of data transmission， the FMFCC-A dataset are now shared through BaiduCloud (website: https://pan.baidu.com/s/1UR9BVJa0AU0OYEdPjwRq9Q , password: IIES)