Skip to main content

Datasets

Standard Dataset

FMFCC-A DATASET

Citation Author(s):
Zhenyu Zhang
Yewei Gu
Xiaowei Yi
Xianfeng Zhao
Submitted by:
Zhenyu Zhang
Last updated:
DOI:
10.21227/wsf7-6j26
Data Format:
No Ratings Yet

Abstract

FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech Detection

Instructions:

The FMFCC-A dataset is by far the largest publicly-available Mandarin dataset for synthetic speech detection, which contains 40,000 synthesized Mandarin utterances that generated by 11 Mandarin TTS systems and two Mandarin VC systems, and 10,000 genuine Mandarin utterances collected from 58 speakers. 

Due to the limitation of data transmission, the FMFCC-A dataset are now shared through BaiduCloud (website: https://pan.baidu.com/s/1UR9BVJa0AU0OYEdPjwRq9Q , password: IIES)