Datasets
Standard Dataset
Deepfake Synthetic-20K Dataset
- Citation Author(s):
- Submitted by:
- Sahil Sharma
- Last updated:
- Mon, 04/15/2024 - 06:14
- DOI:
- 10.21227/67x4-9g14
- Data Format:
- License:
- Categories:
- Keywords:
Abstract
The Deepfake-Synthetic-20K dataset significantly contributes to digital forensics and deepfake detection research. It comprises 20,000 high-resolution, synthetic human face images generated using the advanced StyleGAN-2 architecture. This dataset is designed to support the development and evaluation of machine-learning models that can differentiate between real and artificially synthesized human faces. Each image in the dataset has been meticulously crafted to ensure a diverse representation of age, gender, and ethnicity, reflecting the variability seen in global human populations. The images were produced under controlled conditions to mimic the subtleties and complexities of genuine human faces, thereby providing a robust platform for training and testing deepfake detection algorithms. Additionally, the dataset adheres to ethical guidelines, avoiding privacy violations and consent issues associated with using real human images. This dataset is available to academics and researchers under a Creative Commons license, facilitating wide access and collaborative advancements in combating deepfake technology's challenges.
This dataset has 20K synthetically generated face images for deepfake research.
Dataset Files
- 128px Deepfake-Synthetic-20K Dataset-128px.zip (550.82 MB)
- 256px Deepfake-Synthetic-20K Dataset-256px.zip (1.95 GB)
- 512px Deepfake-Synthetic-20K Dataset-512px.zip (7.21 GB)
- 1024px Deepfake-Synthetic-20K Dataset-1024px.zip (24.76 GB)
Documentation
Attachment | Size |
---|---|
Readme file | 123.15 KB |
Comments
.
Can I have the dataset for research purpose. I will cite the dataset
Dear Sir,
I have downloaded the dataset you have provided. However, I noticed there are no real images for the corresponding fake images in the 20k dataset. Please can you upload the real images for this dataset? That would be helpful?