Abstract

The Deepfake-Synthetic-20K dataset is a resource for digital forensics and deepfake detection research. It comprises 20,000 high-resolution synthetic human face images generated with the StyleGAN-2 architecture. The dataset is designed to support the development and evaluation of machine-learning models that distinguish real human faces from artificially synthesized ones. The images were selected to represent a diverse range of ages, genders, and ethnicities, reflecting the variability seen in global human populations. They were produced under controlled conditions to mimic the subtleties and complexities of genuine human faces, providing a robust basis for training and testing deepfake detection algorithms. Because every face is synthetic, the dataset also avoids the privacy and consent issues associated with using images of real people. It is available to academics and researchers under a Creative Commons license, facilitating wide access and collaborative work on the challenges posed by deepfake technology.

Instructions:

This dataset has 20K synthetically generated face images for deepfake research.

Comments

Submitted by Ankita Sarkar on Thu, 08/01/2024 - 22:40

Can I have the dataset for research purposes? I will cite the dataset.

Submitted by Safiul Chowdhury on Sun, 10/20/2024 - 15:38

Dear Sir,

I have downloaded the dataset you provided. However, I noticed that the 20K dataset contains no real images corresponding to the fake images. Could you please upload the real images for this dataset? That would be helpful.

Submitted by Jireh Jam on Sat, 11/02/2024 - 04:15

Isn't it available on GitHub or somewhere else so I can use it for research purposes?
Or is there any process to get the dataset?

Submitted by Aakash Maurya on Tue, 02/04/2025 - 12:23

Dataset Files

Resolution	File	Size
128px	Deepfake-Synthetic-20K Dataset-128px.zip	550.82 MB
256px	Deepfake-Synthetic-20K Dataset-256px.zip	1.95 GB
512px	Deepfake-Synthetic-20K Dataset-512px.zip	7.21 GB
1024px	Deepfake-Synthetic-20K Dataset-1024px.zip	24.76 GB
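
Once one of the archives above has been downloaded, the images can be enumerated without unpacking the whole file. The following is a minimal sketch using only Python's standard library. The internal folder layout and image format of the archives are not described on this page, so the extension set below is an assumption, and the helper names `list_images` and `read_image_bytes` are hypothetical.

```python
import zipfile
from pathlib import PurePosixPath

# Assumption: images use one of these extensions. The page does not
# document the actual format; check the Readme file and adjust.
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg"}


def list_images(archive_path):
    """Return the sorted names of image members inside a zip archive."""
    with zipfile.ZipFile(archive_path) as archive:
        return sorted(
            name
            for name in archive.namelist()
            if not name.endswith("/")  # skip directory entries
            and PurePosixPath(name).suffix.lower() in IMAGE_EXTENSIONS
        )


def read_image_bytes(archive_path, member_name):
    """Read the raw bytes of one archive member without extracting to disk."""
    with zipfile.ZipFile(archive_path) as archive:
        return archive.read(member_name)
```

Calling `list_images` with the path to whichever archive was downloaded should return one entry per image. If the count is not 20,000, the extension assumption probably needs adjusting. The bytes returned by `read_image_bytes` can then be passed to an image library of your choice for decoding.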

Documentation

Attachment	Size
Readme file	123.15 KB

