ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems

Citation Author(s):
Yuan Gong, University of Notre Dame
Jian Yang, University of Notre Dame
Jacob Huber, University of Notre Dame
Mitchell MacKnight, University of Notre Dame
Christian Poellabauer, University of Notre Dame
Submitted by: Yuan Gong
Last updated: Tue, 06/23/2020 - 23:25
DOI: 10.21227/1mhq-c052

We introduce a new database of voice recordings to support research on the vulnerabilities and protection of voice-controlled systems (VCSs). In contrast to prior efforts, the database contains both genuine voice commands and replayed recordings of those commands, collected in realistic VCS usage scenarios using modern voice assistant development kits. Specifically, it contains recordings from four systems (each with a different microphone array) under a variety of environmental conditions, with different forms of background noise and different relative positions between speaker and device. To the best of our knowledge, this is the first publicly available database specifically designed for protecting state-of-the-art voice-controlled systems against replay attacks across a range of conditions and environments.

Instructions: 

The corpus consists of three sets: the core set, the evaluation set, and the complete set. The complete set contains all the data (i.e., complete set = core set + evaluation set) and lets the user split training and test data freely. The core and evaluation sets provide a suggested default training/test split. For each set, all *.wav files are in the /data directory and the meta information is in the meta.csv file. The protocol is described in readme.txt. A PyTorch data loader script is provided as an example of how to use the data. A Python resample script is provided for resampling the dataset to a desired sample rate.
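As a quick sanity check after downloading a set, the layout described above can be indexed with the Python standard library alone. The following is a minimal sketch, not the provided data loader: the function name `index_set` is hypothetical, the column layout of meta.csv is deliberately not assumed (rows are returned as raw strings, see readme.txt for their meaning), and the `wave` module is assumed to be able to read the files, which holds only for uncompressed PCM audio.

```python
import csv
import wave
from pathlib import Path


def index_set(set_dir):
    """Return (meta_rows, wav_info) for one corpus set directory.

    Assumes the layout <set_dir>/data/*.wav and <set_dir>/meta.csv.
    The meaning of the meta.csv columns is not assumed here; each row is
    returned as a plain list of strings.
    """
    set_dir = Path(set_dir)

    # Read every non-empty row of the meta file without interpreting it.
    with open(set_dir / "meta.csv", newline="") as f:
        meta_rows = [row for row in csv.reader(f) if row]

    # Collect basic header information for each recording.
    wav_info = {}
    for wav_path in sorted((set_dir / "data").glob("*.wav")):
        with wave.open(str(wav_path), "rb") as w:
            wav_info[wav_path.name] = {
                "channels": w.getnchannels(),
                "sample_rate": w.getframerate(),
                "frames": w.getnframes(),
            }
    return meta_rows, wav_info
```

Calling `index_set` on the directory of the core, evaluation, or complete set returns the raw meta rows and a per-file summary. Comparing the channel counts and sample rates across files is a cheap way to decide whether the provided resample script needs to be run before training.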

Documentation

Attachment               Size
protocol (plain text)    1.35 KB
paper (PDF)              2.62 MB

[1] Yuan Gong, Jian Yang, Jacob Huber, Mitchell MacKnight, Christian Poellabauer, "ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems", IEEE Dataport, 2020. [Online]. Available: http://dx.doi.org/10.21227/1mhq-c052. Accessed: Jul. 08, 2020.
@data{1mhq-c052-20,
  doi = {10.21227/1mhq-c052},
  url = {http://dx.doi.org/10.21227/1mhq-c052},
  author = {Yuan Gong and Jian Yang and Jacob Huber and Mitchell MacKnight and Christian Poellabauer},
  publisher = {IEEE Dataport},
  title = {ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems},
  year = {2020}
}