Deep Xi Training Set

Citation Author(s):
Aaron Nicolson, Signal Processing Laboratory, Griffith University
Submitted by:
Aaron Nicolson
Last updated:
Thu, 03/26/2020 - 01:19
DOI:
10.21227/3adt-pb04

The clean-speech and noise recordings used to train Deep Xi (https://github.com/anicolson/DeepXi). A validation set is also included.

Clean speech:

The clean-speech recordings are from the train-clean-100 set of Librispeech (http://www.openslr.org/12/) and from the CSTR VCTK corpus (https://homepages.inf.ed.ac.uk/jyamagis/page3/page58/page58.html). The VCTK recordings from speakers p232 and p257 are excluded, as they are used in the test set of the DEMAND Voicebank dataset (http://ssw9.talp.cat/papers/ssw9_PS2-4_Valentini-Botinhao.pdf).
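The speaker exclusion described above can be sketched as a simple filter. This is a minimal illustration, not the procedure used to build the dataset: it assumes VCTK-style file names of the form `p232_001.wav`, and the helper names `speaker_id` and `drop_excluded` as well as the example paths are hypothetical.

```python
from pathlib import PurePath

# Speakers held out because they appear in the DEMAND Voicebank test set.
EXCLUDED_SPEAKERS = {"p232", "p257"}


def speaker_id(path):
    """Return the speaker prefix of a VCTK-style file name (assumed form: p232_001.wav)."""
    return PurePath(path).name.split("_", 1)[0]


def drop_excluded(paths, excluded=EXCLUDED_SPEAKERS):
    """Keep only the recordings whose speaker is not in the excluded set."""
    return [p for p in paths if speaker_id(p) not in excluded]


# Hypothetical paths, for illustration only.
example = [
    "wav48/p225/p225_001.wav",
    "wav48/p232/p232_001.wav",
    "wav48/p257/p257_010.wav",
]
kept = drop_excluded(example)
print(kept)
```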

Noise:

The noise recordings are from the following sources: the Environmental Background Noise dataset (https://personal.utdallas.edu/~nxk019000/VAD-dataset/); the Nonspeech dataset (http://web.cse.ohio-state.edu/pnl/corpus/HuNonspeech/HuCorpus.html); the QUT-NOISE dataset (https://research.qut.edu.au/saivt/databases/qut-noise-databases-and-protocols/); multiple Freesound packs (https://freesound.org/); the noise set of the MUSAN corpus (https://www.openslr.org/17/); the RSG-10 noise database (http://www.steeneken.nl/wp-content/uploads/2014/04/RSG-10_Noise-data-base.pdf); and the Urban Sound dataset (http://www.justinsalamon.com/uploads/4/3/9/4/4394963/salamon_urbansound_acmmm14.pdf). Two exclusions apply, because the recordings are used in the Deep Xi Test Set and the Test Set From 10.1016/J.SPECOM.2019.06.002: voice babble, F16, and factory (welding) from RSG-10, and street music recording no. 26270 from Urban Sound.

Note that the clean-speech and noise recordings used for this training set are separate from those used in the following test sets: Deep Xi Test Set, the Test Set From 10.1016/J.SPECOM.2019.06.002, and the DEMAND Voicebank test set (http://ssw9.talp.cat/papers/ssw9_PS2-4_Valentini-Botinhao.pdf).
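One rough way to sanity-check this separation on a local copy is to compare file names between a training list and a test list. The sketch below is an assumption-laden illustration: the function name `overlap` and the example paths are hypothetical, and a name comparison only catches files that kept the same name in both sets, not re-encoded or renamed copies.

```python
from pathlib import PurePath


def overlap(train_paths, test_paths):
    """Return the sorted file names that occur in both lists, ignoring directories."""
    train_names = {PurePath(p).name for p in train_paths}
    test_names = {PurePath(p).name for p in test_paths}
    return sorted(train_names & test_names)


# Hypothetical file lists, for illustration only.
train = ["train/a.wav", "train/b.wav"]
test = ["test/c.wav"]
shared = overlap(train, test)
print(shared)
```

An empty result is consistent with the sets being separate, but it does not prove it.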

Instructions: 

The directories are pre-configured for Deep Xi, as seen here: https://github.com/anicolson/DeepXi/tree/master/set.
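After downloading and extracting the files, a quick inventory can confirm that each directory contains recordings. The sketch below makes no assumption about the directory names; it counts `.wav` files under every immediate subdirectory of a given root. The function name `count_wavs` and the path `set` in the usage line are hypothetical, and the `.wav` extension is assumed.

```python
from pathlib import Path


def count_wavs(set_dir):
    """Map each immediate subdirectory of set_dir to its number of .wav files (searched recursively)."""
    root = Path(set_dir)
    return {
        sub.name: sum(1 for _ in sub.rglob("*.wav"))
        for sub in sorted(root.iterdir())
        if sub.is_dir()
    }
```

For example, `count_wavs("set")` would return one entry per subdirectory of a local `set` folder.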

[1] Aaron Nicolson, "Deep Xi Training Set", IEEE Dataport, 2020. [Online]. Available: http://dx.doi.org/10.21227/3adt-pb04. Accessed: Apr. 04, 2020.
@data{3adt-pb04-20,
  doi = {10.21227/3adt-pb04},
  url = {http://dx.doi.org/10.21227/3adt-pb04},
  author = {Aaron Nicolson},
  publisher = {IEEE Dataport},
  title = {Deep Xi Training Set},
  year = {2020}
}