This dataset is part of our research on malware detection and classification using Deep Learning. It contains 42,797 malware API call sequences and 1,079 goodware API call sequences. Each API call sequence is composed of the first 100 non-repeated consecutive API calls associated with the parent process, extracted from the 'calls' elements of Cuckoo Sandbox reports.

Dataset Files

You must be an IEEE Dataport Subscriber to access these files. Subscribe now or login.

[1] Angelo Oliveira, "Malware Analysis Datasets: API Call Sequences", IEEE Dataport, 2019. [Online]. Available: http://dx.doi.org/10.21227/tqqm-aq14. Accessed: Feb. 08, 2025.
@data{tqqm-aq14-19,
doi = {10.21227/tqqm-aq14},
url = {http://dx.doi.org/10.21227/tqqm-aq14},
author = {Angelo Oliveira },
publisher = {IEEE Dataport},
title = {Malware Analysis Datasets: API Call Sequences},
year = {2019} }
TY - DATA
T1 - Malware Analysis Datasets: API Call Sequences
AU - Angelo Oliveira
PY - 2019
PB - IEEE Dataport
UR - 10.21227/tqqm-aq14
ER -
Angelo Oliveira. (2019). Malware Analysis Datasets: API Call Sequences. IEEE Dataport. http://dx.doi.org/10.21227/tqqm-aq14
Angelo Oliveira, 2019. Malware Analysis Datasets: API Call Sequences. Available at: http://dx.doi.org/10.21227/tqqm-aq14.
Angelo Oliveira. (2019). "Malware Analysis Datasets: API Call Sequences." Web.
1. Angelo Oliveira. Malware Analysis Datasets: API Call Sequences [Internet]. IEEE Dataport; 2019. Available from : http://dx.doi.org/10.21227/tqqm-aq14
Angelo Oliveira. "Malware Analysis Datasets: API Call Sequences." doi: 10.21227/tqqm-aq14