Malware Analysis Datasets: API Call Sequences

Citation Author(s):
Angelo
Oliveira
Submitted by:
Angelo Oliveira
Last updated:
Wed, 12/11/2019 - 20:28
DOI:
10.21227/tqqm-aq14
Data Format:
License:
0
0 ratings - Please login to submit your rating.

Abstract 

This dataset is part of our research on malware detection and classification using Deep Learning. It contains 42,797 malware API call sequences and 1,079 goodware API call sequences. Each API call sequence is composed of the first 100 non-repeated consecutive API calls associated with the parent process, extracted from the 'calls' elements of Cuckoo Sandbox reports.

Comments

Hi Oliveira,

First of all, Great work on producing a great amount of dataset! I wanted to use it in my research for cyber security and I wanted to know how many classes of malware have been used in this dataset, and which are they? Kindly send me some analysis  and information about this dataset. I will be forever grateful.

Yours Sincerely,

Richa Dasila.

Masters[Cyber Security].

Submitted by Richa Dasila on Thu, 03/14/2024 - 02:41

Dataset Files

LOGIN TO ACCESS DATASET FILES
Open Access dataset files are accessible to all logged in  users. Don't have a login?  Create a free IEEE account.  IEEE Membership is not required.