Cyber Threat Intelligence (CTI) dataset generated from public security reports and malware repositories

- Submitted by: Daegeon Kim
- DOI: 10.21227/dpat-qd69
Abstract
This dataset contains Cyber Threat Intelligence (CTI) data generated from public security reports and malware repositories.
The dataset is stored in a structured format (XML) and includes approximately 640,000 records from 612 security reports published from January 2008 to June 2019.
The dataset covers several indicator types: URLs, hosts, IP addresses, e-mail accounts, hashes (MD5, SHA1, and SHA256), Common Vulnerabilities and Exposures (CVE) identifiers, registry entries, file names ending with specific extensions, and program database (PDB) paths.
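As an illustration of how some of these indicator types can be told apart, the following is a minimal sketch, not the method used by the dataset's generation system. It assumes indicators are available as plain strings; the XML schema is not described on this page, so no parsing is attempted. The helper name `classify_indicator` is hypothetical. It relies only on the fixed hexadecimal lengths of MD5 (32), SHA1 (40), and SHA256 (64) digests and on the standard `CVE-YYYY-NNNN` identifier syntax.

```python
import re

# Hypothetical helper for illustration; not part of the dataset or its tooling.
_HEX = re.compile(r"^[0-9a-fA-F]+$")
# CVE identifiers: "CVE-", a four-digit year, then four or more digits.
_CVE = re.compile(r"^CVE-\d{4}-\d{4,}$", re.IGNORECASE)
# Hex digest lengths for the three hash types named in the abstract.
_HASH_LENGTHS = {32: "MD5", 40: "SHA1", 64: "SHA256"}


def classify_indicator(value: str) -> str:
    """Return "CVE", "MD5", "SHA1", "SHA256", or "other" for a raw string."""
    value = value.strip()
    if _CVE.match(value):
        return "CVE"
    if _HEX.match(value) and len(value) in _HASH_LENGTHS:
        return _HASH_LENGTHS[len(value)]
    # URLs, hosts, IP addresses, registry entries, etc. are not handled here.
    return "other"
```

For example, `classify_indicator("CVE-2017-0144")` returns `"CVE"`, while a 32-character hexadecimal string is reported as `"MD5"`. Length alone cannot distinguish a real digest from any other hex string of the same length, so the result should be treated as a guess.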
Instructions:
For more information about the dataset and the system that generates it, please see the following paper:
Daegeon Kim and Huy Kang Kim, “Automated Dataset Generation System for Collaborative Research of Cyber Threat Analysis,” Security and Communication Networks, vol. 2019, Article ID 6268476, 10 pages, 2019. https://doi.org/10.1155/2019/6268476.