Datasets
Open Access
Cyber Threat Intelligent (CTI) dataset generated from public security reports and malware repositories
- Citation Author(s):
- Submitted by:
- Daegeon Kim
- Last updated:
- Sat, 01/22/2022 - 01:33
- DOI:
- 10.21227/dpat-qd69
- Data Format:
- Links:
- License:
- Categories:
- Keywords:
Abstract
This dataset contains Cyber Threat Intelligence (CTI) data generated from public security reports and malware repositories.
The dataset is stored in a structured format (XML) and includes approximately 640,000 records from 612 security reports published from January 2008 to June 2019.
Several data types are contained in this dataset such as URL, host, IP address, e-mail account, hashes (MD5, SHA1, and SHA256), common vulnerabilities and exposures (CVE), registry, file names ending with specific extensions, and the program database (PDB) path.
For more instruction about the dataset as well as the system generating the dataset, please see following paper:
Daegeon Kim and Huy Kang Kim, “Automated Dataset Generation System for Collaborative Research of Cyber Threat Analysis,” Security and Communication Networks, vol. 2019, Article ID 6268476, 10 pages, 2019. https://doi.org/10.1155/2019/6268476.
Dataset Files
- CTIDataset.zip (2.67 MB)
Open Access dataset files are accessible to all logged in users. Don't have a login? Create a free IEEE account. IEEE Membership is not required.
Comments
pl provide access
Hello,
why is it specified that the dataset is in JSON format, but the one available for download is in XML format?
thank you
It should be on this page - right-hand side; ctrl-f "CTIDataset.zip" (no quotes - and go to the second result now that this comment will steal the first one)
Sorry, but this dataset is not in Json format.
Please rephrase this part of the dataset description.
Thank you
Correct!
The dataset format is XML.
The description is modified now.
Thank you.
Correct!
The dataset format is XML.
The description is modified now.
Thank you.
hello, the data set of "CTIDataset.zip" is in xml format, I connot find the json format. If possible, can I send a copy to my mailbox? 1120382898@qq.com Thank you
You can easily change the format from XML to JSON using free converters.