Standard Dataset
BCSA
- Submitted by: Jiang Du
- Last updated: Fri, 10/04/2024 - 08:50
- DOI: 10.21227/ndm2-yt19
Abstract
This dataset is derived from the original dataset published by Dongkwan Kim et al. in their paper "Revisiting Binary Code Similarity Analysis using Interpretable Feature Engineering and Lessons Learned."
The main modifications include:
- Reduction of the original dataset size by selecting representative samples.
- Feature extraction from the original ELF files, transforming binary code into vector representations.
These modifications aim to enhance the dataset's usability, facilitating research in binary code similarity analysis and related fields. When using this dataset, please cite the original paper and acknowledge the data source.
We hope this dataset will be beneficial for research in relevant areas. For any questions or concerns, please contact the publisher.
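As a rough illustration of what "transforming binary code into vector representations" can mean, the following minimal sketch turns the raw bytes of an ELF file into a fixed-length vector and compares two such vectors. It is not the feature extraction used to build this dataset, which this page does not specify. The normalized byte histogram and the helper names `is_elf`, `byte_histogram`, and `cosine_similarity` are assumptions chosen only for illustration.

```python
import math
from collections import Counter
from typing import List

# Every ELF file starts with these four magic bytes.
ELF_MAGIC = b"\x7fELF"


def is_elf(data: bytes) -> bool:
    """Return True if the buffer starts with the ELF magic number."""
    return data[:4] == ELF_MAGIC


def byte_histogram(data: bytes) -> List[float]:
    """Hypothetical feature: normalized frequency of each byte value (256 bins)."""
    if not data:
        return [0.0] * 256
    counts = Counter(data)  # iterating over bytes yields ints in 0..255
    total = len(data)
    return [counts.get(value, 0) / total for value in range(256)]


def cosine_similarity(u: List[float], v: List[float]) -> float:
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


if __name__ == "__main__":
    # Synthetic stand-in for a real file: ELF magic followed by padding bytes.
    sample = ELF_MAGIC + bytes(12)
    if is_elf(sample):
        vector = byte_histogram(sample)
        print(len(vector), cosine_similarity(vector, vector))
```

For real files, the buffer would come from something like `open(path, "rb").read()`. Learned or structural features (for example, counts derived from a control-flow graph) would replace the byte histogram in practice.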