BCSA

Citation Author(s):
Jiang
Du
Information Engineering University
Submitted by:
Jiang Du
Last updated:
Fri, 10/04/2024 - 08:50
DOI:
10.21227/ndm2-yt19
License:
20 Views
Categories:
Keywords:
0
0 ratings - Please login to submit your rating.

Abstract 

This dataset is derived from the original dataset published by Dongkwan Kim et al. in their paper "Revisiting Binary Code Similarity Analysis using Interpretable Feature Engineering and Lessons Learned."

The main modifications include:

  1. Reduction of the original dataset size, selecting representative samples.
  2. Feature extraction from the original ELF files, transforming binary code into vector representations.

These modifications aim to enhance the dataset's usability, facilitating research in binary code similarity analysis and related fields. When using this dataset, please cite the original paper and acknowledge the data source.

We hope this dataset will be beneficial for research in relevant areas. For any questions or concerns, please contact the publisher.

Instructions: 

This dataset is derived from the original dataset published by Dongkwan Kim et al. in their paper "Revisiting Binary Code Similarity Analysis using Interpretable Feature Engineering and Lessons Learned."

The main modifications include:

  1. Reduction of the original dataset size, selecting representative samples.
  2. Feature extraction from the original ELF files, transforming binary code into vector representations.

These modifications aim to enhance the dataset's usability, facilitating research in binary code similarity analysis and related fields. When using this dataset, please cite the original paper and acknowledge the data source.

We hope this dataset will be beneficial for research in relevant areas. For any questions or concerns, please contact the publisher.

Dataset Files

    Files have not been uploaded for this dataset