Datasets
Standard Dataset
Community-imbalanced graph dataset
- Citation Author(s):
- Submitted by:
- Genghuai Bai
- Last updated:
- Thu, 11/28/2024 - 21:56
- DOI:
- 10.21227/qffd-ec06
- Data Format:
- License:
- Categories:
- Keywords:
Abstract
The dataset are served for community-imbalanced graph sampling algorithm performance experiments. In the algorithm performance experiment, we selected 30 graph datasets, 15 of which were derived from real-world graph datasets (https://snap.stanford.edu/data/), and 15 were adapted from real-world datasets or simulated datasets. The simulated datasets can address the problems of incomplete coverage of imbalance degree of community and uncertain community detection results in real-world datasets. These 30 datasets encompass varying node and edge scales, different numbers of communities, and varying degrees of community imbalance. Moreover, these datasets encompass various application domains. Details about these graph datasets were provided in dataset description.
There are a total of 30 objective data sets, which are saved as.csv files. Thesecsv files hold all the side information of the graph in the form of source and target, one side per line.
Documentation
Attachment | Size |
---|---|
dataset description.docx | 86.32 KB |