Community-imbalanced graph dataset

Citation Author(s):
Genghuai
Bai
Central South University
Submitted by:
Genghuai Bai
Last updated:
Thu, 11/28/2024 - 21:56
DOI:
10.21227/qffd-ec06
Data Format:
License:
0
0 ratings - Please login to submit your rating.

Abstract 

The dataset are served for community-imbalanced graph sampling algorithm performance experiments. In the algorithm performance experiment, we selected 30 graph datasets, 15 of which were derived from real-world graph datasets (https://snap.stanford.edu/data/), and 15 were adapted from real-world datasets or simulated datasets. The simulated datasets can address the problems of incomplete coverage of imbalance degree of community and uncertain community detection results in real-world datasets. These 30 datasets encompass varying node and edge scales, different numbers of communities, and varying degrees of community imbalance. Moreover, these datasets encompass various application domains. Details about these graph datasets were provided in dataset description. 

Instructions: 

There are a total of 30 objective data sets, which are saved as.csv files. Thesecsv files hold all the side information of the graph in the form of source and target, one side per line.

 

Documentation

AttachmentSize
File dataset description.docx86.32 KB