Datasets
Standard Dataset
Arxiv
- Citation Author(s):
- Submitted by:
- Feihu Che
- Last updated:
- Wed, 11/13/2024 - 08:47
- DOI:
- 10.21227/6v50-r718
- License:
20 Views
- Categories:
- Keywords:
0 ratings - Please login to submit your rating.
Abstract
The Ogbn-Arxiv dataset (Arxiv for short) represents an academic citation network. In this network structure, papers serve as nodes, citations between papers form edges, and paper abstracts constitute the textual attributes. The primary task involves subject prediction for papers. We utilize the publicly available partitions, ground truth labels, and textual data from OGB
Instructions:
The dataset has 169343 nodes, 1166243 edges, and the train, valid, test ratios are 0.54, 0.18, 0.28. The task is to classify each node (one paper) into one class.
Comments
.