Skip to main content

Datasets

Standard Dataset

Arxiv

Citation Author(s):
feihu che
Submitted by:
Feihu Che
Last updated:
DOI:
10.21227/6v50-r718
38 views
Categories:
Keywords:
No Ratings Yet

Abstract

The Ogbn-Arxiv dataset (Arxiv for short) represents an academic citation network. In this network structure, papers serve as nodes, citations between papers form edges, and paper abstracts constitute the textual attributes. The primary task involves subject prediction for papers. We utilize the publicly available partitions, ground truth labels, and textual data from OGB

Instructions:

The dataset has 169343 nodes, 1166243 edges, and the train, valid, test ratios are 0.54, 0.18, 0.28. The task is to classify each node (one paper) into one class.

Dataset Files

Files have not been uploaded for this dataset

More from this Author