Arxiv

Citation Author(s):: feihu che
Submitted by:: Feihu Che
Last updated:: Wed, 11/13/2024 - 13:47
DOI:: 10.21227/6v50-r718

38 views

Categories:

Artificial Intelligence

Keywords:

academic citation network

ACCESS DATASET CITE

Abstract

The Ogbn-Arxiv dataset (Arxiv for short) represents an academic citation network. In this network structure, papers serve as nodes, citations between papers form edges, and paper abstracts constitute the textual attributes. The primary task involves subject prediction for papers. We utilize the publicly available partitions, ground truth labels, and textual data from OGB

Instructions:

The dataset has 169343 nodes, 1166243 edges, and the train, valid, test ratios are 0.54, 0.18, 0.28. The task is to classify each node (one paper) into one class.