In this paper, two datasets for text classification were primarily used in the experiments: AG News and IMDB. The AG News dataset is a widely used four-class news dataset, including four categories: World News, Sports News, Business News, and Technology News. The dataset contains a total of 120,000 samples, with 114,000 samples in the training set and the remaining 6,000 samples in the test set. The IMDB dataset is a movie review dataset used for sentiment analysis, primarily for binary classification tasks, i.e., positive and negative reviews.

Dataset Files

You must be an IEEE Dataport Subscriber to access these files. Subscribe now or login.

[1] Xinxin Li, "AG News and IMDB", IEEE Dataport, 2024. [Online]. Available: http://dx.doi.org/10.21227/f9vv-5898. Accessed: Feb. 11, 2025.
@data{f9vv-5898-24,
doi = {10.21227/f9vv-5898},
url = {http://dx.doi.org/10.21227/f9vv-5898},
author = {Xinxin Li },
publisher = {IEEE Dataport},
title = {AG News and IMDB},
year = {2024} }
TY - DATA
T1 - AG News and IMDB
AU - Xinxin Li
PY - 2024
PB - IEEE Dataport
UR - 10.21227/f9vv-5898
ER -
Xinxin Li. (2024). AG News and IMDB. IEEE Dataport. http://dx.doi.org/10.21227/f9vv-5898
Xinxin Li, 2024. AG News and IMDB. Available at: http://dx.doi.org/10.21227/f9vv-5898.
Xinxin Li. (2024). "AG News and IMDB." Web.
1. Xinxin Li. AG News and IMDB [Internet]. IEEE Dataport; 2024. Available from : http://dx.doi.org/10.21227/f9vv-5898
Xinxin Li. "AG News and IMDB." doi: 10.21227/f9vv-5898