This dataset consists of 737 documents from the BBC Sport website, corresponding to sports news articles in five topical areas from 2004-2005. The class labels are divided into five categories: athletics, cricket, football, rugby, and tennis. The datasets have been pre-processed using the Porter stemming algorithm, stop-word removal, and filtering out terms with low frequency (count < 3).

Dataset Files

You must be an IEEE Dataport Subscriber to access these files. Subscribe now or login.

Documentation: 
AttachmentSize
File bbcsport.docx10.69 KB
[1] Lina Ren, "BBCSport", IEEE Dataport, 2024. [Online]. Available: http://dx.doi.org/10.21227/re2y-ph59. Accessed: Dec. 21, 2024.
@data{re2y-ph59-24,
doi = {10.21227/re2y-ph59},
url = {http://dx.doi.org/10.21227/re2y-ph59},
author = {Lina Ren },
publisher = {IEEE Dataport},
title = {BBCSport},
year = {2024} }
TY - DATA
T1 - BBCSport
AU - Lina Ren
PY - 2024
PB - IEEE Dataport
UR - 10.21227/re2y-ph59
ER -
Lina Ren. (2024). BBCSport. IEEE Dataport. http://dx.doi.org/10.21227/re2y-ph59
Lina Ren, 2024. BBCSport. Available at: http://dx.doi.org/10.21227/re2y-ph59.
Lina Ren. (2024). "BBCSport." Web.
1. Lina Ren. BBCSport [Internet]. IEEE Dataport; 2024. Available from : http://dx.doi.org/10.21227/re2y-ph59
Lina Ren. "BBCSport." doi: 10.21227/re2y-ph59