Fake News Dataset
- Submitted by: Utkarsh Rajput
- Last updated: Tue, 08/06/2024 - 03:33
- DOI: 10.21227/avrv-tp46
Abstract
The dataset contains two types of articles: fake news and real news. It was collected from real-world sources. The truthful articles were obtained by crawling Reuters.com (a news website). The fake news articles were collected from several unreliable websites that were flagged by Politifact (a fact-checking organization in the USA) and Wikipedia. The articles cover a range of topics, but the majority focus on politics and world news.
The dataset consists of two CSV files. The files named “train.csv” and “Test.csv” contain more than 12,600 articles from Reuters.com and also contain articles from various fake news outlets. Each article includes the following information: article title, text, type, and the date the article was published. We focused mostly on collecting articles from 2016 to 2024. The collected data were cleaned and processed; however, the punctuation and mistakes that existed in the fake news articles were kept in the text.
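As a rough illustration of how the files described above might be read, the following sketch loads one CSV and counts articles per type. The column names `title`, `text`, `type`, and `date` are assumptions based on the field list in this abstract, and the helper names `load_articles` and `count_by_type` are hypothetical; the actual headers in the files may differ.

```python
import csv
from collections import Counter

# Assumed column names: the abstract lists the fields (title, text, type,
# date) but not the exact CSV headers, so adjust these to match the files.
EXPECTED_COLUMNS = ("title", "text", "type", "date")


def load_articles(handle):
    """Read articles from an open CSV file handle into a list of dicts."""
    reader = csv.DictReader(handle)
    header = reader.fieldnames or []
    missing = [col for col in EXPECTED_COLUMNS if col not in header]
    if missing:
        # Fail early so a header mismatch is visible instead of silent.
        raise ValueError(f"missing expected columns: {missing}")
    return list(reader)


def count_by_type(articles):
    """Count how many articles carry each value of the assumed 'type' column."""
    return Counter(row["type"] for row in articles)
```

To try it, open a file with something like `open("train.csv", newline="", encoding="utf-8")` and pass the handle to `load_articles`; the UTF-8 encoding is also an assumption. Because the text is kept with its original punctuation and mistakes, the sketch does no normalization of its own.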