Skip to main content

Datasets

Standard Dataset

COVID-19 News Articles

Citation Author(s):
Piyush Ghasiya (Kyushu University)
Koji Okamura (Kyushu University)
Submitted by:
Piyush Ghasiya
Last updated:
DOI:
10.21227/gdq8-ej60
Data Format:
Research Article Link:
No Ratings Yet

Abstract

This dataset is consist news articles related to COVID-19 from UK, India, Japan and South Korea newspapers. 

Instructions:

All the files are in .csv format and can be directly to various Natural Language Processing tasks such as topic modeling and sentiment analysis.

In the Data file, two folders are present: 1) Sentiment Analysis and 2) Topic Modeling. The Sentiment Analysis folder contain 6 files: 1) All_Label_Covid_Headlines (CSV file), 2) checkpoint-03-0.9000.h5, 3) Headline_India (CSV file), 4) Headline_Japan (CSV file), 5) Headline_Korea (CSV file), and 6) Headline_UK (CSV file). The Topic Modeling folder contain 8 files: 1) India_Articles (CSV file), 2) India_Covid-19_Top2Vec_model, 3) Japan_Articles (CSV file), 4) Japan_Covid-19_Top2Vec_model, 5) Korea_Articles (CSV file), 6) Korea_Covid-19_Top2Vec_model, 7) UK_Articles (CSV file), and 8) UK_Covid-19_Top2Vec_model.
Piyush Ghasiya Wed, 01/20/2021 - 21:11 Permalink