Datasets
Standard Dataset
Salient sentence extraction dataset from COVID-19 news reports
- Citation Author(s):
- Submitted by:
- Sumanta Banerjee
- Last updated:
- Tue, 01/31/2023 - 05:19
- DOI:
- 10.21227/g4z2-ab91
- Data Format:
- License:
126 Views
- Categories:
- Keywords:
0 ratings - Please login to submit your rating.
Abstract
The dataset contains labeled sentences. The sentences having information related to (1) infections, (2) suffering from pneumonea, (3) deaths, and (4) health updates from government/WHO, are labeled with 1 and the rest are labeled with 0. Source of all the news articles: https://www.thehindu.com/archive/
Instructions:
There is a file with labeled sentences and another with the document IDs of each of the sentences.