The dataset contains labeled sentences. The sentences having information related to (1) infections, (2) suffering from pneumonea, (3) deaths, and (4) health updates from government/WHO, are labeled with 1 and the rest are labeled with 0. Source of all the news articles: https://www.thehindu.com/archive/
Multi-label event classification label of each sample-document is done with nine bits. The first bit signifies whether an event is present or absent with 1 or 0 respectively. The remaining eight bits signifies presence or absence of (i) covid, (ii) flood, (iii) storm, (iv) heavy rain, (v) cloudburst, (vi) landslide, (vii) earthquake, (viii) Tsunami with 1 or 0. The location and the impact sentence classification labeling are similar.
The disaster-news healline generation dataset (news_articles_and _titles) contains a set of disaster-news articles and their headlines/titles. This dataset may be used to develop a method to generate a good quality headline for a disaster-news article.