This dataset comprises 500,153 Instagram posts related to COVID-19, published between January 2020 and September 2024. The posts span 161 unique languages and include a total of 535,021 distinct hashtags. Following the creation of the dataset, a multilingual sentiment analysis was conducted. This analysis classified each post into one of three categories: positive, negative, or neutral. The sentiment classification is provided as an additional attribute within the dataset, offering a comprehensive overview of the emotional tone expressed across the posts.
To download the dataset without purchasing an IEEE Dataport subscription, please visit: https://zenodo.org/records/13738598
Please cite the following paper when using this dataset:
To access this dataset without purchasing an IEEE Dataport subscription, please visit: https://zenodo.org/doi/10.5281/zenodo.11711229
Please cite the following paper when using this dataset:
Please cite the following paper when using this dataset:
N. Thakur, S. Cui, K. A. Patel, N. Azizi, V. Knieling, C. Han, A. Poon, and R. Shah, “Marburg Virus Outbreak and a New Conspiracy Theory: Findings from a Comprehensive Analysis and Forecasting of Web Behavior,” Journal of Computation, Vol. 11, Issue. 11, Article. 234, Nov. 2023, DOI: http://dx.doi.org/10.3390/computation11110234
Abstract
Please cite the following paper when using this dataset:
N. Thakur, K. A. Patel, I. Hall, Y. N. Duggal, and S. Cui, “A Dataset of Search Interests related to Disease X originating from different Geographic Regions”, Preprints 2023, 2023081701, DOI: https://doi.org/10.20944/preprints202308.1701.v1
Abstract:
Please cite the following paper when using this dataset:
N. Thakur and C.Y. Han, “An Exploratory Study of Tweets about the SARS-CoV-2 Omicron Variant: Insights from Sentiment Analysis, Language Interpretation, Source Tracking, Type Classification, and Embedded URL Detection,” Journal of COVID, 2022, Volume 5, Issue 3, pp. 1026-1049
Abstract
Please cite the following paper when using this dataset:
N. Thakur, “A Large-Scale Dataset of Twitter Chatter about Online Learning during the Current COVID-19 Omicron Wave,” Journal of Data, vol. 7, no. 8, p. 109, Aug. 2022, doi: 10.3390/data7080109
Abstract
Please cite the following paper when using this dataset:
N. Thakur, “MonkeyPox2022Tweets: A large-scale Twitter dataset on the 2022 Monkeypox outbreak, findings from analysis of Tweets, and open research questions,” Infect. Dis. Rep., vol. 14, no. 6, pp. 855–883, 2022, DOI: https://doi.org/10.3390/idr14060087.
Abstract
Please cite the following paper when using this dataset:
N. Thakur, "Twitter Big Data as a Resource for Exoskeleton Research: A Large-Scale Dataset of about 140,000 Tweets from 2017–2022 and 100 Research Questions", Journal of Analytics, Volume 1, Issue 2, 2022, pp. 72-97, DOI: https://doi.org/10.3390/analytics1020007
Abstract
Any work using this dataset should cite this paper as follows:
Nirmalya Thakur and Chia Y. Han, "Country-Specific Interests towards Fall Detection from 2004–2021: An Open Access Dataset and Research Questions", Journal of Data, Volume 6, Issue 8, pp. 1-21, 2021.
Abstract