Name: COVID-19 tweets dataset for Bengali language
Creator: Avishek Garain
License: https://creativecommons.org/licenses/by/4.0/
Keywords: Artificial Intelligence, COVID-19, Machine Learning, Biomedical and Health Sciences, Other

Abstract

This dataset contains Bengali tweets related to COVID-19. It comprises 36,117 unique tweet IDs covering December 2019 through May 2020. The keywords used to crawl the tweets are 'corona', 'covid', 'sarscov2', 'covid19', and 'coronavirus'. To obtain the other 33 fields of data, send an email to "avishekgarain@gmail.com". A code snippet is given in the documentation file. Only tweet IDs are shared here because publicly sharing Twitter data other than tweet IDs violates Twitter's policies.

Instructions

The script to load the data is provided in the documentation file.
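The author's own loading script lives in doc-corona.txt and is not reproduced on this page. As a stand-in, here is a minimal Python sketch of how the tweet IDs might be read straight from the downloaded archive. It assumes, without confirmation from the source, that the files inside corona_ids_bangla.zip are plain text or CSV with one tweet ID at the start of each line. The function name load_tweet_ids is hypothetical.

```python
import io
import zipfile


def load_tweet_ids(zip_path):
    """Collect unique tweet IDs from every file inside the archive.

    Assumption (not confirmed by the dataset page): each line starts with
    one numeric tweet ID, optionally followed by comma-separated fields.
    """
    ids = set()
    with zipfile.ZipFile(zip_path) as archive:
        for name in archive.namelist():
            if name.endswith("/"):
                continue  # skip directory entries
            with archive.open(name) as raw:
                text = io.TextIOWrapper(raw, encoding="utf-8", errors="ignore")
                for line in text:
                    token = line.strip().split(",")[0].strip().strip('"')
                    # Header rows and blank lines fail this check and are skipped.
                    if token.isascii() and token.isdigit():
                        ids.add(token)
    # Keep IDs as strings (they exceed 32-bit range) but order them numerically.
    return sorted(ids, key=int)
```

Calling load_tweet_ids("corona_ids_bangla.zip") would return a sorted list of ID strings. If the archive layout matches the assumption above, the length of that list can be compared against the 36,117 unique IDs stated in the abstract as a sanity check.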

Comments

To select the BibTeX contents, double-click the IEEE contents, then press Ctrl+C to copy. This is a bug, and we need to wait until it is fixed. Until then, this is how you can cite the dataset.

Submitted by Avishek Garain on Tue, 10/06/2020 - 03:45

The bug has been fixed.

Submitted by Avishek Garain on Tue, 11/24/2020 - 05:11

great

Submitted by Abdullah Yahya Amer on Tue, 02/02/2021 - 08:32

Dataset Files

corona_ids_bangla.zip (147.91 kB)

Open Access dataset files are accessible to all logged-in users with a free IEEE account. IEEE membership is not required.

Documentation

Attachment	Size
doc-corona.txt	487 bytes
