Datasets
Open Access
COVID-19 tweets dataset for Bengali language
- Citation Author(s):
- Submitted by:
- Avishek Garain
- Last updated:
- Thu, 06/11/2020 - 09:08
- DOI:
- 10.21227/wdt0-ya78
- Data Format:
- Links:
- License:
1481 Views
- Categories:
- Keywords:
0 ratings - Please login to submit your rating.
Abstract
This dataset is very vast and contains Bengali tweets related to COVID-19. There are 36117 unique tweet-ids in the whole dataset that ranges from December 2019 till May 2020 . The keywords that have been used to crawl the tweets are 'corona', , 'covid ' , 'sarscov2 ', 'covid19', 'coronavirus '. For getting the other 33 fields of data drop a mail at "avishekgarain@gmail.com". Code snippet is given in Documentation file. Sharing Twitter data other than Tweet ids publicly violates Twitter regulation policies.
Instructions:
The script to load data is written in documentation.
Dataset Files
- corona_ids_bangla.zip (147.91 kB)
Open Access dataset files are accessible to all logged in users. Don't have a login? Create a free IEEE account. IEEE Membership is not required.
Documentation
Attachment | Size |
---|---|
doc-corona.txt | 487 bytes |
Comments
For selecting Bibtex contents, double click on IEEE contents. Then use Ctrl+C to copy. It's a bug and we need to wait till its fixed. Till then this is how you can cite.
The bug has been fixed.
great