Coronavirus (COVID-19) Geo-tagged Tweets Dataset

Abstract 

This dataset contains IDs and sentiment scores of geo-tagged tweets related to the COVID-19 pandemic. The real-time Twitter feed is monitored for coronavirus-related tweets using 90+ keywords and hashtags commonly used when referencing the pandemic. To comply with Twitter's content redistribution policy, only the tweet IDs are shared. You can reconstruct the dataset by hydrating these IDs. For detailed instructions on hydrating tweet IDs, please read this article. The tweet IDs in this dataset belong to tweets that were created with an exact location attached. The paper associated with this dataset is available here: Design and analysis of a large-scale COVID-19 tweets dataset.

Below is a quick overview of this dataset.

— Dataset name: GeoCOV19Tweets Dataset

— Number of tweets : 429,761 tweets

— Coverage : Global

— Language : English (EN)

— Dataset usage terms : By using this dataset, you agree to (i) use the content of this dataset and the data generated from the content of this dataset for non-commercial research only, (ii) remain in compliance with Twitter's Policy and (iii) cite the following paper:

Lamsal, R. Design and analysis of a large-scale COVID-19 tweets dataset. Appl Intell 51, 2790–2804 (2021). https://doi.org/10.1007/s10489-020-02029-z

BibTeX:

@article{lamsal2021design,
title={Design and analysis of a large-scale COVID-19 tweets dataset},
author={Lamsal, Rabindra},
journal={Applied Intelligence},
volume={51},
number={5},
pages={2790--2804},
year={2021},
publisher={Springer}
}

— Primary dataset : Coronavirus (COVID-19) Tweets Dataset (COV19Tweets Dataset)

— Dataset updates : Every day

— Keywords and hashtags: keywords.tsv

Please visit this page (primary dataset) for more details.

Collection date & Number of tweets

(2020) March 20 - March 21: 1290 tweets

(2020) March 21 - March 22: 1020 tweets

(2020) March 22 - March 23: 1069 tweets

(2020) March 23 - March 24: 1072 tweets

(2020) March 24 - March 25: 949 tweets

(2020) March 25 - March 26: 913 tweets

(2020) March 26 - March 27: 810 tweets

(2020) March 27 - March 28: 855 tweets

(2020) March 28 - March 29: 828 tweets

(2020) March 29 - March 30: 5318 tweets (this file was added on June 29, 2021. Its primary file, corona_tweets_11b.csv, was created with retweets excluded right at the API level, which is why this day has significantly more geo-tagged tweets than other days. Twitter's full-search endpoint was asked to create the corpus without retweets; retweets have NULL geo and place objects, and since they were excluded I obtained 5318 geo-tagged tweets out of the 1,677,362 tweets collected for this day. This was quite an interesting observation.)

(2020) March 30 - March 31: 538 tweets

(2020) March 31 - April 1: 636 tweets

(2020) April 1 - April 2: 608 tweets

(2020) April 2 - April 3: 661 tweets

(2020) April 3 - April 4: 592 tweets

(2020) April 4 - April 5: 661 tweets

(2020) April 5 - April 6: 709 tweets

(2020) April 6 - April 7: 549 tweets

(2020) April 7 - April 8: 593 tweets

(2020) April 8 - April 9: 491 tweets

(2020) April 9 - April 10: 507 tweets

(2020) April 10 - April 11: 534 tweets

(2020) April 11 - April 12: 539 tweets

(2020) April 12 - April 13: 543 tweets

(2020) April 13 - April 14: 510 tweets

(2020) April 14 - April 15: 387 tweets

(2020) April 15 - April 16: 321 tweets

(2020) April 16 - April 17: 443 tweets

(2020) April 17 - April 18: 373 tweets

(2020) April 18 - April 19: 1020 tweets

(2020) April 19 - April 20: 884 tweets

(2020) April 20 - April 21: 869 tweets

(2020) April 21 - April 22: 878 tweets

(2020) April 22 - April 23: 831 tweets

(2020) April 23 - April 24: 818 tweets

(2020) April 24 - April 25: 747 tweets

(2020) April 25 - April 26: 693 tweets

(2020) April 26 - April 27: 939 tweets

(2020) April 27 - April 28: 744 tweets

(2020) April 28 - April 29: 1408 tweets

(2020) April 29 - April 30: 1751 tweets

(2020) April 30 - May 1: 1637 tweets

(2020) May 1 - May 2: 1866 tweets

(2020) May 2 - May 3: 1839 tweets

(2020) May 3 - May 4: 1566 tweets

(2020) May 4 - May 5: 1615 tweets

(2020) May 5 - May 6: 1635 tweets

(2020) May 6 - May 7: 1571 tweets

(2020) May 7 - May 8: 1621 tweets

(2020) May 8 - May 9: 1684 tweets

(2020) May 9 - May 10: 1474 tweets

(2020) May 10 - May 11: 1130 tweets

(2020) May 11 - May 12: 1281 tweets

(2020) May 12 - May 13: 1630 tweets

(2020) May 13 - May 14: 1480 tweets

(2020) May 14 - May 15: 1652 tweets

(2020) May 15 - May 16: 1583 tweets

(2020) May 16 - May 17: 1487 tweets

(2020) May 17 - May 18: 1341 tweets

(2020) May 18 - May 19: 1398 tweets

(2020) May 19 - May 20: 1389 tweets

(2020) May 20 - May 21: 1397 tweets

(2020) May 21 - May 22: 1562 tweets

(2020) May 22 - May 23: 1558 tweets

(2020) May 23 - May 24: 1299 tweets

(2020) May 24 - May 25: 1297 tweets

(2020) May 25 - May 26: 1190 tweets

(2020) May 26 - May 27: 1184 tweets

(2020) May 27 - May 28: 1257 tweets

(2020) May 28 - May 29: 1277 tweets

(2020) May 29 - May 30: 1202 tweets

(2020) May 30 - May 31: 1209 tweets

(2020) May 31 - June 1: 1080 tweets

(2020) June 1 - June 2: 1233 tweets

(2020) June 2 - June 3: 917 tweets

(2020) June 3 - June 4: 1055 tweets

(2020) June 4 - June 5: 1117 tweets

(2020) June 5 - June 6: 1184 tweets

(2020) June 6 - June 7: 1093 tweets

(2020) June 7 - June 8: 1054 tweets

(2020) June 8 - June 9: 1180 tweets

(2020) June 9 - June 10: 1155 tweets

(2020) June 10 - June 11: 1131 tweets

(2020) June 11 - June 12: 1148 tweets

(2020) June 12 - June 13: 1189 tweets

(2020) June 13 - June 14: 1045 tweets

(2020) June 14 - June 15: 1024 tweets

(2020) June 15 - June 16: 1663 tweets

(2020) June 16 - June 17: 1692 tweets

(2020) June 17 - June 18: 1634 tweets

(2020) June 18 - June 19: 1610 tweets

(2020) June 19 - June 20: 1698 tweets

(2020) June 20 - June 21: 1613 tweets

(2020) June 21 - June 22: 1419 tweets

(2020) June 22 - June 23: 1524 tweets

(2020) June 23 - June 24: 1431 tweets

(2020) June 24 - June 25: 1454 tweets

(2020) June 25 - June 26: 1539 tweets

(2020) June 26 - June 27: 1403 tweets

(2020) June 27 - June 28: 1766 tweets

(2020) June 28 - June 29: 1405 tweets

(2020) June 29 - June 30: 1534 tweets

(2020) June 30 - July 1: 1519 tweets

(2020) July 1 - July 2: 1841 tweets

(2020) July 2 - July 3: 1434 tweets

(2020) July 3 - July 4: 1475 tweets

(2020) July 4 - July 5: 2028 tweets

(2020) July 5 - July 6: 1491 tweets

(2020) July 6 - July 7: 1275 tweets

(2020) July 7 - July 8: 1336 tweets

(2020) July 8 - July 9: 1428 tweets

(2020) July 9 - July 10: 1831 tweets

(2020) July 10 - July 11: 1578 tweets

(2020) July 11 - July 12: 1575 tweets

(2020) July 12 - July 13: 1346 tweets

(2020) July 13 - July 14: 1295 tweets

(2020) July 14 - July 15: 1372 tweets

(2020) July 15 - July 16: 1213 tweets

(2020) July 16 - July 17: 1339 tweets

(2020) July 17 - July 18: 1588 tweets

(2020) July 18 - July 19: 1647 tweets

(2020) July 19 - July 20: 1452 tweets

(2020) July 20 - July 21: 1344 tweets

(2020) July 21 - July 22: 1557 tweets

(2020) July 22 - July 23: 1556 tweets

(2020) July 23 - July 24: 1541 tweets

(2020) July 24 - July 25: 1670 tweets

(2020) July 25 - July 26: 1536 tweets

(2020) July 26 - July 27: 1415 tweets

(2020) July 27 - July 28: 1262 tweets

(2020) July 28 - July 29: 1192 tweets

(2020) July 29 - July 30: 1284 tweets

(2020) July 30 - July 31: 1198 tweets

(2020) July 31 - August 1: 1399 tweets

(2020) August 1 - August 2: 1462 tweets

(2020) August 2 - August 3: 1247 tweets

(2020) August 3 - August 4: 1375 tweets

(2020) August 4 - August 5: 1507 tweets

(2020) August 5 - August 6: 1557 tweets

(2020) August 6 - August 7: 1576 tweets

(2020) August 7 - August 8: 1634 tweets

(2020) August 8 - August 9: 1630 tweets

(2020) August 9 - August 10: 1427 tweets

(2020) August 10 - August 11: 1363 tweets

(2020) August 11 - August 12: 1501 tweets

(2020) August 12 - August 13: 1632 tweets

(2020) August 13 - August 14: 1674 tweets

(2020) August 14 - August 15: 1689 tweets

(2020) August 15 - August 16: 1716 tweets

(2020) August 16 - August 17: 1515 tweets

(2020) August 17 - August 18: 1593 tweets

(2020) August 18 - August 19: 1945 tweets

(2020) August 19 - August 20: 1571 tweets

(2020) August 20 - August 21: 1667 tweets

(2020) August 21 - August 22: 1592 tweets

(2020) August 22 - August 23: 1439 tweets

(2020) August 23 - August 24: 1273 tweets

(2020) August 24 - August 25: 1379 tweets

(2020) August 25 - August 26: 1538 tweets

(2020) August 26 - August 27: 1535 tweets

(2020) August 27 - August 28: 1447 tweets

(2020) August 28 - August 29: 1376 tweets

(2020) August 29 - August 30: 1421 tweets

(2020) August 30 - August 31: 1195 tweets

(2020) August 31 - September 1: 1298 tweets

(2020) September 1 - September 2: 1449 tweets

(2020) September 2 - September 3: 1326 tweets

(2020) September 3 - September 4: 1299 tweets

(2020) September 4 - September 5: 1586 tweets

(2020) September 5 - September 6: 1374 tweets

(2020) September 6 - September 7: 1274 tweets

(2020) September 7 - September 8: 1316 tweets

(2020) September 8 - September 9: 1379 tweets

(2020) September 9 - September 10: 1120 tweets

(2020) September 10 - September 11: 628 tweets

(2020) September 11 - September 12: 678 tweets

(2020) September 12 - September 13: 681 tweets

(2020) September 13 - September 14: 644 tweets

(2020) September 14 - September 15: 645 tweets

(2020) September 15 - September 16: 693 tweets

(2020) September 16 - September 17: 635 tweets

(2020) September 17 - September 18: 582 tweets

(2020) September 18 - September 19: 737 tweets

(2020) September 19 - September 20: 702 tweets

(2020) September 20 - September 21: 588 tweets

(2020) September 21 - September 22: 660 tweets

(2020) September 22 - September 23: 634 tweets

(2020) September 23 - September 24: 607 tweets

(2020) September 24 - September 25: 666 tweets

(2020) September 25 - September 26: 660 tweets

(2020) September 26 - September 27: 657 tweets

(2020) September 27 - September 28: 600 tweets

(2020) September 28 - September 29: 635 tweets

(2020) September 29 - September 30: 609 tweets

(2020) September 30 - October 1: 580 tweets

(2020) October 1 - October 2: 641 tweets

(2020) October 2 - October 3: 318 tweets

(2020) October 3 - October 4: 384 tweets

(2020) October 4 - October 5: 389 tweets

(2020) October 5 - October 6: 361 tweets

(2020) October 6 - October 7: 396 tweets

(2020) October 7 - October 8: 464 tweets

(2020) October 8 - October 9: 537 tweets

(2020) October 9 - October 10: 574 tweets

(2020) October 10 - October 11: 577 tweets

(2020) October 11 - October 12: 504 tweets

(2020) October 12 - October 13: 551 tweets

(2020) October 13 - October 14: 486 tweets

(2020) October 14 - October 15: 611 tweets

(2020) October 15 - October 16: 518 tweets

(2020) October 16 - October 17: 593 tweets

(2020) October 17 - October 18: 624 tweets

(2020) October 18 - October 19: 507 tweets

(2020) October 19 - October 20: 594 tweets

(2020) October 20 - October 21: 589 tweets

(2020) October 21 - October 22: 580 tweets

(2020) October 22 - October 23: 582 tweets

(2020) October 23 - October 24: 707 tweets

(2020) October 24 - October 25: 644 tweets

(2020) October 25 - October 26: 507 tweets

(2020) October 26 - October 27: 576 tweets

(2020) October 27 - October 28: 485 tweets

(2020) October 28 - October 29: 537 tweets

(2020) October 29 - October 30: 686 tweets

(2020) October 30 - October 31: 698 tweets

(2020) October 31 - November 1: 1070 tweets

(2020) November 1 - November 2: 780 tweets

(2020) November 2 - November 3: 690 tweets

(2020) November 3 - November 4: 763 tweets

(2020) November 4 - November 5: 838 tweets

(2020) November 5 - November 6: 944 tweets

(2020) November 6 - November 7: 734 tweets

(2020) November 7 - November 8: 691 tweets

(2020) November 8 - November 9: 616 tweets

(2020) November 9 - November 10: 463 tweets

(2020) November 10 - November 11: 618 tweets

(2020) November 11 - November 12: 632 tweets

(2020) November 12 - November 13: 586 tweets

(2020) November 13 - November 14: 620 tweets

(2020) November 14 - November 15: 674 tweets

(2020) November 15 - November 16: 637 tweets

(2020) November 16 - November 17: 521 tweets

(2020) November 17 - November 18: 591 tweets

(2020) November 18 - November 19: 639 tweets

(2020) November 19 - November 20: 673 tweets

(2020) November 20 - November 21: 657 tweets

(2020) November 21 - November 22: 694 tweets

(2020) November 22 - November 23: 644 tweets

(2020) November 23 - November 24: 595 tweets

(2020) November 24 - November 25: 649 tweets

(2020) November 25 - November 26: 675 tweets

(2020) November 26 - November 27: 821 tweets

(2020) November 27 - November 28: 606 tweets

(2020) November 28 - November 29: 633 tweets

(2020) November 29 - November 30: 560 tweets

(2020) November 30 - December 1: 551 tweets

(2020) December 1 - December 2: 613 tweets

(2020) December 2 - December 3: 615 tweets

(2020) December 3 - December 4: 624 tweets

(2020) December 4 - December 5: 582 tweets

(2020) December 5 - December 6: 630 tweets

(2020) December 6 - December 7: 422 tweets

(2020) December 7 - December 8: 649 tweets

(2020) December 8 - December 9: 526 tweets

(2020) December 9 - December 10: 542 tweets

(2020) December 10 - December 11: 521 tweets

(2020) December 11 - December 12: 630 tweets

(2020) December 12 - December 13: 690 tweets

(2020) December 13 - December 14: 515 tweets

(2020) December 14 - December 15: 528 tweets

(2020) December 15 - December 16: 618 tweets

(2020) December 16 - December 17: 577 tweets

(2020) December 17 - December 18: 604 tweets

(2020) December 18 - December 19: 597 tweets

(2020) December 19 - December 20: 482 tweets

(2020) December 20 - December 21: 591 tweets

(2020) December 21 - December 22: 530 tweets

(2020) December 22 - December 23: 555 tweets

(2020) December 23 - December 24: 616 tweets

(2020) December 24 - December 25: 752 tweets

(2020) December 25 - December 26: 701 tweets

(2020) December 26 - December 27: 540 tweets

(2020) December 27 - December 28: 430 tweets

(2020) December 28 - December 29: 535 tweets

(2020) December 29 - December 30: 573 tweets

(2020) December 30 - December 31: 559 tweets

(2020/2021) December 31 - January 1: 877 tweets

(2021) January 1 - January 2: 582 tweets

(2021) January 2 - January 3: 458 tweets

(2021) January 3 - January 4: 423 tweets

(2021) January 4 - January 5: 479 tweets

(2021) January 5 - January 6: 675 tweets

(2021) January 6 - January 7: 584 tweets

(2021) January 7 - January 8: 662 tweets

(2021) January 8 - January 9: 646 tweets

(2021) January 9 - January 10: 677 tweets

(2021) January 10 - January 11: 554 tweets

(2021) January 11 - January 12: 577 tweets

(2021) January 12 - January 13: 660 tweets

(2021) January 13 - January 14: 566 tweets

(2021) January 14 - January 15: 709 tweets

(2021) January 15 - January 16: 625 tweets

(2021) January 16 - January 17: 545 tweets

(2021) January 17 - January 18: 587 tweets

(2021) January 18 - January 19: 541 tweets

(2021) January 19 - January 20: 625 tweets

(2021) January 20 - January 21: 505 tweets

(2021) January 21 - January 22: 534 tweets

(2021) January 22 - January 23: 584 tweets

(2021) January 23 - January 24: 591 tweets

(2021) January 24 - January 25: 561 tweets

(2021) January 25 - January 26: 516 tweets

(2021) January 26 - January 27: 596 tweets

(2021) January 27 - January 28: 580 tweets

(2021) January 28 - January 29: 660 tweets

(2021) January 29 - January 30: 784 tweets

(2021) January 30 - January 31: 774 tweets

(2021) January 31 - February 1: 703 tweets

(2021) February 1 - February 2: 777 tweets

(2021) February 2 - February 3: 688 tweets

(2021) February 3 - February 4: 713 tweets

(2021) February 4 - February 5: 651 tweets

(2021) February 5 - February 6: 743 tweets

(2021) February 6 - February 7: 631 tweets

(2021) February 7 - February 8: 685 tweets

(2021) February 8 - February 9: 627 tweets

(2021) February 9 - February 10: 665 tweets

(2021) February 10 - February 11: 697 tweets

(2021) February 11 - February 12: 614 tweets

(2021) February 12 - February 13: 736 tweets

(2021) February 13 - February 14: 601 tweets

(2021) February 14 - February 15: 594 tweets

(2021) February 15 - February 16: 648 tweets

(2021) February 16 - February 17: 656 tweets

(2021) February 17 - February 18: 675 tweets

(2021) February 18 - February 19: 627 tweets

(2021) February 19 - February 20: 637 tweets

(2021) February 20 - February 21: 609 tweets

(2021) February 21 - February 22: 559 tweets

(2021) February 22 - February 23: 644 tweets

(2021) February 23 - February 24: 615 tweets

(2021) February 24 - February 25: 646 tweets

(2021) February 25 - February 26: 643 tweets

(2021) February 26 - February 27: 578 tweets

(2021) February 27 - February 28: 669 tweets

(2021) February 28 - March 1: 581 tweets

(2021) March 1 - March 2: 568 tweets

(2021) March 2 - March 3: 564 tweets

(2021) March 3 - March 4: 383 tweets

(2021) March 4 - March 5: 599 tweets

(2021) March 5 - March 6: 588 tweets

(2021) March 6 - March 7: 632 tweets

(2021) March 7 - March 8: 597 tweets

(2021) March 8 - March 9: 566 tweets

(2021) March 9 - March 10: 617 tweets

(2021) March 10 - March 11: 655 tweets

(2021) March 11 - March 12: 715 tweets

(2021) March 12 - March 13: 702 tweets

(2021) March 13 - March 14: 700 tweets

(2021) March 14 - March 15: 658 tweets

(2021) March 15 - March 16: 751 tweets

(2021) March 16 - March 17: 685 tweets

(2021) March 17 - March 18: 842 tweets

(2021) March 18 - March 19: 762 tweets

(2021) March 19 - March 20: 679 tweets

(2021) March 20 - March 21: 724 tweets

(2021) March 21 - March 22: 692 tweets

(2021) March 22 - March 23: 694 tweets

(2021) March 23 - March 24: 760 tweets

(2021) March 24 - March 25: 778 tweets

(2021) March 25 - March 26: 784 tweets

(2021) March 26 - March 27: 788 tweets

(2021) March 27 - March 28: 686 tweets

(2021) March 28 - March 29: 626 tweets

(2021) March 29 - March 30: 713 tweets

(2021) March 30 - March 31: 736 tweets

(2021) March 31 - April 1: 736 tweets

(2021) April 1 - April 2: 738 tweets

(2021) April 2 - April 3: 726 tweets

(2021) April 3 - April 4: 662 tweets

(2021) April 4 - April 5: 627 tweets

(2021) April 5 - April 6: 720 tweets

(2021) April 6 - April 7: 732 tweets

(2021) April 7 - April 8: 776 tweets

(2021) April 8 - April 9: 702 tweets

(2021) April 9 - April 10: 722 tweets

(2021) April 10 - April 11: 626 tweets

(2021) April 11 - April 12: 589 tweets

(2021) April 12 - April 13: 706 tweets

(2021) April 13 - April 14: 799 tweets

(2021) April 14 - April 15: 547 tweets

(2021) April 15 - April 16: 522 tweets

(2021) April 16 - April 17: 641 tweets

(2021) April 17 - April 18: 511 tweets

(2021) April 18 - April 19: 578 tweets

(2021) April 19 - April 20: 640 tweets

(2021) April 20 - April 21: 624 tweets

(2021) April 21 - April 22: 584 tweets

(2021) April 22 - April 23: 611 tweets

(2021) April 23 - April 24: 631 tweets

(2021) April 24 - April 25: 570 tweets

(2021) April 25 - April 26: 519 tweets

(2021) April 26 - April 27: 523 tweets

(2021) April 27 - April 28: 577 tweets

(2021) April 28 - April 29: 608 tweets

(2021) April 29 - April 30: 831 tweets

(2021) April 30 - May 1: 535 tweets

(2021) May 1 - May 2: 601 tweets

(2021) May 2 - May 3: 507 tweets

(2021) May 3 - May 4: 565 tweets

(2021) May 4 - May 5: 575 tweets

(2021) May 5 - May 6: 605 tweets

(2021) May 6 - May 7: 1115 tweets

(2021) May 7 - May 8: 1263 tweets

(2021) May 8 - May 9: 530 tweets

(2021) May 9 - May 10: 481 tweets

(2021) May 10 - May 11: 518 tweets

(2021) May 11 - May 12: 424 tweets

(2021) May 12 - May 13: 508 tweets

(2021) May 13 - May 14: 632 tweets

(2021) May 14 - May 15: 627 tweets

(2021) May 15 - May 16: 484 tweets

(2021) May 16 - May 17: 451 tweets

(2021) May 17 - May 18: 505 tweets

(2021) May 18 - May 19: 486 tweets

(2021) May 19 - May 20: 522 tweets

(2021) May 20 - May 21: 493 tweets

(2021) May 21 - May 22: 450 tweets

(2021) May 22 - May 23: 451 tweets

(2021) May 23 - May 24: 353 tweets

(2021) May 24 - May 25: 457 tweets

(2021) May 25 - May 26: 421 tweets

(2021) May 26 - May 27: 454 tweets

(2021) May 27 - May 28: 489 tweets

(2021) May 28 - May 29: 484 tweets

(2021) May 29 - May 30: 432 tweets

(2021) May 30 - May 31: 369 tweets

(2021) May 31 - June 1: 420 tweets

(2021) June 1 - June 2: 457 tweets

(2021) June 2 - June 3: 398 tweets

(2021) June 3 - June 4: 400 tweets

(2021) June 4 - June 5: 470 tweets

(2021) June 5 - June 6: 390 tweets

(2021) June 6 - June 7: 310 tweets

(2021) June 7 - June 8: 366 tweets

(2021) June 8 - June 9: 381 tweets

(2021) June 9 - June 10: 408 tweets

(2021) June 10 - June 11: 402 tweets

(2021) June 11 - June 12: 376 tweets

(2021) June 12 - June 13: 331 tweets

(2021) June 13 - June 14: 295 tweets

(2021) June 14 - June 15: 330 tweets

(2021) June 15 - June 16: 395 tweets

(2021) June 16 - June 17: 358 tweets

(2021) June 17 - June 18: 341 tweets

(2021) June 18 - June 19: 351 tweets

(2021) June 19 - June 20: 293 tweets

(2021) June 20 - June 21: 237 tweets

(2021) June 21 - June 22: 276 tweets

(2021) June 22 - June 23: 311 tweets

(2021) June 23 - June 24: 329 tweets

(2021) June 24 - June 25: 313 tweets

(2021) June 25 - June 26: 306 tweets

(2021) June 26 - June 27: 319 tweets

(2021) June 27 - June 28: 281 tweets

(2021) June 28 - June 29: 292 tweets

(2021) June 29 - June 30: 342 tweets

(2021) June 30 - July 1: 307 tweets

(2021) July 1 - July 2: 349 tweets

(2021) July 2 - July 3: 319 tweets

(2021) July 3 - July 4: 281 tweets

(2021) July 4 - July 5: 281 tweets

(2021) July 5 - July 6: 281 tweets

(2021) July 6 - July 7: 296 tweets

(2021) July 7 - July 8: 313 tweets

(2021) July 8 - July 9: 324 tweets

(2021) July 9 - July 10: 327 tweets

(2021) July 10 - July 11: 287 tweets

(2021) July 11 - July 12: 268 tweets

(2021) July 12 - July 13: 293 tweets

(2021) July 13 - July 14: 311 tweets

(2021) July 14 - July 15: 323 tweets

(2021) July 15 - July 16: 333 tweets

(2021) July 16 - July 17: 347 tweets

(2021) July 17 - July 18: 351 tweets

(2021) July 18 - July 19: 301 tweets

(2021) July 19 - July 20: 317 tweets

(2021) July 20 - July 21: 314 tweets

(2021) July 21 - July 22: 321 tweets

(2021) July 22 - July 23: 298 tweets

(2021) July 23 - July 24: 330 tweets

(2021) July 24 - July 25: 297 tweets

(2021) July 25 - July 26: 237 tweets

(2021) July 26 - July 27: 279 tweets

(2021) July 27 - July 28: 298 tweets

(2021) July 28 - July 29: 297 tweets

(2021) July 29 - July 30: 279 tweets

(2021) July 30 - July 31: 302 tweets

(2021) July 31 - August 1: 292 tweets

(2021) August 1 - August 2: 286 tweets

(2021) August 2 - August 3: 295 tweets

(2021) August 3 - August 4: 301 tweets

(2021) August 4 - August 5: 281 tweets

(2021) August 5 - August 6: 318 tweets

(2021) August 6 - August 7: 302 tweets

(2021) August 7 - August 8: 280 tweets

(2021) August 8 - August 9: 255 tweets

(2021) August 9 - August 10: 297 tweets

(2021) August 10 - August 11: 278 tweets

(2021) August 11 - August 12: 264 tweets

(2021) August 12 - August 13: 301 tweets

(2021) August 13 - August 14: 293 tweets

(2021) August 14 - August 15: 296 tweets

(2021) August 15 - August 16: 230 tweets

(2021) August 16 - August 17: 258 tweets

(2021) August 17 - August 18: 314 tweets

(2021) August 18 - August 19: 302 tweets

(2021) August 19 - August 20: 306 tweets

(2021) August 20 - August 21: 331 tweets

(2021) August 21 - August 22: 383 tweets

(2021) August 22 - August 23: 189 tweets

(2021) August 23 - August 24: 237 tweets

(2021) August 24 - August 25: 343 tweets

(2021) August 25 - August 26: 293 tweets

(2021) August 26 - August 27: 338 tweets

(2021) August 27 - August 28: 315 tweets

(2021) August 28 - August 29: 271 tweets

(2021) August 29 - August 30: 317 tweets

(2021) August 30 - August 31: 242 tweets

(2021) August 31 - September 1: 238 tweets

(2021) September 1 - September 2: 279 tweets

(2021) September 2 - September 3: 251 tweets

(2021) September 3 - September 4: 262 tweets

(2021) September 4 - September 5: 255 tweets

(2021) September 5 - September 6: 240 tweets

(2021) September 6 - September 7: 216 tweets

(2021) September 7 - September 8: 238 tweets

(2021) September 8 - September 9: 250 tweets

(2021) September 9 - September 10: 224 tweets

(2021) September 10 - September 11: 256 tweets

(2021) September 11 - September 12: 225 tweets

(2021) September 12 - September 13: 202 tweets

(2021) September 13 - September 14: 217 tweets

(2021) September 14 - September 15: 257 tweets

(2021) September 15 - September 16: 236 tweets

(2021) September 16 - September 17: 248 tweets

(2021) September 17 - September 18: 300 tweets

(2021) September 18 - September 19: 233 tweets

(2021) September 19 - September 20: 194 tweets

(2021) September 20 - September 21: 245 tweets

(2021) September 21 - September 22: 245 tweets

(2021) September 22 - September 23: 266 tweets

(2021) September 23 - September 24: 223 tweets

(2021) September 24 - September 25: 247 tweets

(2021) September 25 - September 26: 227 tweets

(2021) September 26 - September 27: 226 tweets

(2021) September 27 - September 28: 259 tweets

(2021) September 28 - September 29: 289 tweets

(2021) September 29 - September 30: 282 tweets

(2021) September 30 - October 1: 249 tweets

(2021) October 1 - October 2: 125 tweets

(2021) October 2 - October 3: 110 tweets

(2021) October 3 - October 4: 91 tweets

(2021) October 4 - October 5: 158 tweets

(2021) October 5 - October 6: 174 tweets

(2021) October 6 - October 7: 244 tweets

(2021) October 7 - October 8: 292 tweets

(2021) October 8 - October 9: 191 tweets

(2021) October 9 - October 10: 168 tweets

(2021) October 10 - October 11: 241 tweets

(2021) October 11 - October 12: 171 tweets

(2021) October 12 - October 13: 220 tweets

(2021) October 13 - October 14: 243 tweets

(2021) October 14 - October 15: 262 tweets

(2021) October 15 - October 16: 214 tweets

(2021) October 16 - October 17: 197 tweets

(2021) October 17 - October 18: 170 tweets

(2021) October 18 - October 19: 207 tweets

(2021) October 19 - October 20: 211 tweets

(2021) October 20 - October 21: 188 tweets

(2021) October 21 - October 22: 231 tweets
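The daily entries above follow a regular "(year) start - end: N tweets" pattern, so they can be tallied with a short script. The following is a minimal sketch, assuming the listing has been pasted into a string; the function name `tally_counts` and the sample text are illustrative only and are not part of the dataset.

```python
import re

# Matches lines such as "(2020) March 20 - March 21: 1290 tweets".
# The year group also accepts the "(2020/2021)" form used for the New Year entry.
LINE_PATTERN = re.compile(r"^\((\d{4}(?:/\d{4})?)\)\s+(.+?):\s+(\d+) tweets")


def tally_counts(text):
    """Return (number_of_entries, total_tweets) for the listing in `text`."""
    entries = 0
    total = 0
    for line in text.splitlines():
        match = LINE_PATTERN.match(line.strip())
        if match:  # skip blank lines and anything that is not a count entry
            entries += 1
            total += int(match.group(3))
    return entries, total


# A three-line excerpt of the listing, used only to demonstrate the call.
sample = """
(2020) March 20 - March 21: 1290 tweets
(2020) March 29 - March 30: 5318 tweets (this file was added on June 29, 2021)
(2020/2021) December 31 - January 1: 877 tweets
"""
print(tally_counts(sample))
```

Trailing notes on an entry, such as the one for March 29 - March 30, are ignored because the pattern stops at the first "N tweets" after the date range.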

Instructions: 

Each CSV file contains a list of tweet IDs. You can use these tweet IDs to download fresh data from Twitter (read this article: hydrating tweet IDs). To give NLP researchers easy access to the sentiment of each collected tweet, the sentiment score computed by TextBlob has been appended as the second column. To hydrate the tweet IDs, you can use applications such as Hydrator (available for OS X, Windows and Linux) or twarc (a Python library).
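Because the second column already holds a sentiment score, a rough sentiment distribution for a day can be computed without hydrating anything. The sketch below assumes the score is a TextBlob polarity value (which ranges from -1.0 to 1.0) and that a score above zero counts as positive, below zero as negative, and exactly zero as neutral; these thresholds, and the names `label_for` and `sentiment_distribution`, are assumptions made for illustration.

```python
import csv
from collections import Counter


def label_for(score):
    """Map a polarity score to a coarse label (thresholds are an assumption)."""
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def sentiment_distribution(csv_path):
    """Count labels in a headerless two-column file: tweet ID, sentiment score."""
    counts = Counter()
    with open(csv_path, newline="") as handle:
        for row in csv.reader(handle):
            if len(row) < 2:
                continue  # skip blank or malformed rows
            counts[label_for(float(row[1]))] += 1
    return counts
```

For example, `sentiment_distribution("april28_april29.csv")` would return a `Counter` keyed by the three labels.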

Getting the CSV files of this dataset ready for hydrating the tweet IDs:

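import pandas as pd

# The source file has no header row: column 0 is the tweet ID, column 1 the sentiment score.
dataframe = pd.read_csv("april28_april29.csv", header=None)

# Keep only the tweet ID column.
dataframe = dataframe[0]

# Write one tweet ID per line, without an index or header.
dataframe.to_csv("ready_april28_april29.csv", index=False, header=False)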

The above example code takes in the original CSV file (i.e., april28_april29.csv) from this dataset and exports just the tweet ID column to a new CSV file (i.e., ready_april28_april29.csv). The newly created CSV file can now be consumed by the Hydrator application for hydrating the tweet IDs. To export the tweet ID column into a TXT file, just replace ".csv" with ".txt" in the to_csv function (last line) of the above example code.
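If pandas is not available, the same export can be done with Python's standard library. This is a minimal sketch rather than the author's method: the function name `export_tweet_ids` is hypothetical, and it treats every ID as text so that no digits are altered along the way.

```python
import csv


def export_tweet_ids(source_csv, target_txt):
    """Copy the first column (tweet IDs) of `source_csv` to `target_txt`, one per line."""
    written = 0
    with open(source_csv, newline="") as src, open(target_txt, "w") as dst:
        for row in csv.reader(src):
            if not row or not row[0].strip():
                continue  # ignore empty lines
            dst.write(row[0].strip() + "\n")
            written += 1
    return written  # number of IDs exported
```

Calling `export_tweet_ids("april28_april29.csv", "ready_april28_april29.txt")` would produce a TXT file with one tweet ID per line.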

If you are not comfortable with Python and pandas, you can upload these CSV files to your Google Drive and use Google Sheets to delete the second column. Once finished with the deletion, download the edited CSV files: File > Download > Comma-separated values (.csv, current sheet). These downloaded CSV files are now ready to be used with the Hydrator app for hydrating the tweet IDs.

Comments

Hello sir,

twarc hydrate is not working in spite of giving the correct Twitter API credentials when configuring twarc. It's creating a blank JSON file. To test twarc, I used search, and it is able to pull out tweets.

Submitted by Jayshree Ravi on Fri, 10/16/2020 - 04:30

Hello Jayshree. Please create an issue at twarc's github. And FYI, I am able to hydrate tweets at my end without any problem.

Submitted by Rabindra Lamsal on Sat, 10/17/2020 - 01:10

Thanks for your response. Only the hydrate command is not working. All other commands like search, filter, users and dehydrate are able to connect to Twitter and give me the requisite information. The hydrate command does not throw any error. It just produces a blank JSON file. I even tried with only one tweet ID in the TXT file. Your guidance would be of great help.

Submitted by Jayshree Ravi on Sun, 10/18/2020 - 01:20

It's working now. Thanks.

Submitted by Jayshree Ravi on Sun, 10/18/2020 - 02:19

That's great.

Submitted by Rabindra Lamsal on Mon, 10/19/2020 - 00:53

How do you get permission to access the S3 bucket? I'm getting "access denied" errors when I try to access through the aws app or the web. Thanks!

Submitted by Adam Dalton on Thu, 11/05/2020 - 10:55

For anyone looking to use the aws cli, here's what I did

1. Click on the "Access on AWS" link
2. Click "View AWS Security Credentials"
3. In "~/.aws/credentials" create an ieee profile
[ieee]
aws_access_key_id = ********
aws_secret_access_key = ********
4. Copy the files listed in "Access on AWS" into a file like covid19-geotagged.txt
5. Run `while read -r line;do aws s3 --profile=ieee cp "$line" .;done < covid19-geotagged.txt`

This should work on most unix machines. Windows will probably be slightly different.

Submitted by Adam Dalton on Thu, 11/05/2020 - 11:12
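For machines where the shell loop in the comment above is inconvenient (for example on Windows), the same idea can be sketched in Python. This assumes the AWS CLI is installed, that an "ieee" profile exists as described in step 3, and that each non-empty line of the list file is a source path that `aws s3 cp` accepts; the function names are hypothetical.

```python
import subprocess


def build_copy_commands(list_path, profile="ieee", destination="."):
    """Return one `aws s3 cp` argument list per non-empty line of `list_path`."""
    commands = []
    with open(list_path) as handle:
        for line in handle:
            uri = line.strip()
            if uri:
                commands.append(
                    ["aws", "s3", f"--profile={profile}", "cp", uri, destination]
                )
    return commands


def download_all(list_path):
    """Run the commands one by one; requires the AWS CLI to be installed."""
    for command in build_copy_commands(list_path):
        subprocess.run(command, check=True)  # stop at the first failed copy
```

Calling `download_all("covid19-geotagged.txt")` mirrors step 5; building the argument lists separately makes it possible to inspect the commands before running them.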

Great! Thanks for the follow-up.

Submitted by Rabindra Lamsal on Fri, 11/06/2020 - 00:04

Please make sure you have the exact AWS access ID and AWS Secret Access Key copied from your IEEE-DataPort profile.

things to note:
(a) protocol: not always required (Amazon S3)
(b) Address: not always required (s3.amazonaws.com)
(c) bucket: ieee-dataport
(d) Access Key id: enter your AWS access ID
(e) Secret: enter your AWS Secret Access Key

I hope this helps.

Submitted by Rabindra Lamsal on Sun, 11/08/2020 - 23:43

Sir, are these tweets filtered to exclude local-language tweets typed in English letters, for example a Hindi message written in English?

Submitted by GONGATI REDDY on Fri, 11/13/2020 - 05:08

Hello Gongati. Twitter adds a language identifier based on the machine-detected language of the tweet body. The tweets in this dataset are those which had the "en" language identifier in their metadata.
Tweets composed in, e.g., romanized Hindi cannot be considered English even though they use the English alphabet. I believe those kinds of tweets fall under the undefined 'und' language category.

Submitted by Rabindra Lamsal on Sat, 11/14/2020 - 00:02

Thank You.....

Submitted by GONGATI REDDY on Sat, 11/14/2020 - 00:05

Glad to be of help.

Submitted by Rabindra Lamsal on Sat, 11/14/2020 - 23:53

Sir, is it true that we can fetch only 100 tweets at a time? If true, is there a chance that tweets from the previous 100 will repeat in the next 100?

Submitted by GONGATI REDDY on Sun, 11/15/2020 - 22:58

(i) Yes, 100 tweets can be fetched in a single request (v1.1 streaming API). However, Twitter puts limits on the number of requests that can be made per window period.
(ii) No, tweets do not repeat. The tweets are available via the streaming API as soon as they are tweeted (in near real-time).

Submitted by Rabindra Lamsal on Tue, 11/17/2020 - 10:33
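To illustrate the 100-per-request point in the reply above: a list of IDs can be split into consecutive batches, and as long as the IDs themselves are unique, no ID appears in two batches. This is only a sketch with a hypothetical helper name; how the batches are actually sent, and how rate limits are handled, depends on the client being used.

```python
def batches(ids, size=100):
    """Yield consecutive slices of `ids`, each holding at most `size` items."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(ids), size):
        yield ids[start:start + size]


# Example: 250 placeholder IDs are split into batches of 100, 100 and 50.
example_ids = [str(n) for n in range(250)]
print([len(batch) for batch in batches(example_ids)])
```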

Dear Rabindra,
Thanks for the amazing work in helping make researchers' work easier and faster.
I need your help to know whether I could download the entire dataset without subscribing to IEEE DataPort or AWS. If yes, how?

Please help me know if anyone else knows about it.
Thanks.

Submitted by Vibhu Kumar on Thu, 12/03/2020 - 07:19

Hello Vibhu. You don't need any kind of subscription to IEEE DataPort to download the dataset. All you need is a normal IEEE.org account. The dataset is open access; therefore, no subscription is required.

You can access the AWS S3 bucket via the command line (this comment may help you out here: https://ieee-dataport.org/open-access/coronavirus-covid-19-geo-tagged-tw...).

Else, you can write to IEEE DataPort via this page (https://ieee-dataport.org/contact) and ask for other ways to download the entire dataset.

Submitted by Rabindra Lamsal on Sat, 12/05/2020 - 00:43

Dear Rabindra,
Thank you for developing this dataset and the Hydrate app.
However, when I download the "ready dataset" from Google Drive, the Hydrate app warns: "invalid line 1 from the file".
Any solutions on this matter? Thanks!

Submitted by Xuanyi Zhao on Wed, 12/09/2020 - 19:30

Hello Xuanyi. The aim of using Google Spreadsheet is to remove the second column (i.e. sentiment score) and only keep the first column (tweet id). Just make sure that there are only tweet ids (one id per line) in the ready file. Maybe the file which you've made ready for hydration contains extra spaces or any sort of characters (by mistake) other than digits (ids). Please double-check this.
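
One way to double-check the ready file is to scan it for any line that is not made up of digits only. The sketch below is an assumption about how such a check could look, not part of the Hydrator app; the sample lines are invented.

```python
def find_invalid_lines(lines):
    """Return (line_number, content) pairs for lines that are not a bare tweet ID."""
    bad = []
    for number, line in enumerate(lines, start=1):
        content = line.rstrip("\n")
        # A valid line holds ASCII digits only: no spaces, commas, or empty lines.
        if not (content.isascii() and content.isdigit()):
            bad.append((number, content))
    return bad


# Hypothetical file contents: a leftover score column, a trailing space, an empty line.
sample_lines = [
    "1245248640421179393,0.25\n",
    "1245289072987443201 \n",
    "1245304684237090817\n",
    "\n",
]
problems = find_invalid_lines(sample_lines)
```

For a real file, you could pass the open file object itself, e.g. find_invalid_lines(open("ready_file.txt", encoding="utf-8")), where the filename is a placeholder.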

Submitted by Rabindra Lamsal on Wed, 12/09/2020 - 22:56

Hi Rabindra,

Could I ask for some advice, please? Is it possible to filter the tweets to isolate those coming from one country, e.g. the United Kingdom? If so, how would I go about doing that?

Thanks.

Submitted by Craig Cowan on Sun, 01/24/2021 - 10:46

Hello Craig. Yes, it is possible to filter the tweets coming from one country. Please refer to my previous comments:

(i) https://ieee-dataport.org/open-access/coronavirus-covid-19-geo-tagged-tw...
(ii) https://ieee-dataport.org/open-access/coronavirus-covid-19-geo-tagged-tw...

I hope this helps.

Submitted by Rabindra Lamsal on Sun, 01/24/2021 - 23:56

Hi Rabindra,

Thanks a lot, that's great information.

Submitted by Craig Cowan on Mon, 01/25/2021 - 07:50

I am, however, struggling to hydrate the tweets using the hydrator you recommended, DocNow's Hydrator.

I noticed that in a previous comment you said not to use Excel's .csv files when hydrating, because the numbers get truncated, and to use Google Sheets instead. How do I go about doing this? When I upload the file to Google Sheets, the numbers still get truncated.

Any help is appreciated, thanks.

Submitted by Craig Cowan on Mon, 01/25/2021 - 08:03

I have not experienced the truncation issue with Google Sheets. If you are still having the issue, I would recommend using Python, or any language you're comfortable with, to drop the second column of the CSV files.
(Alternatively, you can use one of the many online "column drop" applications.)
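
A minimal Python sketch of the column drop, using only the standard library. It assumes the CSV files have no header row and hold the tweet ID in the first column; the function name and the filenames in the usage note are placeholders.

```python
import csv


def extract_ids(src_path, dst_path):
    """Copy the first column (tweet ID) of a CSV file to a one-ID-per-line text file."""
    count = 0
    with open(src_path, newline="", encoding="utf-8") as src, \
            open(dst_path, "w", encoding="utf-8") as dst:
        for row in csv.reader(src):
            if row and row[0].strip():
                # IDs are handled as text throughout, so no digits are lost.
                dst.write(row[0].strip() + "\n")
                count += 1
    return count
```

For example, extract_ids("march20_march21.csv", "ready_march20_march21.txt") would write the IDs to a text file that can then be given to the hydrator.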

Submitted by Rabindra Lamsal on Tue, 01/26/2021 - 06:59

Hello,

First of all, thank you for your effort; I greatly appreciate it.

I'm a little confused about the Twitter API; maybe you can help me. When a tweet has a value for "coordinates", does this automatically assign the correct value for "place"? The Twitter documentation (and also your paper) says that "place" does not necessarily mean that the tweet was posted from that particular location.

Thank you

Submitted by Sebastian Dueker on Wed, 02/17/2021 - 11:22

Hello Sebastian. If you go through the Geo objects documentation (https://developer.twitter.com/en/docs/twitter-api/v1/data-dictionary/obj...), you'll see that you can extract location data at two different levels (I'm excluding the third one, i.e. the location from a user's profile).
If the "coordinates" object is NOT NULL, the "place" object will have exact location information (point location). However, if the "coordinates" object is NULL, you can still have place information (Twitter place, if available). But keep in mind that location information extracted in the latter case might not represent the tweet's origin location.
I hope this clarifies your doubt.
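
To illustrate the two levels, here is a small sketch that reads both objects from a hydrated tweet in JSON form (for instance, twarc output rather than a flattened CSV). The helper names and the sample tweet are made up; the field names follow the v1.1 Geo objects documentation.

```python
def exact_point(tweet):
    """Return (longitude, latitude) if the tweet carries an exact point, else None."""
    coords = tweet.get("coordinates")
    if coords and coords.get("type") == "Point":
        lon, lat = coords["coordinates"]  # GeoJSON order: longitude first
        return lon, lat
    return None


def country_code(tweet):
    """Return the two-letter country code of the tweet's place, if any."""
    place = tweet.get("place") or {}
    return place.get("country_code")


# Hypothetical hydrated tweet, trimmed to the relevant fields.
sample = {
    "id_str": "1",
    "coordinates": {"type": "Point", "coordinates": [-0.1276, 51.5072]},
    "place": {"full_name": "London, England", "country_code": "GB"},
}
point = exact_point(sample)
is_uk = country_code(sample) == "GB"
```

Filtering a collection by country_code is one way to isolate tweets from a single country.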

Submitted by Rabindra Lamsal on Fri, 02/19/2021 - 01:00

Hello Rabindra,

thanks for clarifying. Still, I have another question:

I used your dataset with the "Hydrator" application. In the resulting dataset, there are no tweets where the "coordinates" column is empty. Does this mean that all tweets in your dataset represent an exact location? Or does Twitter assign coordinates afterwards according to a "place" selected by the user?

For my research, I'm looking for tweets from New York and London, so I need the exact location of tweets that were actually posted from these locations.

Submitted by Sebastian Dueker on Fri, 02/19/2021 - 04:36

Yes. The tweets in this dataset have exact locations. Tweets with "Twitter place" do not make their way to this dataset.

Submitted by Rabindra Lamsal on Fri, 02/19/2021 - 10:49

Hi Rabindra,
First of all, thank you so much for this dataset. I have downloaded the data for a particular day, but the total number of tweets for that day is very low, only 1289. I am getting a very low accuracy and F1 score for my machine learning model. Can I get the data for a longer span, such as one month, so that I can feed the model a larger volume of data?
Please help me.

Submitted by SUBHADIP MAITY on Tue, 02/23/2021 - 08:12

Hello Subhadip. I have emailed IEEE asking them to add a Combined_Files section to this dataset. I believe the section will be available by tomorrow.

Once the section is in place and you have downloaded all the CSV files as a zip, you can concatenate all the files using Python (or consider only the files you're interested in). Dropping the second column should be easy. Then you can hydrate the tweets. I would suggest you use twarc to hydrate the tweets, and while you're hydrating, you can also compute the corresponding sentiment scores via your custom-built sentiment classifier/regressor or third-party libraries.
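
A possible way to do the concatenation with the standard library is sketched below. It assumes the extracted CSV files sit in one folder, have no header row, and hold the tweet ID in the first column; the function name is a placeholder, and skipping repeated IDs is an extra safeguard added here, not something the dataset requires.

```python
import csv
from pathlib import Path


def combine_ids(folder, out_path):
    """Write the unique tweet IDs from every CSV in `folder` to `out_path`, one per line."""
    seen = set()
    with open(out_path, "w", encoding="utf-8") as out:
        for csv_file in sorted(Path(folder).glob("*.csv")):
            with open(csv_file, newline="", encoding="utf-8") as src:
                for row in csv.reader(src):
                    tweet_id = row[0].strip() if row else ""
                    # Only the first column is kept; the sentiment score is dropped.
                    if tweet_id and tweet_id not in seen:
                        seen.add(tweet_id)
                        out.write(tweet_id + "\n")
    return len(seen)
```

The output path should use a different extension (for example .txt) or live outside the folder, so that it is not picked up as an input file on a later run.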

I hope this helps.

[update] the Combined_Files section is now available.

Submitted by Rabindra Lamsal on Thu, 02/25/2021 - 11:11

Hi Rabindra, I have got the combined files. Thank you so much.

Submitted by SUBHADIP MAITY on Thu, 02/25/2021 - 21:30

Glad to be of help.

Submitted by Rabindra Lamsal on Fri, 02/26/2021 - 12:41

Hi Rabindra,

I noticed that a lot of the tweets in the dataset are cut off. Do you have any idea what the reason for that is?

Submitted by Sebastian Dueker on Wed, 02/24/2021 - 07:14

Do you mean that the tweets are "not available"?
Yes, you will not be able to hydrate the IDs of tweets that have been deleted or made private. I have mentioned this in the paper. However, the number of "not available" tweets is not that significant in comparison to the primary dataset.

Submitted by Rabindra Lamsal on Thu, 02/25/2021 - 00:32

I'm not sure if we're talking about the same thing. I'm able to hydrate these tweets, but when I look at the text, I can only see a small part of it; the rest is cut off and replaced by three dots at the end of the tweet.

In your paper, you wrote that about 2.80% of the tweets were either private or deleted. For my research, I looked at a lot of the tweets manually. Maybe I just got the wrong tweets, but about 80% of the tweets were affected this way.

Submitted by Sebastian Dueker on Fri, 02/26/2021 - 03:05

Ohh, okay. You are talking about tweets getting truncated. Don't worry, you're getting the correct tweets. There is a Twitter object "truncated" which indicates if a tweet is truncated. Truncated tweets end in ellipsis, like this ... .

Dealing with truncated tweets: You can get the full tweet text by requesting data['extended_tweet']['full_text']. If the "truncated" object is "false", just pull the tweet text with data['text']; when the "truncated" object is "true", use data['extended_tweet']['full_text'].

    # data is a single tweet object (a dict parsed from the tweet's JSON)
    if data['truncated']:
        tweet = data['extended_tweet']['full_text']
    else:
        tweet = data['text']

And for a retweeted tweet (if there is truncation), the full text is placed under 'retweeted_status'.
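
Putting the two cases together, a helper along these lines could be used. It is a sketch, not a definitive implementation: the function name and sample dictionaries are invented, and the fallback to a top-level 'full_text' key assumes the tweets may have been requested in extended mode.

```python
def full_text(tweet):
    """Return the most complete text available for a tweet dict."""
    # For a retweet, the untruncated original sits under 'retweeted_status'.
    if "retweeted_status" in tweet:
        return full_text(tweet["retweeted_status"])
    if tweet.get("truncated") and "extended_tweet" in tweet:
        return tweet["extended_tweet"]["full_text"]
    # Assumption: extended-mode payloads carry 'full_text' at the top level.
    return tweet.get("full_text") or tweet.get("text", "")


# Hypothetical tweet objects, trimmed to the relevant fields.
long_tweet = {
    "truncated": True,
    "text": "the whole…",
    "extended_tweet": {"full_text": "the whole text"},
}
short_tweet = {"truncated": False, "text": "plain"}
retweet = {"truncated": False, "text": "RT @user: the whole…", "retweeted_status": long_tweet}

texts = [full_text(t) for t in (long_tweet, short_tweet, retweet)]
```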

I hope this helps.

Submitted by Rabindra Lamsal on Fri, 02/26/2021 - 09:22

Thanks for replying. But I think the problem I'm facing lies somewhere else: a lot of these truncated tweets just contain links to other social media posts (especially Instagram) at the end. Can the full text of these kinds of tweets be shown as well? Or is there any other way to resolve this?

Submitted by Sebastian Dueker on Fri, 02/26/2021 - 09:46

You can always check a tweet online using this URL: http://twitter.com/check/status/tweet_id.

Just replace "tweet_id" in the above URL with the numeric ID. You can then check whether the tweet is really truncated or was simply written that way.

Submitted by Rabindra Lamsal on Fri, 02/26/2021 - 11:19

Yeah, sure. Here are some example IDs:
1245248640421179393
1245289072987443201
1245304684237090817

Submitted by Sebastian Dueker on Fri, 02/26/2021 - 11:21

You can use the above URL pattern to see the full tweet body of any tweet using its ID. I went through the tweet body of the IDs you shared. In the case of these tweets, there is nothing much we can do. It is due to character limitation on Twitter's side. And this case is pretty common when people share their Insta posts on Twitter.

Submitted by Rabindra Lamsal on Fri, 02/26/2021 - 12:40

Hi Rabindra,

I don't seem to be able to check the tweet using the link http://twitter.com/check/status/tweet_id. For instance, from the Excel file corona_tweets_01 I randomly picked the IDs on rows 1 (1240727808080410000), 10 (1240727809225440000), 100 (1240727821661550000) and some others. Is there anything else that needs to be done apart from replacing "tweet_id" with these numbers?

Thanks

Submitted by Tahahossein Movahedi on Tue, 03/23/2021 - 09:33

Hello Tahahossein. Sorry for replying late. I did not see your comment as it was a reply to an earlier thread.

You can always check a tweet using its ID by visiting this URL: http://twitter.com/check/status/tweet_id

The IDs you're mentioning are truncated (you can see that all the IDs end with a couple of zeros on your side). Tweet IDs have 19 digits, and MS Excel does not seem to handle numbers at that precision (so the last 4 digits are truncated and replaced with zeros). You can use Google Sheets to see the original IDs.
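
The precision loss can be reproduced in a few lines of Python. The sketch uses one of the example IDs quoted earlier in this thread; the integer arithmetic only emulates a tool that keeps 15 significant digits and is not how any spreadsheet is actually implemented.

```python
tweet_id = 1245248640421179393  # a 19-digit tweet ID

# A 64-bit float cannot represent every 19-digit integer exactly.
survives_float = int(float(tweet_id)) == tweet_id

# Emulate keeping only 15 significant digits: the last 4 digits become zeros.
truncated_id = tweet_id // 10**4 * 10**4

# Keeping the ID as text avoids the problem entirely.
id_text = str(tweet_id)
url = "http://twitter.com/check/status/" + id_text
```

This is why it is safer to read the ID column as text (strings) than as numbers.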

Submitted by Rabindra Lamsal on Wed, 04/07/2021 - 11:17

Could you add the year to the end of the individual filenames? For example, March1_March2_2021. You are a few days away from potentially having duplicate file names. This would fix that issue for you beforehand. Also, thank you for building this dataset!

Submitted by Jacob Motley on Tue, 03/16/2021 - 21:58

Hello Jacob. Thank you for your comment. The solution for the duplicate issue is planned (a NOTE has already been added in the abstract section of this dataset). Please note that the files after March 20, 2021, will have the "2021_" prefix.

Submitted by Rabindra Lamsal on Wed, 03/17/2021 - 12:58

Are you going to format the older files in a similar way or only those after March 20?

Submitted by Jacob Motley on Tue, 03/23/2021 - 16:01

[update: July 18, 2021] The "2020_" and "2021_" prefixes have been added to the respective files.

Submitted by Rabindra Lamsal on Sat, 07/17/2021 - 23:12

Hi Rabindra,
Thanks for this data. I have downloaded the geo-tagged data. There are two columns, tweet ID and sentiment score, and I'm wondering if I'm missing the geo tags. Is it possible to see the regional location of the tweet, and how fine-grained can it get (country, county, town, etc.)?

Thanks,
Taha

Submitted by Tahahossein Movahedi on Tue, 03/23/2021 - 09:17

Hey, I'm not the author, but if you scroll down in the dataset documentation, the author describes the hydration process. You must use a tool to hydrate the tweets, which will fetch the tweet data for each tweet ID in a file.

Submitted by Jacob Motley on Tue, 03/23/2021 - 16:02

Hi! Thank you for uploading the dataset. october27_october28.csv is missing the sentiment scores. Can you please fix that? Thank you.

Submitted by Himalaya Sheoran on Thu, 06/10/2021 - 06:19

Hello Himalaya. Thank you for reaching out. Yes, the sentiment scores are not available in a couple of files. I've mentioned this on the primary dataset page: https://ieee-dataport.org/open-access/coronavirus-covid-19-tweets-dataset.

"Only the tweet IDs are available (sentiment scores are not available) for the tweets present in the files: corona_tweets_223.csv, corona_tweets_297.csv, corona_tweets_395.csv and the files containing tweets from before March 20, 2020."

The file you are mentioning was obtained from the corona_tweets_223.csv file of the primary dataset. For technical reasons, I had to upload only the tweet IDs for that period. However, you can easily compute the sentiment score for each tweet during hydration. I suggest you use twarc (as a Python library) and pass the "full_text" object through TextBlob or any other sentiment analyzer.
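
The scoring step could be organised roughly as below. To keep the sketch runnable without third-party packages, it uses a toy word-count scorer as a stand-in; the function names, word lists, and sample tweets are all invented for illustration and are not how the dataset's scores were produced.

```python
def score_tweets(tweets, scorer):
    """Yield (tweet ID, score) pairs for an iterable of hydrated tweet dicts."""
    for tweet in tweets:
        # Extended-mode payloads use 'full_text'; fall back to 'text' otherwise.
        text = tweet.get("full_text") or tweet.get("text", "")
        yield tweet["id_str"], scorer(text)


def toy_scorer(text):
    """Hypothetical stand-in scorer: +1 per positive word, -1 per negative word."""
    positive = {"good", "safe", "recovered"}
    negative = {"bad", "sick", "worse"}
    words = text.lower().split()
    return sum(w in positive for w in words) - sum(w in negative for w in words)


# Hand-written sample tweets (not from the dataset).
sample = [
    {"id_str": "1", "full_text": "Feeling good and safe today"},
    {"id_str": "2", "text": "Getting worse"},
]
scores = dict(score_tweets(sample, toy_scorer))
```

With TextBlob installed, a scorer such as lambda text: TextBlob(text).sentiment.polarity could be passed in place of toy_scorer, and the tweets could come from twarc's hydration instead of a hand-written list.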

I hope this helps.

Submitted by Rabindra Lamsal on Thu, 06/10/2021 - 10:27
