Bitcoin Tweets 2022

Citation Author(s): Krishna Kumari
Submitted by: Shihab Muhtasim
Last updated: Wed, 04/16/2025 - 13:26
DOI: 10.21227/tgeq-1e59
Data Format:
License:

Abstract 

Bitcoin (₿) is a cryptocurrency invented in 2008 by an unknown person or group of people using the pseudonym Satoshi Nakamoto. The currency came into use in 2009, when its implementation was released as open-source software.

Bitcoin is a blockchain-based decentralized digital currency, without a central bank or single administrator, that can be sent from user to user on the peer-to-peer bitcoin network without the need for intermediaries. Transactions are verified by network nodes through cryptography and recorded in a public distributed ledger called a blockchain. Bitcoins are created as a reward for a process known as mining. They can be exchanged for other currencies, products, and services.

I am sharing the Bitcoin Tweets dataset with the research community. It is a large collection of tweets gathered using TrackMyHashtag and consists of 337,701 tweets about bitcoin, each identified by its tweet ID, that were posted on Twitter from 15 September 2022 to 17 September 2022.

The dataset was collected using TrackMyHashtag, an easy and affordable platform.

Several international events that affected bitcoin occurred during the collection period, which may make this dataset interesting to analyze.

Each tweet record contains the following types of data:

  • Tweet Id
  • Tweet URL
  • Posted time and date
  • Tweet Content
  • Other metadata

Instructions:

The “Bitcoin Tweets 2022” dataset is provided as multiple CSV files. Each file contains a subset of the tweets collected between September 15, 2022 and September 17, 2022, using the TrackMyHashtag platform.

To use the dataset properly:

  1. Concatenate All CSV Files Row-Wise

    • Each CSV file shares the same schema (i.e., same column headers).
    • Users should concatenate all the CSVs row-wise to create a complete dataset.
    • This ensures the final dataset includes all 337,701 tweets.

  2. Explore the Data

    • Each row corresponds to a single tweet and includes fields such as:
      • Tweet ID
      • Tweet URL
      • Date and Time Posted
      • Tweet Text Content
      • User Metadata and Engagement Metrics (if available)
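
The two steps above can be sketched with the Python standard library. This is only an illustration, not the author's own tooling. It assumes the CSV files sit together in one folder and are UTF-8 encoded. The function names `load_tweets` and `summarize` are hypothetical, and because the exact column headers are not listed on this page, the code reads them from the files instead of hard-coding them.

```python
import csv
from collections import Counter
from pathlib import Path

# Total number of tweets stated in the dataset description.
EXPECTED_ROWS = 337_701


def load_tweets(data_dir):
    """Concatenate every CSV file in data_dir row-wise.

    Returns (header, rows), where rows is a list of dicts keyed by column name.
    Raises ValueError if a file's header differs from the first file's header.
    """
    header = None
    rows = []
    for path in sorted(Path(data_dir).glob("*.csv")):
        # Assumption: files are UTF-8; "utf-8-sig" also tolerates a leading BOM.
        with path.open(newline="", encoding="utf-8-sig") as handle:
            reader = csv.DictReader(handle)
            if header is None:
                header = reader.fieldnames
            elif reader.fieldnames != header:
                raise ValueError(
                    f"{path.name} has a different header: {reader.fieldnames}"
                )
            rows.extend(reader)
    return header or [], rows


def summarize(header, rows):
    """Return the row count and the number of non-empty values per column."""
    filled = Counter()
    for row in rows:
        for column in header:
            if (row.get(column) or "").strip():
                filled[column] += 1
    return {"rows": len(rows), "filled": {c: filled[c] for c in header}}
```

After calling `header, rows = load_tweets(...)` on the folder that holds the downloaded files, comparing `len(rows)` with `EXPECTED_ROWS` is a quick check that no file was missed, and `summarize(header, rows)` shows which of the optional metadata columns are actually populated.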

Data Descriptor Article DOI: