Dataset for Stock Market Prediction

Citation Author(s):
Rabia
Irfan
National University of Sciences & Technology
Submitted by:
Umara Umar
Last updated:
Mon, 07/08/2024 - 15:59
DOI:
10.21227/e29h-5c78
Data Format:
License:
0
0 ratings - Please login to submit your rating.

Abstract 

For the purpose of experimentation, the historical stock prices of three petroleum companies: Pakistan State Oil (PSO), Hascol, and Attock Petroleum Limited (APL), are extracted from the Pakistan Stock Exchange (PSX) website through a web scrapper for the last four years. Different attributes related to the stocks of each of these companies are extracted for each day. Along with this, for each of these companies, Twitter data for sentiment analysis is also extracted using Twint.

Instructions: 

The historical stock prices of three petroleum companies: Pakistan State Oil (PSO), Hascol, and Attock Petroleum Limited (APL), are extracted from the Pakistan Stock Exchange (PSX) website through a web scrapper for the last four years. Different attributes related to the stocks of each of these companies are extracted for each day. We extract attributes like total volume of trade for a day, high price, close price, low price, and open price. Along with this, for each of these companies, Twitter data for sentiment analysis is extracted using Twint, which is an open-source web scraper written in Python. It offers to scrape unlimited tweets despite of limitations of Twitter API. After tweets extraction, we need the user’s profile-based attributes and tweet-based attributes to calculate the composite influence score of each user. Tweet ID retrieved from Twint is used to extract all these attributes using Twitter API ‘Tweepy’.

Comments

I am student of NUST and need to access this dataset for my research purpose.

Submitted by Talha Saeed on Tue, 09/03/2024 - 03:11

I am student of RIT and need to access this dataset for my research purpose.

Submitted by MUNENDHIRA S.N on Thu, 10/10/2024 - 05:39