Datasets
Standard Dataset
numerical data
- Citation Author(s):
- Submitted by:
- Majed Aborokbah
- Last updated:
- Mon, 07/08/2024 - 15:58
- DOI:
- 10.21227/w9b3-x241
- Data Format:
- License:
- Categories:
- Keywords:
Abstract
The focus has been on investors’ hopes for the stock market for a considerable amount of time.Becauseoftheanticipatedhighreturns,thisisthepreferredinvestmentoption.However,dueto the significance of accurate forecasting, such investments are high-risk. In order to analyze stock market forecasts, investors utilize a technical analyst and AI technologies. Predictions and forecasts may be impacted by several obstacles, including analyst bias towards specific firms, investor pro-clivities, the impactof corporate news and economic circumstances, and the reliance on human skills. One LSTM trained specifically on digital data was used to forecast the future share price of the company by analyzing the company’s stock data; its performance was evaluated using the mean squared error metric (MSE), which revealed an accuracy of 98%. The second model is the Support Vector Machine (SVM), which uses a calculation process that gives weight to the number of likes of the tweet, the number of retweets, and the number of followers of the person who tweeted this tweet, in addition to analysis of the text of the tweet, topredictthepositivistornegativistnatureofthetweetbasedonitsweight.Theperformanceofthemodel wasmeasuredbytheaccuracyratio,anditwasfoundthatthetextanalysiswasaccuratebyafactorof99%. Thus,acombinationoftextanddigitaldatawasusedtocreateaninteractivesitewhereuserscanestimate thedailyperfor-manceoftheStocksofselectedfirmsbydraggingTwittertweetsandmarketopeningdata. In the end, our research aids investors in making accurate share price predictions and limiting their losses.
Forex Data.csv
Digital data file withdrawn from Saudi Tadawul platform with data of six companies over ten years
price_model.h5
File keeps model training through which user input is tested
Text Data:
filter_with_weights.csv
Text data file withdrawn through Twitter platform for six Saudi companies with 200,000 records over ten years
pipeline.h5
File keeps model training through which user input is tested
Final Code:
Algorithm testing.ipynb
The final code file through which user tests are performed and results are displayed
Raw data used for testing
Etihad_Etisalat100day.csv
Etihad_EtisalatN.csv
Dataset Files
- Saudi Stock market numirical data Forex Data.csv (775.01 kB)
- Sentiment Data filter_with_weights.csv (29.62 MB)
- test data Etihad_Etisalat100day.csv (191.03 kB)