Skip to main content

Datasets

Standard Dataset

final data 20K

Citation Author(s):
Lulwah AL-Dowihi (Imam Abdulrahman Bin Faisal University)
Fatema Almahdood (Imam Abdulrahman Bin Faisal University)
Basmah Alhotail (Imam Abdulrahman Bin Faisal University)
Maryah Alshimer (Imam Abdulrahman Bin Faisal University)
Sara Aldossary (Imam Abdulrahman Bin Faisal University)
Submitted by:
Lulwah AL-Dowihi
Last updated:
DOI:
10.21227/vdsq-9278
Data Format:
No Ratings Yet

Abstract

Translator   Translator   Translator   

The rise of social media platforms such as Twitter has resulted in a significant increase in spam tweets, which may negatively impact both individual and platform providers. In this study, we propose an automated spam detection on Arabian Gulf Dialect Using Machine Learning Techniques to classify tweets as spam or legitimate. This research presents a machine learning-based technique for detecting spam on Twitter in the Arabian Gulf dialect. Support Vector Machines (SVM), Random Forest (RF), and Bidirectional Encoder Representations from Transformers (MarBERT) models have been specifically used for an Arabian Gulf dialect tweet dataset. The accuracy, precision, recall, and F1-score of the three models were used to evaluate their performance. The SVM model outperformed the RF and MarBERT models with 96% accuracy.

Instructions:

Translator   Translator  1- open Exel 2- select  Data 3-select get the data 4- select  fom file 5-select  workbook Translator