Bangla SMS Dataset for Smishing Detection

Citation Author(s):
Gazi
Tanbhir
MD. Farhan
Shahriyar
Submitted by:
Gazi Tanbhir
Last updated:
Mon, 08/26/2024 - 03:52
DOI:
10.21227/vxz9-ak04
Data Format:
License:
0
0 ratings - Please login to submit your rating.

Abstract 

 

This dataset comprises 2,287 Bangla text SMS messages, categorized into three classes: Normal, Smish, and Promotional. Collected via an online survey and validated by a cybersecurity expert, the dataset supports research in detecting smishing—a form of phishing via SMS. Each message is meticulously labeled to facilitate the development and evaluation of machine learning models aimed at identifying cyber threats within Bangla SMS communications. This dataset is a valuable resource for advancing cybersecurity measures, particularly in protecting Bangla-speaking users from smishing attacks.