Multi-Label Extremism and Jihadism Classification Tweets Dataset

Citation Author(s):
Mahamodul Hasan
Mahadi
Md. Nasif
Safwan
Submitted by:
Mahamodul Hasan...
Last updated:
Fri, 08/30/2024 - 08:47
DOI:
10.21227/6gmh-1b80
Data Format:
License:
0
0 ratings - Please login to submit your rating.

Abstract 

The "Multi-Label Extremism and Jihadism Classification Tweets Dataset" dataset is a multilingual resource designed for multi-label classification of online extremism and toxic behavior, including extremism and jihadism. Each comment is annotated with labels indicating the presence of various extremism traits: toxic, severe toxic, obscenity, threats, insults, identity hate, and jihadi content. This dataset is valuable for research in automated content moderation, enabling the detection of harmful and extremist content across multiple languages, and contributing to the development of safer online environments by providing a diverse array of real-world examples.

Comments

The "Terrorism and Multi-Toxic Labels Classification" dataset is a multilingual dataset curated to assist in the development and evaluation of models aimed at detecting online extremism and toxic behaviors. This dataset is particularly suited for tasks involving multi-label classification, where each comment may exhibit multiple forms of extremism and toxicity.

Submitted by Mahamodul Hasan... on Fri, 08/30/2024 - 08:46