Hindi (Devanagari Script)

The Comprehensive Hindi Hostile Post Detection Dataset (CM-HTHPD) is a collection of Twitter posts written in Hindi, covering various forms of hostile content. The posts were gathered using the Twitter Developer API and then manually annotated with sentiment labels on the Label Studio platform. The dataset is primarily intended to support research on hostile content detection and sentiment analysis in Hindi-language social media discourse. It contains approximately 8,300 posts.
