Skip to main content

Datasets

Standard Dataset

Telugu Conversational Dataset for Sarcasm Detection

Citation Author(s):
Ravi Teja Gedela (NIT Silchar)
Ujwala Baruah (NIT Silchar)
Badal Soni (NIT Silchar)
Submitted by:
Ravi Gedela
Last updated:
DOI:
10.21227/s0yz-en95
Data Format:
No Ratings Yet

Abstract

Sentiment analysis, which aims to identify the positive or negative tone of a given text, has seen a surge in interest over the past two decades, making it one of the most studied areas of study in the fields of Natural Language Processing and Information Extraction. Due to the ambiguous nature of sarcasm, however, sarcasm detection is an essential part of sentiment analysis. The task becomes exceedingly challenging when applied to a language with a more intricate morphology and a lack of available resources, such as Telugu. Collecting appropriate and well-annotated corpora is the main challenge in this area of study. In this work, we developed a Telugu dataset of 10,000 conversations, of which 5,000 are sarcastic, and the remaining 5,000 are not. Sarcastic conversations have been collected from various television comedy shows like Jabardasth, Extra Jabardasth, Pataas, etc., and various internet resources. At the same time, non-sarcastic conversations are collected from serials, movies, and celebrity interviews. Three experts, who are teachers and practitioners, have annotated the collected dataset.

Instructions:

Please cite in order to use the dataset.

I am currently engaged in an educational project that requires access to datasets for analysis and research purposes. Your dataset seems to align perfectly with the objectives of my project. I would be incredibly grateful if you could grant me access to your dataset, as it would greatly contribute to my educational endeavors. Rest assured, I will handle the data responsibly and in accordance with any privacy or usage agreements you may have. Thank you very much for considering my request. I look forward to hearing from you soon. Best regards,
Selim Rachdi Tue, 02/27/2024 - 13:27 Permalink

I am currently engaged  in the project which require the telugu sentiment dataset which align perfectly for my project and reasearch. It would be grateful if you grant access for your dataset. I will handle the data responsibly and in accordance with any privacy or usage agreements you may have. Thank you very much for considering my request. I look forward to hearing from you soon. Best regards,

AVULA MOURYA REDDY Tue, 08/06/2024 - 11:54 Permalink

we are doing a translation based project and would require a dataset for the same kindly provide 

shail garg Wed, 10/02/2024 - 13:18 Permalink

Please give me access to the dataset as it is required for my research project.I will definitely cite your dataset and follow all the privacy and copyright agreements.

Angana Chakraborty Sat, 01/18/2025 - 04:10 Permalink

Dataset Files

Files have not been uploaded for this dataset

DOCUMENTATION