Skip to main content

Datasets

Standard Dataset

SART

Citation Author(s):
Alexandra Ciobotaru (University of Bucharest, Faculty of Mathematics and Computer Science)
Submitted by:
Alexandra Ciobotaru
Last updated:
DOI:
10.21227/5fnc-tk84
Data Format:
No Ratings Yet

Abstract

SART contains 3000 tweets labelled with respect to the polarity of the sentiment expressed: positive, negative or neutral. Each class contains 1300 tweets and the dataset is split into train/validation/test csv files.

Instructions:

# SART - Sentiment Analysis from Romanian Tweets

This dataset contains tweets in Romanian labelled with: 0 (Negative), 1 (Neutral) and 2 (Positive). Each class contains 1300 tweets and the dataset is split into train/validation/test csv files: 3120 tweets for training, 390 tweets for validation and 390 tweets for testing.

| Class Name | No. of labelled tweets |
| ------- | --- |
| Negative | 1300 |
| Positive | 1300 |
| Neutral | 1300 |

To protect confidentiality of Twitter users, we removed usernames from this dataset. 

Dataset Files

Files have not been uploaded for this dataset

More from this Author