COVIDSentiRO contains 19319 Romanian tweets collected between 01.01.2021 and 28.02.2022 using query words related to COVID-19 vaccination. Each tweet has an associated timestamp and is labelled as positive, negative or neutral, using the SART dataset for sentiment analysis.
SART contains 3000 tweets labelled according to the polarity of the sentiment expressed: positive, negative or neutral. The classes are balanced, with 1000 tweets each, and the dataset is split into train, validation and test CSV files.
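
As a quick sanity check on the class balance described above, the labels in each CSV split can be counted. The snippet below is a minimal sketch using only the Python standard library. The column names `text` and `label`, the file name `train.csv` mentioned in the comment, and the toy rows are assumptions made for illustration; they are not taken from the dataset itself.

```python
import csv
import io
from collections import Counter


def count_labels(csv_file, label_column="label"):
    """Count how many rows carry each sentiment label.

    `label_column` is an assumed column name; adjust it to match
    the actual header of the CSV files.
    """
    reader = csv.DictReader(csv_file)
    return Counter(row[label_column] for row in reader)


# Toy stand-in for one split (hypothetical rows, not real data).
# For a real file, pass an open handle instead, e.g.
# open("train.csv", newline="", encoding="utf-8"), where the
# file name is an assumption.
toy_split = io.StringIO(
    "text,label\n"
    "exemplu unu,positive\n"
    "exemplu doi,negative\n"
    "exemplu trei,neutral\n"
    "exemplu patru,positive\n"
)

counts = count_labels(toy_split)
print(counts)
```

Running the same function over the train, validation and test files and summing the resulting counters would give the per-class totals for the whole dataset.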