Data Set MR and sst-Binary

Name: Data Set MR and sst-Binary
Creator: Jinlan Chen
License: https://creativecommons.org/licenses/by/4.0/
Keywords: Other

Citation Author(s): Jiajing Zhang (Anhui Jianzhu University)
Submitted by: Jinlan Chen
Last updated: Sat, 12/30/2023 - 15:04
DOI: 10.21227/bp27-xy39

Categories:

Other

Keywords:

movie reviews;Stanford Sentiment Treebank

Abstract

MR is a text dataset of movie reviews for binary sentiment classification; each review is a single sentence. The corpus contains 5,331 positive and 5,331 negative reviews, with an average length of 20.39 tokens. SST-2 (SST-Binary) is the subset of the Stanford Sentiment Treebank in which each item is labeled positive or negative; it contains 9,613 utterances with an average length of 20.32 tokens.
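
The class counts and average token lengths quoted above can be re-derived from the raw files. The sketch below is only an illustration: it assumes each line has the layout "label<TAB>sentence" and that tokens are separated by whitespace, neither of which is confirmed by this page. The function names parse_line and corpus_stats and the two sample sentences are hypothetical.

```python
from collections import Counter
from typing import Iterable, Tuple


def parse_line(line: str) -> Tuple[str, str]:
    """Split one 'label<TAB>sentence' line into (label, sentence).

    The tab-separated layout is an assumption, not documented by the dataset page.
    """
    label, text = line.rstrip("\n").split("\t", 1)
    return label, text


def corpus_stats(lines: Iterable[str]) -> Tuple[Counter, float]:
    """Return per-label counts and the mean number of whitespace tokens per sentence."""
    labels: Counter = Counter()
    total_tokens = 0
    n_sentences = 0
    for line in lines:
        if not line.strip():
            continue  # skip blank lines
        label, text = parse_line(line)
        labels[label] += 1
        total_tokens += len(text.split())
        n_sentences += 1
    mean_len = total_tokens / n_sentences if n_sentences else 0.0
    return labels, mean_len


# Made-up sample lines for illustration only; they are not taken from MR or SST-2.
sample = [
    "1\ta gripping , beautifully shot film\n",
    "0\tdull and overlong\n",
]
counts, mean_len = corpus_stats(sample)
print(counts, mean_len)
```

To run this on the real data, pass an open file handle instead of the sample list. If the files use a different layout or tokenization, the resulting averages may differ slightly from the figures in the abstract.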

Instructions:

The data come from https://github.com/FKarl/short-text-classification/tree/main/data