Datasets
Standard Dataset
Hotel review dataset
- Citation Author(s):
- Submitted by:
- Qiang Lu
- Last updated:
- Fri, 10/18/2019 - 14:04
- DOI:
- 10.21227/b7hp-ps55
- Data Format:
- License:
743 Views
- Categories:
- Keywords:
0 ratings - Please login to submit your rating.
Abstract
Chinese Hotel Review Dataset
Instructions:
In order to test the model, two data sets were used, which were collected by Tsinghua University Chinese Hotel Comment Corpus and Professor Tan Songbo. The Sentiment Dictionary uses the Chinese Sentiment Vocabulary Ontology Library of DUTIR. There are 22012 Emotional Vocabularies, which are divided into 7 categories and 21 subcategories. Each category of vocabulary has the sentiment polarity score, including 11229 positive sentiment words and 10783 negative sentiment words. Negative words and degree words are in HowNet Chinese Negative Words Dictionary and Degree Words Dictionary. There are 59 negative words and 219 degree words.
Dataset Files
- Hotel Review Dataset.zip (2.73 MB)
- All dictionaries.zip (158.37 kB)
Comments
tks